The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
From the Editor | Main Blog Index
May 28, 2009
As we reported yesterday, the five venture capital firms supporting Linux cluster vendor SiCortex have pulled their funding, forcing the Massachusetts-based company to shut down its operations. As the company prepares to sell off its assets, there are said to be only a handful of employees remaining. Unless a buyer comes in that is willing to take over the business more or less intact, support for dozens of SiCortex systems currently deployed at user sites will come to an abrupt end.
As of today, the company had 37 customers listed on their Web site, including such big names as Argonne National Lab, MIT, NASA, Karlsruhe Institute of Technology, Chevron, and General Electric. More than 20 universities had also purchased SiCortex gear. Undoubtedly, some of these customers will be able to migrate their HPC applications onto spare capacity as they phase out their orphaned SiCortex machines. For others though, the transition is going to be a very painful one.
I spoke with a university customer who had purchased a mid-range SiCortex cluster, where it represents the institution's newest HPC platform. The system administrator there, who requested to remain anonymous, was wondering what they were going to do without vendor support. "This really puts us in a huge bind," he told me.
According to him, the university shelled out around $150,000 for the SiCortex cluster, the largest investment ever made at this particular lab. They had planned for the system to be their HPC platform of the future, where it was supposed to augment the lab's aging x86-based clusters. SiCortex was chosen because of space and power constraints at the datacenter, and because university researchers there were working on computational models that scaled extremely well for high core-count architectures. Compared to conventional x86 clusters, SiCortex systems use greater numbers of less powerful MIPS cores to deliver some of the best performance per watt metrics in the industry.
The SiCortex design has garnered a lot of critical acclaim, but according to my system admin source, it hasn't been exactly smooth sailing with the company's hardware. He said while the SiCortex's support and service has been "outstanding," the machines themselves have had their problems. The first system purchased by the university, the SC648, didn't live up to its advertised performance. The company ended up swapping the system with the more powerful SC1458 machine. However, even this system has been trouble. According to the system administrator, during the 16 to 18 months the system has been running, they have not gone for a consecutive four month span without some kind of hardware failure.
Currently two of the 235 compute nodes on the system are down. While that is probably not an outlandish failure rate considering the size of the cluster, without vendor support, the lifetime of such a machine will be greatly reduced. The real fear is that a critical failure could occur at any time, rendering the system totally worthless.
One unfortunate consequence of SiCortex's demise is that, for awhile at least, there will be no high core-count HPC platform in the price range of a mid-sized cluster, and with the performance efficiency of an IBM Blue Gene. Sometime in the first half of 2010, you should be able to get an 8-socket, 8-core Nehalem-EX server that supports 128 threads. A dozen of these servers would provide over 1,500 threads, more or less equivalent to a 1,458-core SiCortex machine, at least thread-wise. But the price and power consumption (not to mention performance) of a Nehalem cluster of this size are likely to be a good deal more than the equivalent SiCortex system.
The broader tragedy is that the company appeared headed for commercial success. According to my last conversation with the company in April, the business had achieved record growth in Q1, and had collected a big pipeline of customers for the remainder of 2009. SiCortex was already pitching its more powerful next-generation systems (more cores and faster processors). If the VCs hadn't blinked, the company may have begun to turn a profit this year or next, despite the economic downturn.
The result of SiCortex's demise is that HPC users will likely become even more conservative in their choice of cluster vendors. If they can. For the university in this story, there are no easy answers since the funds may not be there to replace the SC1458 system anytime soon. Even if they were, the researchers were getting used to running their apps on a high-core count machine. "We'll figure something out," said the system admin. "But in the meantime, my scientists are freaking."
-----
[ UPDATE: For a couple of great perspectives from former employees at SiCortex, check out Matt Reilly's blog here and Jeff Darcy's comments on his Web site.]
Posted by Michael Feldman - May 28 @ 3:28PM
(Digg, Technorati, more)
There are 1 discussion items posted.
SiCortex / Betamax
Submitted by KevinButerbaugh on 05/29/2009 - 8:17AM
Actually, the subject line pretty much sums it up. On one hand, I feel sorry for the SysAdmin and his / her researchers. On the other hand, they bought a Betamax VCR (I'm showing my age - I know!) after the handwriting was already on the wall and now they're simply paying the price for that decision.
Kevin
P.S. I find it amusing that just to the left of the comment box I'm typing this in is a SiCortex ad...
Post #1
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Michael Feldman is the editor of HPCwire.
More Michael Feldman
Re: Multicore Watershed by Nastyanna
HPC? not so much by ewahl
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Cray Corrals Big Defense Deal by watchesuk
We think by watchesuk
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Lena by Nastyanna
Lena by Nastyanna
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Re: AMD Confirms 12-Core Opteron Production by Nastyanna
Re: Cray Corrals Big Defense Deal by Nastyanna
Re: Podcast: Cray Awarded Defense Deal; SGI Makes Storage Buy; IBM Invents New Algorithm by Nastyanna
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
a third vendor for lossless low latency 10GbE fabric by lee.fisher@hp.com
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
563 out of 1206 by vvolkov
Little Iron by gadunk
At least it's not "cloud" by KevinButerbaugh
Native QPI Interface? by commike
Mmmmmm by hellcats
New transistorized IC chip scales. by symmecon
Itanium at IDF by Alan M. Feldstein
Communication time by jnapper
"The financial meltdown and computing" by donpellegrino
Human Models by mdgabriel
High-End SPARC Chip for Scientific Applications by Alan M. Feldstein
RapidMind by Mr LolO
Rapidmind by dminor
Longer run times by JohnWest
re: Algo trading Angst by jshore
Results of Testing by in_the_crease
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
The ACM Turing Award goes to the creator of the modern personal computer; and Voltaire announces a mid-range InfiniBand switch and new technology that accelerates distributed applications. We recap those stories and more in our weekly wrapup.
Read More...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Mar 16 | Bio-IT World | Biotech firm builds genetic models from patient data. Read more...
Mar 15 | The Register | EMC's grand vision for unified global storage. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html