May 28, 2009
As we reported yesterday, the five venture capital firms supporting Linux cluster vendor SiCortex have pulled their funding, forcing the Massachusetts-based company to shut down its operations. As the company prepares to sell off its assets, there are said to be only a handful of employees remaining. Unless a buyer comes in that is willing to take over the business more or less intact, support for dozens of SiCortex systems currently deployed at user sites will come to an abrupt end.
As of today, the company had 37 customers listed on its Web site, including such big names as Argonne National Lab, MIT, NASA, Karlsruhe Institute of Technology, Chevron, and General Electric. More than 20 universities had also purchased SiCortex gear. Undoubtedly, some of these customers will be able to migrate their HPC applications onto spare capacity as they phase out their orphaned SiCortex machines. For others, though, the transition is going to be a very painful one.
I spoke with a university customer who had purchased a mid-range SiCortex cluster, which is the institution's newest HPC platform. The system administrator there, who asked to remain anonymous, was wondering what they were going to do without vendor support. "This really puts us in a huge bind," he told me.
According to him, the university shelled out around $150,000 for the SiCortex cluster, the largest investment ever made at this particular lab. They had planned for the system to be their HPC platform of the future, augmenting the lab's aging x86-based clusters. SiCortex was chosen because of space and power constraints at the datacenter, and because university researchers there were working on computational models that scaled extremely well on high core-count architectures. Compared to conventional x86 clusters, SiCortex systems use greater numbers of less powerful MIPS cores to deliver some of the best performance-per-watt metrics in the industry.
The SiCortex design has garnered a lot of critical acclaim, but according to my system admin source, it hasn't been exactly smooth sailing with the company's hardware. He said that while SiCortex's support and service have been "outstanding," the machines themselves have had their problems. The first system purchased by the university, the SC648, didn't live up to its advertised performance. The company ended up swapping it for the more powerful SC1458 machine. However, even this system has been trouble. According to the system administrator, during the 16 to 18 months the system has been running, they have not gone four consecutive months without some kind of hardware failure.
Currently two of the 235 compute nodes on the system are down. While that is probably not an outlandish failure rate considering the size of the cluster, without vendor support, the lifetime of such a machine will be greatly reduced. The real fear is that a critical failure could occur at any time, rendering the system totally worthless.
One unfortunate consequence of SiCortex's demise is that, for a while at least, there will be no high core-count HPC platform in the price range of a mid-sized cluster with the performance efficiency of an IBM Blue Gene. Sometime in the first half of 2010, you should be able to get an 8-socket, 8-core Nehalem-EX server that supports 128 threads. A dozen of these servers would provide over 1,500 threads, more or less equivalent to a 1,458-core SiCortex machine, at least thread-wise. But the price and power consumption (not to mention performance) of a Nehalem cluster of this size are likely to be a good deal higher than those of the equivalent SiCortex system.
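For readers who want to check the thread arithmetic above, here is a minimal sketch. It assumes two hardware threads per core, which is what makes an 8-socket, 8-core server come out to 128 threads; the function and constant names are purely illustrative.

```python
# Back-of-the-envelope thread counts for the comparison above.
# Assumption: each core exposes 2 hardware threads, which is what
# yields the 128 threads per server quoted in the text.
SOCKETS_PER_SERVER = 8
CORES_PER_SOCKET = 8
THREADS_PER_CORE = 2  # assumed, not stated in the article

SICORTEX_CORES = 1458  # core count of the SC1458


def threads_per_server(sockets: int, cores: int, threads: int) -> int:
    """Total hardware threads in one server."""
    return sockets * cores * threads


def servers_needed(target_threads: int, per_server: int) -> int:
    """Smallest number of servers whose combined threads reach the target."""
    return -(-target_threads // per_server)  # ceiling division


per_server = threads_per_server(
    SOCKETS_PER_SERVER, CORES_PER_SOCKET, THREADS_PER_CORE
)
dozen_total = 12 * per_server

print(per_server)                                  # 128
print(dozen_total)                                 # 1536
print(servers_needed(SICORTEX_CORES, per_server))  # 12
```

This only matches thread counts. It says nothing about per-thread performance, price, or power draw, which is where the two platforms are expected to differ.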
The broader tragedy is that the company appeared headed for commercial success. According to my last conversation with the company in April, the business had achieved record growth in Q1, and had built a big pipeline of customers for the remainder of 2009. SiCortex was already pitching its more powerful next-generation systems (more cores and faster processors). If the VCs hadn't blinked, the company might have begun to turn a profit this year or next, despite the economic downturn.
The result of SiCortex's demise is that HPC users will likely become even more conservative in their choice of cluster vendors. If they can. For the university in this story, there are no easy answers since the funds may not be there to replace the SC1458 system anytime soon. Even if they were, the researchers were getting used to running their apps on a high core-count machine. "We'll figure something out," said the system admin. "But in the meantime, my scientists are freaking."
-----
[ UPDATE: For a couple of great perspectives from former employees at SiCortex, check out Matt Reilly's blog here and Jeff Darcy's comments on his Web site.]
Posted by Michael Feldman - May 28 @ 3:28PM, Pacific Daylight Time
There is 1 discussion item posted.
SiCortex / Betamax
Submitted by KevinButerbaugh on 05/29/2009 - 8:17AM
Actually, the subject line pretty much sums it up. On one hand, I feel sorry for the SysAdmin and his / her researchers. On the other hand, they bought a Betamax VCR (I'm showing my age - I know!) after the handwriting was already on the wall and now they're simply paying the price for that decision.
Kevin
P.S. I find it amusing that just to the left of the comment box I'm typing this in is a SiCortex ad...
Post #1
Michael Feldman is the editor of HPCwire.