HPC Matters is a joint blog in which contributors from the Tabor Communications team share their observations and insights into HPC matters.
December 03, 2010
The Air Force Research Laboratory (AFRL) marked the launch of its Condor PS3 Cluster with a formal ribbon-cutting ceremony on December 1. Initially unveiled last year, the cluster is made up of 1,760 Sony PlayStation 3 processors and 168 general-purpose graphics processing units, providing an estimated 500 teraflops of performance. That places Condor among the 50 fastest systems in the world. The price tag? A mere $2 million.
Mark Barnell, director of AFRL's High Power Computing, explained that comparable systems would cost at least $20 million to $40 million.
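To put those figures side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted above; the one assumption is that a "comparable" system would deliver the same estimated 500 teraflops, which the article implies but does not state. The function and variable names are purely illustrative.

```python
# Figures quoted in the article.
CONDOR_COST_USD = 2_000_000
CONDOR_TERAFLOPS = 500  # estimated performance
COMPARABLE_COST_RANGE_USD = (20_000_000, 40_000_000)


def cost_per_teraflop(cost_usd, teraflops):
    """Return dollars spent per teraflop of estimated performance."""
    return cost_usd / teraflops


condor = cost_per_teraflop(CONDOR_COST_USD, CONDOR_TERAFLOPS)

# Assumption: a comparable system also delivers about 500 teraflops.
comparable = tuple(
    cost_per_teraflop(cost, CONDOR_TERAFLOPS)
    for cost in COMPARABLE_COST_RANGE_USD
)

# How many times more expensive the comparable systems would be.
cost_ratio = tuple(cost / CONDOR_COST_USD for cost in COMPARABLE_COST_RANGE_USD)

print(f"Condor: ${condor:,.0f} per teraflop")
print(f"Comparable: ${comparable[0]:,.0f} to ${comparable[1]:,.0f} per teraflop")
print(f"Cost ratio: {cost_ratio[0]:.0f}x to {cost_ratio[1]:.0f}x")
```

Under that assumption, Condor works out to roughly $4,000 per teraflop, against $40,000 to $80,000 for a conventional system, a 10x to 20x difference.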
Sony sells the systems at a loss with the aim of recouping the money by selling expensive games and gaming accessories, such as memory sticks. In the process, the company is inadvertently subsidizing the military and anyone else who wants to use the gaming hardware for scientific purposes.
This is not your parents' Pong, that's for sure. Instead it's probably the biggest supercomputing bang you can get for your buck. The Air Force Research Lab took a really sophisticated gaming system that uses the power of the cutting-edge Cell processor to boost speed, and put it to work running scientific applications. That's the kind of outside-the-box thinking that can lead to great success or outright failure. But in this case, it is working out very well, as Barnell explains in an article at Airman Magazine:
"By using the cell processors in the PS3s and the GPGPUs in unison, we've produced a system that does a very good job at handling this kind of [surveillance] information. We've developed the most powerful heterogeneous supercomputer in the world for a fraction of the cost of building it using individual chips and servers."
Barnell has also stated that the Condor Cluster is the DoD's most powerful "interactive" supercomputer. He explains what this means in a Q&A at SmartPlanet.com:
From the perspective of most of the supercomputing centers in the DoD, when millions or tens of millions of dollars are invested, you don't want to waste cycles. So these computers are run in what is called "batch mode." They keep these systems running at very high levels, all of the time, so the applications that use them are carefully managed and optimized.
On this computer, we're not tied to these metrics, mainly because it was so inexpensive. We do a lot of research and development on this system, so we start with only a few nodes, make sure [the software] works, and scale up from there. We have a lot of users [in the DoD] who, when they're actually developing code, have a tendency to hang a machine or two. When you do that, the computer becomes ineffective until we reboot it.
It should be noted that Condor uses the old PS3 systems, not the new PS3 Slims. Sony has decided to stop supporting Linux, so any system that receives a firmware upgrade can no longer run the Linux OS. The systems can't even be sent in for repair, because the mandatory upgrade would render them useless to AFRL. Sony could choose to reverse the policy, and with all the publicity around the Air Force's new wunder-cluster, it may just change its mind.
In the meantime, the AFRL is using the system for targeted applications such as neuromorphic artificial intelligence research, synthetic aperture radar enhancement, image enhancement and pattern recognition research. For further details on these interesting projects, check out coverage from DVIDS.
The Condor Cluster will be available to all DoD users on a shared basis. It uses less than one-tenth the power of a comparable system, making it cost-effective and green.
Posted by Tiffany Trader - December 03, 2010 @ 4:47 PM, Pacific Standard Time
Tiffany Trader is the editor of HPC in the Cloud. With a background in HPC publishing, she brings a wealth of knowledge and experience to bear on a range of topics relevant to the technical cloud computing space.