Visit additional Tabor Communication Publications
December 15, 2010
This week, the Partnership for Advanced Computing in Europe (PRACE) announced its third petascale supercomputer for the organization's Tier 0 research infrastructure. The upcoming machine, known as SuperMUC, will be built by IBM and is estimated to deliver 3 peak petaflops when it is deployed in 2012 at the Leibniz Supercomputing Centre (LRZ) in Garching, Germany.
SuperMUC will follow the 1.0 petaflop "JUGENE" (Jülich Blue Gene" Blue Gene/P supercomputer, already in service at Forschungszentrum Juelich (FZJ), and the 1.25 petaflop Bull-built "Curie" system at the Commissariat a l'Energie Atomique (CEA) in France. Those machines currently hold the 9 and 6 spots, respectively on TOP500 list. When SuperMUC is installed at LRZ in the middle of 2012, it too will likely be a top 10 system, although by then all ten machines should be operating in the multi-petaflop range.
All Tier 0 machines will support PRACE's mission to provide a pan-European HPC research infrastructure for scientific computing. As of June 2010, four of the twenty member nations have anted up 100 million Euros apiece to fund supercomputer deployment and operation over the next five years. The goal to field as many as six of these petascale systems for Europe during this time. With the tri-petaflop system from IBM, they're halfway there.
The SuperMUC system headed for LRZ will use IBM's iDataPlex System X platform, and will incorporate Intel's next-generation Xeon processors. Most likely that means SuperMUC will be sporting Sandy Bridge Xeons, given that these are next up on the Intel server processor roadmap.
The next-gen Xeons are scheduled to be released in Q3 2011 (Sandy Bridge EP) and Q4 2011 (Sandy Bridge EX), which should provide plenty of time for a mid-2012 system deployment. SuperMUC will incorporate more than 14,000 of these future chips, although the exact core count is still under wraps. Sandy Bridge Xeons will come in 4-core, 6-core, and 8-core flavors, so we can assume the system will have at least 56,000 x86 cores.
Storage-wise, SuperMUC will hook into 10 petabyte file system based on IBM's GPFS. The GPFS storage system is spec'ed to deliver 200 GB/second of aggregate I/O bandwidth. A two-petabyte NAS storage system, with 10 GB/second of bandwidth, will also be available. Aggregate RAM storage is on the order of 384 terabytes.
Besides next-gen Xeons, SuperMUC will also employ a number of other newer technologies. First, SuperMUC will use FDR (Fourteen Data Rate) InfiniBand as the cluster interconnect, technology which is expected to be in the field by 2011. But the system's most significant innovation is its novel hot water cooling system pioneered by IBM with its Aquasar supercomputer located at ETH Zurich.
The advantages of water over air as a cooling medium are considerable. IBM says the system will consume 40 percent less energy than a comparable air-cooled machine. According to Klaus Gottschalk, IBM's lead HPC architect for the system, the processors and other components in the supercomputer will be cooled with water up to 60 degrees C (140 degrees F). The cooling system itself is comprised of micro-channel liquid coolers which are attached directly to the processors, where most heat is generated.
"With this chip-level cooling, the thermal resistance between the processor and the water is reduced to the extent that even cooling water temperatures of up to 60 degrees C ensure that the operating temperatures of the processors remain well below the maximally allowed 85 degrees C," explains Gottschalk. "The high input temperature of the coolant results in an even higher-grade heat at the output, which in this case is up to 65 degrees C."
SuperMUC also represents the first implementation of an energy aware HPC software stack on x86, says Gottschalk. Application energy consumption will be monitored, stored and reported to the user. When an application is ready to run, the scheduler will decide which processor frequency is optimal for the application, based on administrative policies. System nodes not in use will be put in sleep mode, or if capacity expectations warrant, shut down entirely.
The rationale, of course, is to reduce power consumption as much as possible. Although IBM and PRACE are not revealing SuperMUC's expected power draw, for a 3-petaflop supercomputer based on x86 CPUs, it's apt to be considerable. And in Europe, where energy costs tend to be even higher than in the US, power is going to be a driving consideration for these big PRACE systems.
The price tag for SuperMUC, which includes power and other operational costs for five or six years, is 83 million Euros. That doesn't include the additional 50 million Euros to expand LRZ's buildings needed to house the new system. That funding, as well as the aforementioned operational costs, will be provided by the State of Bavaria and Germany.
Jun 19, 2013 |
Supercomputer architectures have evolved considerably over the last 20 years, particularly in the number of processors that are linked together. One aspect of HPC architecture that hasn't changed is the MPI programming model.
Jun 18, 2013 |
The world's largest supercomputers, like Tianhe-2, are great at traditional, compute-intensive HPC workloads, such as simulating atomic decay or modeling tornados. But data-intensive applications--such as mining big data sets for connections--is a different sort of workload, and runs best on a different sort of computer.
Jun 18, 2013 |
Researchers are finding innovative uses for Gordon, the 285 teraflop supercomputer housed at the San Diego Supercomputer Center (SDSC) that has a unique Flash-based storage system. Since going online, researchers have put the incredibly fast I/O to use on a wide variety of workloads, ranging from chemistry to political science.
Jun 17, 2013 |
The advent of low-power mobile processors and cloud delivery models is changing the economics of computing. But just as an economy car is good at different things than a full size truck, an HPC workload still has certain computing demands that neither the fastest smartphone nor the most elastic cloud cluster can fulfill.
Jun 14, 2013 |
For all the progress we've made in IT over the last 50 years, there's one area of life that has steadfastly eluded the grasp of computers: understanding human language. Now, researchers at the Texas Advanced Computing Center (TACC) are utilizing a Hadoop cluster on its Longhorn supercomputer to move the state of the art of language processing a little bit further.
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
Join HPCwire Editor Nicole Hemsoth and Dr. David Bader from Georgia Tech as they take center stage on opening night at Atlanta's first Big Data Kick Off Week, filmed in front of a live audience. Nicole and David look at the evolution of HPC, today's big data challenges, discuss real world solutions, and reveal their predictions. Exactly what does the future holds for HPC?
Join our webinar to learn how IT managers can migrate to a more resilient, flexible and scalable solution that grows with the data center. Mellanox VMS is future-proof, efficient and brings significant CAPEX and OPEX savings. The VMS is available today.