Argonne National Laboratory is planning to move up to a 10-petaflop Blue Gene/Q supercomputer next year, supporting the DOE lab’s scientific research. The new machine continues Argonne’s six-year Blue Gene tradition, which has installed every iteration of the architecture in IBM’s BG franchise.
Argonne installed its first Blue Gene supercomputer, a 5-teraflop Blue Gene/L system, in 2005, which garnered a number 58 placement on the TOP500 list that year. In 2008, the lab upgraded to a 500-teraflop Blue Gene/P, which initially placed it at number 4 on the list. The upcoming Blue Gene/Q, called “Mira,” will almost certainly give Argonne at top 10 spot in 2012. More importantly, Mira will represent a 2,000-fold increase in peak processing power in the space of six years.
The new Argonne machine will join another DOE Blue Gene/Q, the 20-teraflop “Sequoia” supercomputer, to be installed at Lawrence Livermore National Laboratory in 2012. That system is slated to run weapons simulations in support of the National Nuclear Security Administration’s program to maintain the US nuclear stockpile.
By contrast, Argonne’s Mira will be devoted entirely to open science applications like climate studies, battery research, engine design, and cosmology. The DOE has selected 16 projects that will have first crack at the Q system when it’s booted up next year. Like its predecessors, Mira will be available as an INCITE and ASCR Leadership Computing Challenge (ALCC) resource, where CPU-hours are awarded to what the DOE determines are the most deserving researchers, based on a peer-reviewed competitive process.
“Argonne’s new IBM supercomputer will help address the critical demand for complex modeling and simulation capabilities, which are essential to improving our economic prosperity and global competitiveness,” said Rick Stevens, associate laboratory director for computing, environment and life sciences at Argonne National Laboratory.
The Mira system is based on IBM’s next-generation PowerPC SoC, in this case the 16-core Power A2 processor (PDF), a 64-bit CPU capable of handling 4 threads simultaneously. The processor has 32 KB of L1 cache — 16 KB for data and 16 KB for instructions. L2 cache is made up of 8 MB of embedded DRAM (eDRAM ), a high-density on-chip memory technology that IBM uses for Blue Gene and its latest Power7 processors. Memory and I/O controllers are integrated on-chip.
Each server node will contain a single A2 processor and sport either 8 or 16 GB of memory. A fully populated Blue Gene/Q rack contains 1024 nodes, representing 16K cores. I/O has been split from the server nodes so that configurations can scale compute and I/O independently. A rack can accommodate between 8 and 128 I/O nodes. Conveniently, the I/O nodes use the same Power A2 chip as the compute servers.
Server-to-server communication is performed over a 5D Torus, which is capable of up to 40 gigabits per second, four times the speed of the Blue Gene/P interconnect. The 5D Torus employs fiber optics, the first Blue Gene design to do so.
Compute performance is delivered by using a large number of relatively low-speed cores — a hallmark of the Blue Gene architecture. Unlike the speedy 3.3 GHz Power7 chips that will go into the future Blue Waters supercomputer at the NCSA, the A2 processor for Blue Gene hums along at a modest 1.6 GHz (although faster versions of this chip can hit 3 GHz). According to IBM, Mira will encapsulate 750K cores, which works out to about 48,000 CPUs. Total memory is 750 TB, backed by 70 petabytes of disk storage.
The low-speed, high-core approach makes for a very energy-efficient package. A Blue Gene/Q prototype grabbed first place on the November 2010 Green500 list, with a Linpack rating of 1684.2 megaflops/watt. That bested even the latest Fermi GPU accelerated supers, like the TSUBAME 2.0 system recently installed at Tokyo Tech, as well as IBM’s fastest Cell (PowerXCell 8i) processor-accelerated QS22 clusters. To further boost energy efficiency and maintain reliability, all Blue Gene/Q racks are water cooled.
Because of its size, Argonne is looking at Mira as a stepping stone to exaflop supercomputing. With less than a million cores though, programmers will have to use some imagination to scale their codes to the hundreds of millions of cores envisioned in a true exascale system.
However, by the time IBM and others start building such machines, the Blue Gene PowerPC-based architecture is likely to be subsumed into the company’s Power-based line-up (which at the processor ISA level, at least, is quite similar). Based on a recent conversation with Herb Schultz, marketing manager for IBM’s Deep Computing unit, the Power and Blue Gene lines may merge around the middle of this decade. That would suggest that Blue Gene/Q could very well be the last in the Blue Gene lineage.