GPUs Will Morph ORNL’s Jaguar Into 20-Petaflop Titan

By Michael Feldman

October 11, 2011

Jaguar’s days as a CPU-only supercomputer are numbered. Over the next year, the 2.3 petaflop machine at the Department of Energy’s Oak Ridge National Lab (ORNL) will be upgraded by Cray with the new NVIDIA “Kepler” GPUs, producing a system with about 10 times Jaguar’s peak performance. The transformed supercomputer will be renamed Titan and should deliver in the neigborhood of 20 peak petaflops sometime in late 2012.

The current Jaguar system, which has already been upgraded numerous times since it was first deployed in 2009, currently sits at number three on the TOP500 list with a Linpack reading of 1.76 petaflops. Titan will certainly keep the machine in the top 5, even as machines with tens of petaflops start making their way into the big labs over the next couple years.

Titan will also represent the US entry in the echelons of top tier GPU-accelerated supercomputing. As it stands today, three of the top five systems are GPU accelerated: Tianhe-1A and Nebulae in China, and TSUBAME 2.0 in Japan. The current top GPU machine in the US is Edge, a 240-teraflop Appro cluster at Lawrence Livermore National Laboratory. Even Russia, Germany, Italy have larger systems.

According to Steve Scott, the newly minted chief technology officer for NVIDIA’s Tesla Business Unit, the fact that ORNL is making such a significant commitment to GPU computing is a big endorsement for the architecture. It’s no secret that HPC is now constrained by energy use. Moore’s Law has managed to shrink the transistor geometries, but the power wall has become the defining limitation for performance increases. “It’s all about power efficiency” Scott told HPCwire, “which is why we think the GPU story is so compelling.”

While GPUs are not truly general-purpose processors, their ability to perform data-parallel computation in a much more energy-efficient manner than CPUs has vaulted them to prominence in the HPC realm. “It’s hard to overstate the importance of the sea change that has happened in high performance computing,” notes Scott. “This wonderful ride we’ve been on for the past 30 years — every time we halve the size of transistor, the voltage drops, power stays the same, and performance improves exponentially — has been fantastic, but it’s done.”

Although the US, in general, has been a bit late in embracing GPU technology for HPC, the Titan supercomputer has been on the drawing board at Oak Ridge for at least a couple of years. But the technology necessary to implement that machine is just now catching up with those requirements.

Beginning this fall, most of 18,688 of Jaguar’s current XT5 nodes will be retrofitted with Cray’s new XK6 blades, which the company unveiled in May. The immediate result is that the current dual-socket 6-core AMD Opteron nodes will be swapped out for a single 16-core “Interlagos” CPU node and the interconnect will upgraded from SeaStar 2 to Gemini. Each XK6 blade encompasses four compute nodes, with an Opteron on each one, and the ability to connect each of those CPUs to a Tesla GPU on a PCIe daughter card.

Initially, 960 of those XK6 nodes will be outfitted with the Fermi-class Tesla M2090 GPUs, with the other odd 17 thousand remaining as CPU-only blades for the time being. This first phase of Titan is expected to be completed before the end of the year. Then in the second half of 2012, all 18,688 nodes, including the original Fermi-equipped blades, will be populated with NVIDIA’s next-generation Kepler Teslas.

NVIDIA has not provided detailed specs on the Kepler GPUs, but according to Scott their performance per watt will be more be than double that of the Fermi parts, while fitting into the same power envelope. Given the current Fermi Tesla cards (GPUs plus memory) deliver 665 gigaflops, the new Kepler GPU should yield at least 1330 gigaflops.

For the time being, Oak Ridge is promising only 10 to 20 petaflops for the final system, although the peak performance could go considerably higher. According to Buddy Bland, project director at ORNL’s Leadership Computing Facility, they currently don’t have the money in hand to upgrade all 18K nodes. The actual scope of the Titan build-out will “depend on the budget available.”

Theoretically though, if all existing nodes are populated with the new Kepler parts, the system should deliver at least 24.8 petaflops of GPU power. An equal number of Interlagos CPUs should contribute more than two additional petaflops on top of that. By the time all the dust has settled, Titan could be within spitting distance of 30 petaflops. 

The amount of power the new system will draw is also unknown, but it will certainly have a better performance per watt ratio than Jaguar, which sucks up nearly 7 MW for its 2.33 peak petaflops. By contrast, Japan’s Fermi-accelerated TSUBAME system uses just 1.4 MW for its 2.29 petaflops. Since ORNL’s new machine will use the more efficient Kepler GPUs, its efficiency should be significantly better. “We view Titan as the leading indicator of where people are going as they look to solve the energy challenges for the next five to ten years,” says Scott.

How all those peak flops turn into actual application performance remains to be seen. Extracting high levels of sustained computation from these multi-petaflop machines is notoriously difficult, with only a handful of codes able to attain more than a petaflop of performance. Adding GPUs to the mix has made that harder, at least in the short term.

In this regard, Oak Ridge, with one of the premier computational lab’s on the planet, has a good chance of pushing the envelope. Using smaller GPU clusters, computations scientists at ORNL and elsewhere have been busy porting six flagship science codes to CUDA, include Wang-Landau/LSMS for material science; S3D for engine combustion; PFLOTRAN for underground C02 sequestration and for underground contaminant containment; Denovo for radiation transport code in nuclear engineering; CAM-SE for climate change modeling; and LAMMPS, a molecular dynamics simulation code. Scott says ORNL, Cray and NVIDIA have been working together to adapt these science codes for heterogenous computing so that they are ready to go when Titan boots up.

This first phase of Titan is expected to generate more than $60 million in revenue for Cray, which could end up in the company’s hands before the end of the year. Over the lifetime of the contract, Cray is looking to collect more than $97 million, although if upgrade options are exercised, that number could go considerably higher.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

How the United States Invests in Supercomputing

November 14, 2018

The CORAL supercomputers Summit and Sierra are now the world's fastest computers and are already contributing to science with early applications. Ahead of SC18, Maciej Chojnowski with ICM at the University of Warsaw discussed the details of the CORAL project with Dr. Dimitri Kusnezov from the U.S. Department of Energy. Read more…

By Maciej Chojnowski

At SC18: Humanitarianism Amid Boom Times for HPC

November 14, 2018

At SC18 in Dallas, the feeling on the ground is one of forward-looking buoyancy. Like boom times that cycle through the Texas oil fields, the HPC industry is enjoying a prosperity seen only every few decades, one driven Read more…

By Doug Black

Nvidia’s Jensen Huang Delivers Vision for the New HPC

November 14, 2018

For nearly two hours on Monday at SC18, Jensen Huang, CEO of Nvidia, presented his expansive view of the future of HPC (and computing in general) as only he can do. Animated. Backstopped by a stream of data charts, produ Read more…

By John Russell

HPE Extreme Performance Solutions

AI Can Be Scary. But Choosing the Wrong Partners Can Be Mortifying!

As you continue to dive deeper into AI, you will discover it is more than just deep learning. AI is an extremely complex set of machine learning, deep learning, reinforcement, and analytics algorithms with varying compute, storage, memory, and communications needs. Read more…

IBM Accelerated Insights

From Deep Blue to Summit – 30 Years of Supercomputing Innovation

This week, in honor of the 30th anniversary of the SC conference, we are highlighting some of the most significant IBM contributions to supercomputing over the past 30 years. Read more…

New Panasas High Performance Storage Straddles Commercial-Traditional HPC

November 13, 2018

High performance storage vendor Panasas has launched a new version of its ActiveStor product line this morning featuring what the company said is the industry’s first plug-and-play, portable parallel file system that delivers up to 75 Gb/s per rack on industry standard hardware combined with “enterprise-grade reliability and manageability.” Read more…

By Doug Black

How the United States Invests in Supercomputing

November 14, 2018

The CORAL supercomputers Summit and Sierra are now the world's fastest computers and are already contributing to science with early applications. Ahead of SC18, Maciej Chojnowski with ICM at the University of Warsaw discussed the details of the CORAL project with Dr. Dimitri Kusnezov from the U.S. Department of Energy. Read more…

By Maciej Chojnowski

At SC18: Humanitarianism Amid Boom Times for HPC

November 14, 2018

At SC18 in Dallas, the feeling on the ground is one of forward-looking buoyancy. Like boom times that cycle through the Texas oil fields, the HPC industry is en Read more…

By Doug Black

Nvidia’s Jensen Huang Delivers Vision for the New HPC

November 14, 2018

For nearly two hours on Monday at SC18, Jensen Huang, CEO of Nvidia, presented his expansive view of the future of HPC (and computing in general) as only he can Read more…

By John Russell

New Panasas High Performance Storage Straddles Commercial-Traditional HPC

November 13, 2018

High performance storage vendor Panasas has launched a new version of its ActiveStor product line this morning featuring what the company said is the industry’s first plug-and-play, portable parallel file system that delivers up to 75 Gb/s per rack on industry standard hardware combined with “enterprise-grade reliability and manageability.” Read more…

By Doug Black

SC18 Student Cluster Competition – Revealing the Field

November 13, 2018

It’s November again and we’re almost ready for the kick-off of one of the greatest computer sports events in the world – the SC Student Cluster Competitio Read more…

By Dan Olds

US Leads Supercomputing with #1, #2 Systems & Petascale Arm

November 12, 2018

The 31st Supercomputing Conference (SC) - commemorating 30 years since the first Supercomputing in 1988 - kicked off in Dallas yesterday, taking over the Kay Ba Read more…

By Tiffany Trader

OpenACC Talks Up Summit and Community Momentum at SC18

November 12, 2018

OpenACC – the directives-based parallel programing model for optimizing applications on heterogeneous architectures – is showcasing user traction and HPC im Read more…

By John Russell

How ASCI Revolutionized the World of High-Performance Computing and Advanced Modeling and Simulation

November 9, 2018

The 1993 Supercomputing Conference was held in Portland, Oregon. That conference and it’s show floor provided a good snapshot of the uncertainty that U.S. supercomputing was facing in the early 1990s. Many of the companies exhibiting that year would soon be gone, either bankrupt or acquired by somebody else. Read more…

By Alex R. Larzelere

Cray Unveils Shasta, Lands NERSC-9 Contract

October 30, 2018

Cray revealed today the details of its next-gen supercomputing architecture, Shasta, selected to be the next flagship system at NERSC. We've known of the code-name "Shasta" since the Argonne slice of the CORAL project was announced in 2015 and although the details of that plan have changed considerably, Cray didn't slow down its timeline for Shasta. Read more…

By Tiffany Trader

TACC Wins Next NSF-funded Major Supercomputer

July 30, 2018

The Texas Advanced Computing Center (TACC) has won the next NSF-funded big supercomputer beating out rivals including the National Center for Supercomputing Ap Read more…

By John Russell

IBM at Hot Chips: What’s Next for Power

August 23, 2018

With processor, memory and networking technologies all racing to fill in for an ailing Moore’s law, the era of the heterogeneous datacenter is well underway, Read more…

By Tiffany Trader

Requiem for a Phi: Knights Landing Discontinued

July 25, 2018

On Monday, Intel made public its end of life strategy for the Knights Landing "KNL" Phi product set. The announcement makes official what has already been wide Read more…

By Tiffany Trader

House Passes $1.275B National Quantum Initiative

September 17, 2018

Last Thursday the U.S. House of Representatives passed the National Quantum Initiative Act (NQIA) intended to accelerate quantum computing research and developm Read more…

By John Russell

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learni Read more…

By Rob Farber

Summit Supercomputer is Already Making its Mark on Science

September 20, 2018

Summit, now the fastest supercomputer in the world, is quickly making its mark in science – five of the six finalists just announced for the prestigious 2018 Read more…

By John Russell

New Deep Learning Algorithm Solves Rubik’s Cube

July 25, 2018

Solving (and attempting to solve) Rubik’s Cube has delighted millions of puzzle lovers since 1974 when the cube was invented by Hungarian sculptor and archite Read more…

By John Russell

Leading Solution Providers

US Leads Supercomputing with #1, #2 Systems & Petascale Arm

November 12, 2018

The 31st Supercomputing Conference (SC) - commemorating 30 years since the first Supercomputing in 1988 - kicked off in Dallas yesterday, taking over the Kay Ba Read more…

By Tiffany Trader

TACC’s ‘Frontera’ Supercomputer Expands Horizon for Extreme-Scale Science

August 29, 2018

The National Science Foundation and the Texas Advanced Computing Center announced today that a new system, called Frontera, will overtake Stampede 2 as the fast Read more…

By Tiffany Trader

HPE No. 1, IBM Surges, in ‘Bucking Bronco’ High Performance Server Market

September 27, 2018

Riding healthy U.S. and global economies, strong demand for AI-capable hardware and other tailwind trends, the high performance computing server market jumped 28 percent in the second quarter 2018 to $3.7 billion, up from $2.9 billion for the same period last year, according to industry analyst firm Hyperion Research. Read more…

By Doug Black

Intel Announces Cooper Lake, Advances AI Strategy

August 9, 2018

Intel's chief datacenter exec Navin Shenoy kicked off the company's Data-Centric Innovation Summit Wednesday, the day-long program devoted to Intel's datacenter Read more…

By Tiffany Trader

Germany Celebrates Launch of Two Fastest Supercomputers

September 26, 2018

The new high-performance computer SuperMUC-NG at the Leibniz Supercomputing Center (LRZ) in Garching is the fastest computer in Germany and one of the fastest i Read more…

By Tiffany Trader

Houston to Field Massive, ‘Geophysically Configured’ Cloud Supercomputer

October 11, 2018

Based on some news stories out today, one might get the impression that the next system to crack number one on the Top500 would be an industrial oil and gas mon Read more…

By Tiffany Trader

MLPerf – Will New Machine Learning Benchmark Help Propel AI Forward?

May 2, 2018

Let the AI benchmarking wars begin. Today, a diverse group from academia and industry – Google, Baidu, Intel, AMD, Harvard, and Stanford among them – releas Read more…

By John Russell

Google Releases Machine Learning “What-If” Analysis Tool

September 12, 2018

Training machine learning models has long been time-consuming process. Yesterday, Google released a “What-If Tool” for probing how data point changes affect a model’s prediction. The new tool is being launched as a new feature of the open source TensorBoard web application... Read more…

By John Russell

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This