HPCwire

Since 1986 - Covering the Fastest Computers
in the World and the People Who Run Them

Language Flags

Visit additional Tabor Communication Publications

Datanami
Digital Manufacturing Report
HPC in the Cloud
Green Computing Report

Tabor Communications
Corporate Video

GPGPUs, China Look to Build on Supercomputing Successes of 2010


Rather than go through a litany of predictions about the high performance computing space for 2011, it's probably more instructive to point to two clear trends established in 2010 that are set to change the face of HPC over the next five to 10 years: the rise of GPGPU supercomputing and the emergence of China as an HPC superpower.

As we reported last year, GPU-powered petaflop machines and China supercomputing arose simultaneously over the last two years. The Asian nation has been most aggressive in taking advantage of GPGPUs to build big machines. This culminated in China grabbing the number one spot on the November 2010 TOP500 list with the 2.5-petaflops Tianhe-1A supercomputer, powered by NVIDIA Tesla GPUs.

The trends are just beginning, though. As of today, only 11 of the top 500 systems are GPGPU-powered, representing just 2 percent of the total. But three of those are in the top 5, and seven are in the top 100. A year before, there were just two GPU-equipped supercomputers on the entire list. The US and Europe, which developed much of the foundational hardware and software technology, have so far lagged in adopting GPGPU to build petascale machines.

As far as national presence, the US is still by far the dominant supercomputing power with 282 of the top 500 systems. China claims just 41 systems, which places it a distant second. But that's up from just 21 top machines in 2009. It's noteworthy that no other country in the list is accelerating its presence so rapidly.

The expectation is that both these trends will continue in 2011 and beyond. At the recent "Analyst Crossfire " forum held at SC10 in New Orleans (YouTube videos of which can be found in three parts: here, here, and here), the topics of GPGPU computing and China's emergence as a major HPC player were discussed at length. The four panelists -- Thomas Sterling, Professor of Computer Science at LSU, Jay Boisseau, Director of the Texas Advanced Computing Center (TACC), Peter ffoulkes, VP Marketing Adaptive Computing; and Michael Wolfe, Sr. Compiler Engineer at PGI -- generally agreed that China and GPU supercomputing will solidify the gains made they made in 2010, but that the long-term prospects for GPGPUs are less assured.

TACC's Boisseau and Adaptive's ffoulkes noted the GPGPU technology to build petascale supercomputers is available to everyone, but it was the Chinese that brought their resources to bear first. (Actually, the Japanese with their TSUBAME system were the true early adopters of GPGPU supercomputing, culminating in the number 2 ranked TSUBAME 2.0 machine in November 2010). The Chinese, though, moved purposefully to build a number of petascale or near-petascale machines, determining that GPGPU technology was the shortest path to that end. "I think they [the Chinese] are going to be a fixture in the TOP500 and especially even in the top 10 for several years -- and onward," said Boisseau.

The Chinese currently have 5 of their top 41 machines in the top 100 and are looking to build on that as they build out their supercomputing infrastructure. "They are rapidly becoming a major political power in the world and HPC is part of that," said ffoulkes.

LSU's Sterling noted that the Chinese are not simply buying their way into the upper echelons of supercomputing. The top Tianhe-1A machines is based on a home-grown system design, sporting a custom network interconnect and I/O processor developed from the silicon on up. "Unlike us, they have a long tradition of five year plans, and they stayed the course," said Sterling. And they will do so here as well. They have no doubt through the procedures and methods they've applied that they will be the leader in this field before the end of this decade." Indeed, China has stated it wants to be the first nation to field an exascale machine.

From Sterling's perspective the US is in deep crisis with respect to HPC technology research. "The Democrats don't understand the technology and the Republicans don't fund it unless it's for defense research," he said. "Blue Waters will come online shortly and will act as a placebo. We'll again be at the top of the list and we'll think we're doing fine, when in fact we are way behind in doing research for the future machines for the end of the decade."

Whether those machines will be GPGPU-based is less certain. Other accelerators like Intel's Many Integrated Core (MIC) processor won't debut for another year or so. And AMD's Fusion (CPU-GPU) processors are just now making their way into client side of the ecosystem. Even NVIDIA's roadmap for its discrete GPUs over the next couple of years will likely produce something that is unrecognizable as a graphics processor of the last decade.

It's even possible that accelerators, GPUs or otherwise, will not figure prominently in the biggest machines. PGI's Wolfe noted that the only two publicly-announced 10 petaflop systems -- the Power7-based Blue Waters in the US and the Sparc64 VIIIfx-based Kei Soku Keisanki (aka the "K computer") in Japan -- will rely solely on CPUs, albeit very high-end ones.

Wolfe thinks Intel's MIC is a "fascinating architecture," which has the potential to unseat NVIDIA's current dominance in HPC acceleration arena. And AMD (ATI) GPUs have the raw performance advantage, he says. That could be the decisive factor once a more level playing field for GPGPU middleware is in place, which is what AMD is banking on with the open standard OpenCL API. That technology is even more attractive when seen against AMD's CPU-GPU Fusion roadmap, which will eventually wind its way into the server side of the business.

From ffoulkes' perspective, the GPU will be just one of a number of accelerator architectures that drive future HPC machinery. Eventually though, he believes that accelerators will follow the same path as floating point units (FPUs), which used to run free as discrete coprocessors before being integrated onto the CPU. Integrating accelerator logic into a standard processor will make the technology ubiquitous and essentially free, according to ffoulkes. "And then the key becomes programmability," he said.

Sterling is even more circumspect about the GPGPU's longevity in HPC, at least in its current form. "With respect to the GPU, it's the flavor of the month," he said. "We've been here before with attached array processors."

Like others, Sterling believes heterogeneous architectures will be the model of the future, but the accelerator componentry will eventually be integrated on-chip, a la Fusion. That will solve much of the latency and bandwidth issues that currently limit performance on the PCI-connected discrete GPUs. It will also simplify the programming model.

"What we will develop is a uniform execution model for a non-uniform system architecture that scales," said Sterling. "That's when we will have arrived."

Sponsored Links

Accelerate your science with Seneca
One of the first HPC providers installing a 4X NVIDIA Kepler K-20 cluster. Invites you to a free evaluation on Seneca’s NVIDIA K20 Kepler cluster, pre-loaded with AMBER, NAMD, LAMMPS

High-Performance Computing in Action
Businesses that want to be on the cutting edge of their industries are increasingly turning to high-performance computing (HPC) solutions to handle complex compute processes and speed up their rate of innovation. Download this Executive Brief to see how businesses in energy, life sciences and entertainment put HPC solutions to work in their operations.

May 17, 2013

May 16, 2013

May 15, 2013

May 14, 2013

May 13, 2013

May 10, 2013

May 09, 2013

May 08, 2013

May 07, 2013

May 06, 2013



Short Takes

Running Computational Fluid Dynamics in the Cloud

May 16, 2013 | When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...

Computing the Physics of Bubbles

May 15, 2013 | Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...

Internet2 Awards Program Seeks Innovative Applications

May 10, 2013 | Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...

Floating Funding to Exascale Island

May 09, 2013 | The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...

HPC and the True Cost of Cloud

May 08, 2013 | For engineers looking to leverage high-performance computing, the accessibility of a cloud-based approach is a powerful draw, but there are costs that may not be readily apparent.
Read more...

Sponsored Whitepapers

Best Practices in Big Data Storage

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Progress in Parallel: the Bull Parallel Programming Center

04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.

Sponsored Multimedia

SGI DMF ZeroWatt Disk Solution

In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.

Cray CS300-AC Cluster Supercomputer Air Cooling Technology Video

The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.

SC12 Editorial Feature HPCwire Soundbite sponsored by ISC

HPC Job Bank


Featured Events


  • June 16, 2013 - June 20, 2013
    ISC'13
    Leipzig,
    Germany

  • June 17, 2013 - June 18, 2013
    Forecast 2013
    San Francisco, CA
    United States





HPCwire Events