HPCwire

Since 1986 - Covering the Fastest Computers
in the World and the People Who Run Them

Research Team 'Virtualizes' Red Storm Supercomputer


Study on virtualization of parallel supercomputing systems is largest of its kind

Jan. 21 -- A collaboration between researchers at Northwestern University, Sandia National Labs and the University of New Mexico has resulted in the largest-scale study ever done on what many consider an important part of the future of computing -- the virtualization of parallel supercomputing systems.

As part of this collaboration, Peter A. Dinda, associate professor of electrical engineering and computer science at Northwestern's McCormick School of Engineering and Applied Science, and his graduate student Jack Lange led the development of a virtual machine monitor called Palacios specifically for supercomputers. The system was tested Dec. 3 on Sandia's world-class Red Storm supercomputer. Sandia researchers, led by Kevin Pedretti, assisted in adapting and optimizing Palacios for the Red Storm environment and directed the testing effort.

Results show that the team successfully virtualized Red Storm using the Palacios virtual machine monitor and ran communication-intensive, fine-grain parallel benchmarks of critical interest to Sandia with extremely high performance. Testing went up to 4,096 nodes, making this the largest-scale study by at least two orders of magnitude.

"Virtualizing a parallel supercomputer is particularly challenging because of the need to support extremely low latency, high-bandwidth communication among thousands of virtual machines," Dinda says. "Supercomputing users and the owners of supercomputers will not tolerate any performance compromises because the machines are so expensive to acquire and maintain, but, on the other hand, they also want access to the benefits of virtualization."

A virtual machine monitor (VMM) works by separating a computer's operating system from its hardware. This indirection exposes a range of benefits. For example, a VMM allows an operating system from one machine to be run on another, such as when it needs more memory. It can also allow one machine to run multiple operating systems simultaneously, and it makes it possible to migrate running operating systems from one computer to another.

In the case of supercomputing, the VMM also acts as a translator between a user's software and the highly specialized hardware and software environments of the system, which could potentially allow more researchers to use supercomputers to solve complex problems.

With more than 38,000 processors, Red Storm is a massively parallel processing supercomputer that was uniquely designed to support modeling and simulation of complex problems in nuclear weapons stockpile stewardship. It is currently the 17th fastest computer in the world, with a theoretical peak performance of 284 trillion floating point operations per second in a relatively compact 3,500-square-foot footprint.

Virtualization on such a machine is important because it will allow more researchers to run scientific computing and simulation programs without reconfiguring their software to the machine's specific hardware and software environments. In this context, thousands of virtual machines must cooperate in order to solve large problems. But because the system is extremely expensive to run, any VMM must have low overhead, which is magnified through the fine-grain interactions among the virtual machines.
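To see why small per-node overheads are magnified in fine-grain parallel work, consider a toy model (not a measurement of Palacios or Red Storm). It assumes the nodes advance in lockstep, so every step waits for the slowest node, and that each node is independently delayed with a small probability per step. The function name, the delay probability and the delay size are all hypothetical values chosen for illustration.

```python
def expected_slowdown(nodes, p_delay, delay_fraction):
    """Expected per-step slowdown in a lockstep (bulk-synchronous) model.

    nodes          -- number of cooperating virtual machines
    p_delay        -- assumed chance that one node is delayed in a given step
    delay_fraction -- assumed delay size, as a fraction of a normal step
    """
    # A step is slowed if at least one node is delayed, because every
    # node waits for the slowest one before the next step begins.
    p_any_delayed = 1.0 - (1.0 - p_delay) ** nodes
    return 1.0 + delay_fraction * p_any_delayed


if __name__ == "__main__":
    # Hypothetical parameters: a 0.1% chance per node per step of a
    # delay worth 5% of the step time.
    for n in (1, 64, 4096):
        print(n, round(expected_slowdown(n, 0.001, 0.05), 5))
```

Under these assumptions a delay that is rare on a single node occurs in nearly every step at 4,096 nodes, so the whole job pays the full delay almost all the time. Real systems are more complicated, but the sketch shows why overhead has to be kept low on each node.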

At these massive scales, the Palacios virtual machine monitor had a measured overhead of less than 5 percent. The results clearly indicate that it is possible to bring the benefits of virtualization to even the largest computers in the world without performance compromises.

Virtualization is big business, with the market research and analysis firm IDC forecasting annual revenues to grow from $5.5 billion in 2007 to $11.7 billion in 2011.

"If we can virtualize supercomputers without performance compromises, we will make them easier to use and easier to manage, generally increasing the utility of these very large national infrastructure investments," Dinda says.

"The end goal is to provide a more flexible supercomputer environment to end users without sacrificing performance," Pedretti says. "The successful experiments with Palacios on Red Storm demonstrate the feasibility of our approach, and we hope to incorporate this technology in future capability supercomputer platforms."

Researchers can learn more about Palacios and download the latest version at http://v3vee.org.

-----

Source: Northwestern University



