HPCwire

Leading HPC
Solution Providers




















HPCwire >> Features

ORNL Gears Up for New Leadership Computing Systems


Page:  1  of  2
1 | 2   All  »  

Last week at the High Performance Computing and Communication Conference in Newport, Rhode Island, Doug Kothe gave an overview of leadership computing facility at Oak Ridge National Laboratory (ORNL) and talked about the lab's plans for its future computing systems. Kothe, the Director of Science in the National Center for Computational Sciences (NCCS) at ORNL, and a nuclear engineer by training, is no stranger to supercomputers. He has spent most of his career at Los Alamos National Laboratory developing and working with CFD and other multi-physics codes. Before coming to ORNL in January 2006, he was the Deputy Program Director of the LANL ASC Program.

As part of his presentation at the conference, Kothe gave the audience a sense of the preparations going on around the upcoming Cray supercomputer deployments. As one of the Department of Energy's leadership computing facilities, ORNL is in line to get some of the most powerful systems on the planet. By late 2007, ORNL will have upgraded the existing 119-teraflop Cray XT4 'Jaguar' system to a peak performance of 250 teraflops. By late 2008, a new one petaflops Cray 'Baker' system will be installed. Both machines will employ the upcoming quad-core AMD Opteron processors.

The current and planned systems at ORNL represent the largest open resources for computational science research in the world. The scientific research being conducted on these machines is through projects granted allocations via the highly competitive and popular INCITE Program (http://hpc.science.doe.gov/allocations/incite/).

While the computing hardware plans are already in place, the lab is busy lining up other infrastructure and getting the applications ready for the new systems. Although the Cray systems were specifically selected for the types of "big science" applications that the DOE runs, there is still a great deal of work to be done in getting the codes ready for the new systems. In addition, since the optimal types of I/O systems and archive storage are dependent on the application dataset requirements, the storage systems still need to be matched up with the workloads.

"Requirements flow both ways," said Kothe. "The applications impose requirements on the systems and the systems impose requirements on the applications." He said that until they get the thousands of quad-core AMD processors on-site, detailed upstream computer and computational science performance analysis and modeling is required to get a handle on how the applications are going to perform. The developers and NCCS staff are also using testbeds and simulators in this process.

With the next Jaguar upgrade less than six months away, the DOE Office of Advanced Scientific Computing Research (ASCR) has selected the applications that will be granted early user access on the new system. Part of the process involved surveying 20 to 30 different applications teams for the suitability of their codes. The teams were asked questions like: "If you had a 250-teraflop system all to yourself for a short while, what would you do? What are you modeling? What do the algorithms look like? Is your code ready or what would you need to get ready?" In general, leadership computing systems are for scientists who can't advance their science easily without such resources. The scientists have the burden of proving that they need the full system resources to do their research. This process is carried out in a peer-reviewed fashion through the INCITE Program.

The collected information from the surveys was sent to ASCR, the DOE Program Office (http://www.sc.doe.gov/ascr/) whose mission is to deliver leadership computing capabilities to scientists. According to Kothe, six codes have been selected that they believe can be ready when the 250-teraflop system is installed. The applications areas include combustion science, astrophysics, fusion energy, chemistry, material science/nanoscience, and climate. The code teams are gearing up in anticipation.

The same sorts of plans have been started for the 2008 Baker system; they're just not as far along. But they've already polled many scientists on what they would do with the petaflops machine.

The application scale-up work relies on the availability of testbeds and simulators. "The sooner we can get our hands on the [Opteron] quad-core test beds, the better," said Kothe. "We think this will be in place in early summer." Fortunately, ORNL already has Jaguar, a large dual-core Opteron system. So the transition should be pretty smooth and hopefully without too many last-minute surprises."

The real challenge for the applications will be to use as much of the new systems' computing power as possible. This is the classic problem for HPC applications. As the growth in the number of computing cores increases, it often outstrips the ability of applications to parallelize. The petaflops Baker system is expected to contain over 22,000 quad-core processors.

Page:  1  of  2
1 | 2   All  »  

Article Tools

  • Print This Page
  • Bookmark This Article

Share Options

(Digg, Technorati, more)


Subscribe

Discussion

There are 0 discussion items posted.  



Top Headlines

3D Seismic Data: Taking a Smarter Approach to Interpretation

Jul 09 | Engineer Live | The demand for computational tools to underpin the 3D seismic interpretation process has never been more apparent. Read more...

Engineering Unemployment Soared in 2Q to 8.6%

Jul 08 | EE Times | Unemployment for U.S. engineers has reached record levels, according to government figures. Read more...

Gartner Adjusts 2009 IT Spend Downward Again

Jul 08 | Network World | Global spending for 2009 projected to drop 6 percent, for a total of $3.2 trillion. Read more...

Concurrent and Parallel Are Not The Same

Jul 08 | Linux Magazine | Portability or efficiency? Neither is guaranteed when writing explicit parallel code. Read more...

800 TFLOP Real-Time Ray Tracing GPU Unveiled, Not for Gamers

Jul 07 | Ars Technica | Japanese company builds custom ASIC to accelerate real-time ray traced rendering for the auto industry. Read more...

Featured Whitepapers

Building High Performance Computing in a Green and Modular Solution Building Block

Apr 14 | | Many HPC IT departments are feeling the rising pressure to deliver more capacity computing and performance while trying to reduce the total cost of ownership. This white paper discusses how an environmentally-friendly and open-standards HPC building block based computing system using flexible interconnect options helps address capacity computing needs.

Multimedia

Webcast: Dell Expands HPC Access and Adoption with Intel Cluster Ready Program


Source: Addison Snell, GM/VP, Tabor Research; sponsored by Dell

Many organizations that could benefit from the use of HPC clusters find that it is complicated to get the systems up and running because of limited IT resources or the complexities of the clusters themselves. Learn how the Intel Cluster Ready program, for which Dell was an original partner, seeks to address this challenge for entry level and mid-range HPC users.

Video White Paper: Architecting a Better Network Storage Solution

BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.

Webcast: HPC Development Solutions: Sun Studio & Sun HPC ClusterTools


Sun Studio Compilers and Tools and Sun HPC ClusterTools allow you to create high performance parallel applications for OpenSolaris, Solaris and Linux. Sun Studio Express 11/08 includes MPI performance analysis capabilities and full OpenMP 3.0 compiler support. Learn about all this and the latest in Sun HPC ClusterTools 8.1.

Special Feature: ISC'09

Newsletters

Stay informed! Subscribe to HPCwire email Newsletters.






HPC Job Bank


Featured Events

WORLDCOMP 2009
Data Mining Courses