The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
September 22, 2006
At the HPC User Forum meeting in Denver this week, Horst Simon was one of two industry experts asked to provide a larger perspective as representatives of organizations in China, Japan and the U.S. discussed their petascale initiatives. In this Q&A, Simon offers his views about the challenges of achieving usable petascale systems within the next five years.
HPCwire: The U.S. and several other countries have petascale initiatives in play. How realistic is the dual goal of achieving sustained petaflops speed and substantially boosting productivity by 2010?
Simon: There have been multiple announcements of plans for petascale machines in that timeframe, so I'm fairly confident that the sustained petaflop goal will be attained by 2010, meaning that by then there will be a Gordon Bell Prize for a sustained petaflop on a real application on a real platform. I'd be even more comfortable predicting that this will happen by 2011.
As for part two of your question, it depends on your definition of productivity. If productivity means the economic output of a country, then I think 2010 is too early for petascale computing to affect that. If productivity means making efficient use of petascale systems in industry, for example to routinely create better products at lower costs through the use of simulation, this implies that the petascale systems would have good scalability, reliable application and system software, and so on, and I think 2010 will also be too soon for this. There will be many technical challenges, because we're entering a completely new arena of scalability. Getting to productive petaflop performance will be as difficult as it has been getting to productive teraflop performance.
But if productivity means running codes faster and at higher resolution, which can enables scientific breakthroughs, then I'm confident this will happen because of the continued dramatic advances in computational technology based on commodity clusters. Ten years ago, few people thought PC clusters would have the big impact they have had. The same thing will happen with petascale systems in the future. They will become common.
HPCwire: Can anyone really afford a general-purpose sustained petaflop system in the 2009-11 timeframe that several countries are targeting? By general purpose, I mean a system that can sustain petaflop performance on a reasonably broad spectrum of codes.
Simon: Yes, to the part about affording petaflop systems in that timeframe. Petaflop systems are expensive, but not super-expensive if you look at them in relation to other large-scale scientific projects, like particle accelerators or the next-generation space telescope. It costs about $200 million to fund a petascale system today, which is not outrageously high. The bigger question is, what is the optimal time to make that investment? The Earth Simulator was a huge investment of about $400 million and made a big, immediate impact when it went live in 2002. Now, four years later, a 40-teraflop machine is much less expensive. It's important to produce significant results in the first one to two years, so Moore's Law doesn't catch up with the machine. It must be productive quickly.
As for general-purpose, if we look at the ratios of memory and disk that would be needed, and the I/O rates, then a general-purpose petascale system could become much more expensive. But we're approaching an era when the whole notion of general-purpose HPC systems may no longer apply. Instead, there will be commodity clusters for most things, along with opportunities to leverage special-purpose technologies like Blue Gene, which can run specific applications very successfully, or MDGRAPE3 from RIKEN, which arguably is the first petascale system and is highly specialized. In 2010-11, I would expect an increasing trend toward more highly specialized systems.
HPCwire: Can anything be done to alleviate the costs of petascale systems while maintaining their usefulness?
Simon: One big issue for the future is that operational costs have been increasing significantly. We're approaching the point where computers can cost more to operate than their acquisition cost. One big potential area for cost reduction is less power-consuming components to reduce overall cost of ownership. Another one is facility construction. Construction costs have also gone up substantially, and you can save a lot of money if you don't have to build a new facility or heavily modify an existing one.
Page: 1 of 3(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
C-DAC announces plans for a petaflop system; IBM researchers are working on vertical integration techniques to extend Moore's Law another 15 years. We recap those stories and more in our weekly wrapup.
Read More...
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
Mar 19 | OfficialWire | New super to support intelligence work Down Under. Read more...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html