The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
November 12, 2009
SAN JOSE, Calif., Nov. 12 -- Under the Debugger Software Enhancement program for Petascale production grade tools, awarded to Allinea Software Inc. by Oak Ridge National Laboratory in Q2 2009, Allinea's Distributed Debugging Tool (DDT) is setting new levels of debugger scalability on Jaguar, a Cray XT5 and one of the world's largest supercomputers.
In Q2 2009, Allinea began a collaborative project with ORNL to extent the scalability of its DDT product. The project goal is to enable ORNL's users to debug MPI applications that span many hundreds of thousands of processors, while delivering novel capabilities that can radically simplify this task. Following successful completion of the early stages of the project, Allinea was able to demonstrate that DDT can debug a 220,000 process application running on ORNL's Jaguar supercomputer.
"Given the inherent complexities of developing applications at petascale, it is very important that our users are not frustrated by the very tools that are intended to help solve their problems," said Dr. David Lecomber, CTO of Allinea. "Our initial work has therefore focused on making the basic Petascale debugging experience much the same as it would be on very modest numbers of processes. I am pleased to say that we can now launch a debugging session at 220,000 processors in little more time than it takes to spawn the application itself. Current benchmarks also show that we can perform key actions -- like stepping 220,000 MPI processes, setting breakpoints, or comparing variables across this number of processors -- in a couple of hundred milliseconds or less. This is a massive step beyond what was previously possible in MPI debugging. We are delighted to have been able to achieve this result in such a short period of time."
The critical factors that have permitted Allinea DDT to scale past original project goals are attributed to an excellent collaboration with ORNL and the underlying design of the product, which lends itself well to incremental, modular improvements.
"ORNL and Allinea are partnering to enhance the scalability of the DDT debugger with the goal to support the complete Jaguar System. The work has progressed in a timely manner and has demonstrated the ability to debug a 220,000 process job," commented Richard L. Graham, applications performance tools group leader at ORNL. "We are very pleased with our partnership and the success we are achieving with Allinea Software. Our collaboration with Allinea Software is delivering excellent results."
Allinea Software will be exhibiting at the Supercomputing Conference (SC09) in Portland, Ore., from Nov. 16 -20, 2009, booth #1808.
About Allinea Software Inc.
Based in San Jose, Calif., Allinea Inc. is the US subsidiary of Allinea Software Ltd. and is a leading supplier of tools for parallel programming and high performance computing (HPC). Allinea's products are used by leading commercial and research institutions across the world, and have consistently set the standard for affordability, functionality and ease-of-use -- whether applied to applications at modest scale or petascale applications on the world's largest supercomputers. With new product features aimed at multi-threaded applications and novel computing architectures, Allinea is now bringing its wealth of experience in parallel tools to the rapidly-expanding arena of multicore processing. For more information, visit www.allinea.com.
-----
Source: Allinea Software Inc.
(Digg, Technorati, more)
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
The ACM Turing Award goes to the creator of the modern personal computer; and Voltaire announces a mid-range InfiniBand switch and new technology that accelerates distributed applications. We recap those stories and more in our weekly wrapup.
Read More...
The prospects for virtual SMP technology got another boost last month when Florida State University announced it had installed a new HPC system from 3Leaf Systems. The servers are being housed at the university's HPC facility and will be used across a range of scientific disciplines.
Read More...
For the first time in 62 years, the four-man Olympics bobsled team from the US captured the gold medal, setting a course world record in the process. The winning bobsled had some state-of-the-art engineering behind it, including CFD software from Exa Corporation. As it turned out, that software may have proved to be the margin of difference in the race.
Read More...
Mar 15 | The Register | EMC's grand vision for unified global storage. Read more...
Mar 15 | Data Center Knowledge | Company delivers UCS-container solution to NASA. Read more...
Mar 11 | Linux Magazine | CUDA may be the rage, but OpenCL is a standard that has some features you may need. Read more...
Mar 09 | Free Software Magazine | Data-driven computing will need open software. Read more...
Mar 09 | Bio-IT World | Tahoe Informatics founder eyes GPUs, CUDA software. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html