The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
May 25, 2007
Cray is listening to its customers about their pain points, says Cray President and CEO Peter Ungaro. A presentation by Ungaro is usually an open and relaxed talk interspersed with humor, interesting insights, and a long-term view. He did not disappoint attendees of the 8th LCI International Conference on High Performance Clustered Computing. The focus of his May 17 keynote was "From Beowulf to Cray-o-wulf: Extending the Linux Clustering Paradigm to Supercomputing Scale."
Ungaro focused on the differences between Beowulf clusters, based entirely on commodity components, and what he termed "Cray-o-wulf" systems, ones that use many commodity components and a few custom components to deliver systems with much higher performance, reliability and manageability at scale. He presented the basic market realities of supercomputing: commodity processors will become primarily focused on scalability, and the proliferation of multicore processors with stagnant single core performance will continue over the next few years. These trends have generated renewed interest in novel processing architectures and accelerator technologies, an area in which Cray has an established reputation and expertise.
In developing its latest systems, Cray began by recognizing its customers' pain points, namely that big clusters were hitting limitations in the areas of power, cooling and floor space; interconnect performance was a major bottleneck; and user and programmer productivity were suffering as a result of system complexity. A major concern for customers is reliability-availability-serviceability (RAS), which is becoming especially difficult at large scale. Many commodity clusters are experiencing daily failures, an observation that was confirmed by other presentations at the LCI conference.
Ungaro stated that today there are few good storage and data management options, nor are there reasonable options for accelerator support, all of which are important to customers as their systems scale ever larger. Cray is bringing its experience in designing and developing high-productivity computer systems to the commodity-driven cluster market, with a number of new and innovative directions and insight.
"Cray's design is based on the premise that pure commodity clusters begin to break down in a number of ways after about 1,000 processors," said Ungaro. "Beowulf commodity clusters did a great job of getting us a low-cost solution to scale past what we could do with SMP technologies, getting us from hundreds of processors upwards of 1,000. But above the 1,000 processor limit, the pure commodity approach breaks down and does not provide productive, operational or maintainable systems."
The Cray-o-wulf approach is based on the convergence of a number of technologies and capabilities that Cray has expertise in, or is in the process of developing. The foundation for this is the company's Adaptive Supercomputing framework, a tightly-coupled integration of hardware and software based on high-availability building blocks. The framework targets capability supercomputing, a market in which customers have more complex applications and higher performance requirements than most mainstream HPC users. Cray's Adaptive Supercomputing vision leverages the company's strength in supercomputing, but aims to broaden the market for its technologies by combining multiple processing architectures into a single, scalable system. Cray's premise is that making supercomputing easier to use will draw in a new set of users that require higher sustained performance from their applications.
A critical component of Cray's system architecture is its proprietary high-bandwidth, low-latency interconnect. The network is highly resilient in the face of transient errors, whereas other network technologies just drop the packets and pay a retransmission performance penalty. Ungaro noted that proprietary interconnects account for only 42 percent of system fabrics on the current Top500 list. But If one narrows it down to the Top 50, then proprietary interconnects account for approximately 77 percent of the systems. In a February 2006 ComputerWorld article, NCAR's James Hack observed: "As scientific computing migrated toward commodity platforms, interconnect technology, both in terms of bandwidth and latency, became the limiting factor on application performance and continues to be a performance bottleneck."
In Cray's case, while many of its machines are built from available commodity multicore processor and standard memory chips, the specialized interconnect technology is what fundamentally differentiates a Cray-o-wulf from a Beowulf cluster.
Cray envisions multiple commodity and specialized processor technologies based on scalar x86/64 (such as the AMD Opteron Cray uses today in its popular XT4 systems), vector processors (which Cray has been well-known for), multithreaded processors (to address some of the new application areas which aren't as floating point intensive but use novel algorithms such as graphs) and exotic hardware accelerators (such as FPGAs, which were used in the Cray XD1, or GPUs).
Ungaro also hinted at the possibility of combining these various processing technologies together in a future adaptive processor. This implementation of an adaptive processor creates an integrated hybrid supercomputer architecture that allows the diverse user community to choose the processor technologies that meet their computing needs of attaining higher sustained performance on their applications, not just on benchmarks or peak performance metrics.
Page: 1 of 2(Digg, Technorati, more)
White Paper: HPC in a Green and Modular Solution Building Block
Learn how the Appro GreenBlade™ System helps consolidate server, storage, network, power and simplified management capabilities in a single package while providing the performance-density, energy-efficiency and best ROI for your business.
Jul 01 | GenomeWeb Daily News | The popularity of cloud computing in the life sciences community was on full display at April's Bio-IT World conference. Read more...
Jul 01 | Linux Magazine | How can getting to the ocean help with HPC computing? Read more...
Jun 29 | GCN.com | Agency issues RFI for "Ubiquitous High Performance Computing" systems. Read more...
Jun 29 | Computerworld | The bottom of the TOP500 reveals the coming revolution in truly accessible high-end computing. Read more...
Jun 18 | EE Times | Parallel software also takes spotlight at Stanford confab. Read more...
Apr 14 | | Many HPC IT departments are feeling the rising pressure to deliver more capacity computing and performance while trying to reduce the total cost of ownership. This white paper discusses how an environmentally-friendly and open-standards HPC building block based computing system using flexible interconnect options helps address capacity computing needs.
Source: Addison Snell, GM/VP, Tabor Research; sponsored by Dell
Many organizations that could benefit from the use of HPC clusters find that it is complicated to get the systems up and running because of limited IT resources or the complexities of the clusters themselves. Learn how the Intel Cluster Ready program, for which Dell was an original partner, seeks to address this challenge for entry level and mid-range HPC users.
BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.
Sun Studio Compilers and Tools and Sun HPC ClusterTools allow you to create high performance parallel applications for OpenSolaris, Solaris and Linux. Sun Studio Express 11/08 includes MPI performance analysis capabilities and full OpenMP 3.0 compiler support. Learn about all this and the latest in Sun HPC ClusterTools 8.1.