HPCwire

The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing

Peter Ungaro Talks Up Cray-o-wulf at LCI Conference



Cray is listening to its customers about their pain points, says Cray President and CEO Peter Ungaro. A presentation by Ungaro is usually an open and relaxed talk interspersed with humor, interesting insights, and a long-term view. He did not disappoint attendees of the 8th LCI International Conference on High Performance Clustered Computing. The focus of his May 17 keynote was "From Beowulf to Cray-o-wulf: Extending the Linux Clustering Paradigm to Supercomputing Scale."

Ungaro focused on the differences between Beowulf clusters, based entirely on commodity components, and what he termed "Cray-o-wulf" systems, ones that use many commodity components and a few custom components to deliver systems with much higher performance, reliability and manageability at scale. He presented the basic market realities of supercomputing: commodity processors will become primarily focused on scalability, and the proliferation of multicore processors with stagnant single core performance will continue over the next few years. These trends have generated renewed interest in novel processing architectures and accelerator technologies, an area in which Cray has an established reputation and expertise.

In developing its latest systems, Cray began by recognizing its customers' pain points, namely that big clusters were hitting limitations in the areas of power, cooling and floor space; interconnect performance was a major bottleneck; and user and programmer productivity were suffering as a result of system complexity. A major concern for customers is reliability-availability-serviceability (RAS), which is becoming especially difficult at large scale. Many commodity clusters are experiencing daily failures, an observation that was confirmed by other presentations at the LCI conference.

Ungaro stated that today there are few good storage and data management options and no reasonable options for accelerator support, all of which are important to customers as their systems scale ever larger. Cray is bringing its experience in designing and developing high-productivity computer systems to the commodity-driven cluster market, along with a number of new directions and insights.

"Cray's design is based on the premise that pure commodity clusters begin to break down in a number of ways after about 1,000 processors," said Ungaro. "Beowulf commodity clusters did a great job of getting us a low-cost solution to scale past what we could do with SMP technologies, getting us from hundreds of processors upwards of 1,000. But above the 1,000 processor limit, the pure commodity approach breaks down and does not provide productive, operational or maintainable systems."

The Cray-o-wulf approach is based on the convergence of a number of technologies and capabilities that Cray has expertise in, or is in the process of developing. The foundation for this is the company's Adaptive Supercomputing framework, a tightly-coupled integration of hardware and software based on high-availability building blocks. The framework targets capability supercomputing, a market in which customers have more complex applications and higher performance requirements than most mainstream HPC users. Cray's Adaptive Supercomputing vision leverages the company's strength in supercomputing, but aims to broaden the market for its technologies by combining multiple processing architectures into a single, scalable system. Cray's premise is that making supercomputing easier to use will draw in a new set of users that require higher sustained performance from their applications.

A critical component of Cray's system architecture is its proprietary high-bandwidth, low-latency interconnect. The network is highly resilient in the face of transient errors, whereas other network technologies just drop the packets and pay a retransmission performance penalty. Ungaro noted that proprietary interconnects account for only 42 percent of system fabrics on the current Top500 list. But if one narrows it down to the Top 50, then proprietary interconnects account for approximately 77 percent of the systems. In a February 2006 ComputerWorld article, NCAR's James Hack observed: "As scientific computing migrated toward commodity platforms, interconnect technology, both in terms of bandwidth and latency, became the limiting factor on application performance and continues to be a performance bottleneck."

In Cray's case, while many of its machines are built from available commodity multicore processors and standard memory chips, the specialized interconnect technology is what fundamentally differentiates a Cray-o-wulf from a Beowulf cluster.

Cray envisions a mix of commodity and specialized processor technologies: scalar x86/64 processors (such as the AMD Opteron that Cray uses today in its popular XT4 systems), vector processors (for which Cray has long been well known), multithreaded processors (to address newer application areas that are less floating-point intensive but rely on novel algorithms, such as graph algorithms) and exotic hardware accelerators (such as FPGAs, which were used in the Cray XD1, or GPUs).

Ungaro also hinted at the possibility of combining these various processing technologies in a future adaptive processor. Such a processor would create an integrated hybrid supercomputer architecture that lets a diverse user community choose the processor technologies that best meet its computing needs, with the goal of higher sustained performance on real applications, not just on benchmarks or peak performance metrics.
