HPCwire

Leading HPC
Solution Providers

























HPCwire >> Topic >> Interconnects

Ethernet Takes on High-Performance Clustering


Freed of the CPU overhead and networking latency thwarting its deployment beyond networking, Ethernet is finally ready to take on the most demanding of data center applications -- high-performance clustering. When fully implemented, iWARP Ethernet achieves low latency, low CPU utilization, high throughput, and high bandwidth characteristics on par with proprietary clustering fabrics. It simplifies connectivity, lowers total cost of ownership (TCO), and delivers at last on the promise of one data center fabric that can do it all.

Every data center today supports Ethernet for networking, and many use Fibre Channel for block storage. As mainstream popularity grows for clustering and grid, many are also adding a proprietary fabric, such as InfiniBand, specifically designed to meet the performance demands of fine-grain parallel processing applications. These three-fabric configurations work, but require separate maintenance and management of each network and create a challenge in blade systems due to inherent space, power, and cooling constraints. A goal of many data center managers is to reduce the complexity and number of fabrics without compromising networking performance.

iWARP Ethernet achieves high performance in Ethernet channel adapters

iWARP is a set of standards developed by the RDMA Consortium and IETF, enabling TCP/IP-based Ethernet to address the three major sources of networking overhead -- transport (TCP/IP) processing, intermediate buffer copies, and application context switch overhead. Fully implemented in a new type of networking device, an Ethernet channel adapter or ECA, iWARP enables Ethernet to successfully meet the performance requirements of all data center fabrics by:

-- offloading TCP/IP transport processing from the CPU
-- using RDMA and DDP to move data directly to/from application memory, eliminating overhead due to unnecessary data buffering
-- instituting user level direct access (ULDA), enabling applications to directly control data movement through the ECA without OS involvement (this is also known as OS bypass).

Together these features allow applications to take full advantage of 10 Gb of network bandwidth, while utilizing a minimum of server resources for networking support.

Partial implementations of 1 Gb and 10 Gb iWARP Ethernet, available as transport offload engines (TOEs) and kernel-mode RDMA NICs (RNICs), offer some relief from overhead and latency issues, but not enough to support the 10 times performance increase of 10 Gb Ethernet. ECAs offer a full iWARP implementation and can be distinguished from conventional NICs and RNICs by their inclusion of ULDA, a key feature of the iWARP standards.

Benchmark results

In performance comparisons of current InfiniBand host channel adapters (HCAs) to standard 10 GbE NICs and 10 GbE iWARP ECAs, the results speak for themselves. When fully implemented, iWARP allows Ethernet to achieve low latency, high bandwidth and low CPU utilization comparable to or better than proprietary clustering fabrics.

One fabric. One adapter. Lower TCO.

iWARP ECAs enable data centers to use a single building block for any networking topology with no compromises in bandwidth, throughput, CPU utilization or latency. Proprietary clustering fabrics can be replaced by iWARP Ethernet, block-based storage fabrics can be replaced with iSCSI and iSER (RDMA-enabled iSCSI) over iWARP Ethernet, while compatibility with ubiquitous Ethernet is maintained.

One fabric greatly simplifies connectivity. In any server, three adapters can be reduced to one. The three switches formerly connected to those three adapters are reduced from three to one. In fact, in the data center of the very near future, every server and every node in an HPC cluster has only two connections -- one for power and one for everything else. The need for separate spares and different training is eliminated. Network management software returns to familiar Ethernet.

The iWARP specifications allow TCP/IP-based Ethernet to achieve equal or better performance than today's clustering and storage fabrics. IWARP ECAs simplify connectivity, lower total cost of ownership (TCO), and deliver on the promise that one data center fabric can do it all.

-----

Terry Hulett is VP of Silicon Engineering for NetEffect, Inc., an Austin, Texas-based company offering iWARP ECAs for 1 GbE and 10 GbE fabrics.


Article Tools

  • Print This Article

Share & Save Options

Discussion

There are 0 discussion items posted.  

Sponsored Links

Interview: Appro CEO Shares HPC Vision
Appro CEO Daniel Kim provides a glimpse into Appro's vision and opportunities for its supercomputer and high-performance cluster solutions.



Feature Articles

Computed Tomography Software Taps Into NVIDIA GPUs

Minnesota-based North Star Imaging, a firm that specializes in industrial X-rays for nondestructive testing and analysis, is employing NVIDIA GPUs to accelerate 3D renderings in their CT (computed tomography) software. Julien Noel, the company's CT product manager, says the exceptional computational power afforded by CUDA and Tesla hardware is increasing customer productivity and transforming their workflow.
Read More...

The Next Big Thing in Humanities, Arts and Social Science Computing: 18thConnect

For the humanities scholar who may have only recently mastered library and archival finding aids beyond the archaic card catalog, the possibility of retrieving source materials at the flash of a keystroke (well maybe a few...) is very heady stuff.
Read More...

HPC Clouds -- Alto Cirrus or Cumulonimbus

The "cloud" model of exporting user workload and services to remote, distributed and virtual environments is emerging as a powerful computing paradigm. Yet, one domain that challenges this model in its characteristics and needs is high performance computing.
Read More...

Top Headlines

Dawning 6000 to Use Chinese-Made Loongson Processor

Nov 28 | People's Daily Online | Currently under development, the Dawning 6000 HPC system will be based on the Chinese-made "Loongson" microprocessor. Read more...

Can Supercomputers Help Save the Economy?

Nov 27 | Computerworld | The use of supercomputers to increase the industrial might of the U.S. has amounted to little more than an asterisk from a financial standpoint in both the federal budget and the economy as a whole. Read more...

IBM to Establish 'Collaboratory' in Dublin

Nov 26 | Science Business | IBM is getting ready to set up a supercomputing research “collaboratory” in Dublin, Ireland. Read more...

Texan Prof Sees Big Future for Graphene Storage

Nov 25 | The Register | A Rice University professor believes that his proposed graphene arrays could be many times denser and faster than existing storage tech, and they'd be more reliable too. Read more...

Super Micro Computer: A One-Man, or at Least One-Family, Powerhouse

Nov 24 | The New York Times | Server maker Super Micro Computer lives by two principles: give customers what they want, and do it as fast as humanly possible. Read more...

Multimedia

Video White Paper: Architecting a Better Network Storage Solution

BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.

Special Feature: SC08

Newsletters

Stay informed! Subscribe to HPCwire email Newsletters.

Get updates and insights on the High Productivity Computing industry delivered driectly to your inbox.





HPC Job Bank

Featured Events

 TradeTech Architecture – Europe’s largest meeting of CTOs and CIOs in the capital markets
Symposium 2009