The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
July 07, 2006
Freed of the CPU overhead and networking latency thwarting its deployment beyond networking, Ethernet is finally ready to take on the most demanding of data center applications -- high-performance clustering. When fully implemented, iWARP Ethernet achieves low latency, low CPU utilization, high throughput, and high bandwidth characteristics on par with proprietary clustering fabrics. It simplifies connectivity, lowers total cost of ownership (TCO), and delivers at last on the promise of one data center fabric that can do it all.
Every data center today supports Ethernet for networking, and many use Fibre Channel for block storage. As mainstream popularity grows for clustering and grid, many are also adding a proprietary fabric, such as InfiniBand, specifically designed to meet the performance demands of fine-grain parallel processing applications. These three-fabric configurations work, but require separate maintenance and management of each network and create a challenge in blade systems due to inherent space, power, and cooling constraints. A goal of many data center managers is to reduce the complexity and number of fabrics without compromising networking performance.
iWARP Ethernet achieves high performance in Ethernet channel adapters
iWARP is a set of standards developed by the RDMA Consortium and IETF, enabling TCP/IP-based Ethernet to address the three major sources of networking overhead -- transport (TCP/IP) processing, intermediate buffer copies, and application context switch overhead. Fully implemented in a new type of networking device, an Ethernet channel adapter or ECA, iWARP enables Ethernet to successfully meet the performance requirements of all data center fabrics by:
-- offloading TCP/IP transport processing from the CPU
-- using RDMA and DDP to move data directly to/from application memory, eliminating overhead due to unnecessary data buffering
-- instituting user level direct access (ULDA), enabling applications to directly control data movement through the ECA without OS involvement (this is also known as OS bypass).
Together these features allow applications to take full advantage of 10 Gb of network bandwidth, while utilizing a minimum of server resources for networking support.
Partial implementations of 1 Gb and 10 Gb iWARP Ethernet, available as transport offload engines (TOEs) and kernel-mode RDMA NICs (RNICs), offer some relief from overhead and latency issues, but not enough to support the 10 times performance increase of 10 Gb Ethernet. ECAs offer a full iWARP implementation and can be distinguished from conventional NICs and RNICs by their inclusion of ULDA, a key feature of the iWARP standards.
Benchmark results
In performance comparisons of current InfiniBand host channel adapters (HCAs) to standard 10 GbE NICs and 10 GbE iWARP ECAs, the results speak for themselves. When fully implemented, iWARP allows Ethernet to achieve low latency, high bandwidth and low CPU utilization comparable to or better than proprietary clustering fabrics.
One fabric. One adapter. Lower TCO.
iWARP ECAs enable data centers to use a single building block for any networking topology with no compromises in bandwidth, throughput, CPU utilization or latency. Proprietary clustering fabrics can be replaced by iWARP Ethernet, block-based storage fabrics can be replaced with iSCSI and iSER (RDMA-enabled iSCSI) over iWARP Ethernet, while compatibility with ubiquitous Ethernet is maintained.
One fabric greatly simplifies connectivity. In any server, three adapters can be reduced to one. The three switches formerly connected to those three adapters are reduced from three to one. In fact, in the data center of the very near future, every server and every node in an HPC cluster has only two connections -- one for power and one for everything else. The need for separate spares and different training is eliminated. Network management software returns to familiar Ethernet.
The iWARP specifications allow TCP/IP-based Ethernet to achieve equal or better performance than today's clustering and storage fabrics. IWARP ECAs simplify connectivity, lower total cost of ownership (TCO), and deliver on the promise that one data center fabric can do it all.
-----
Terry Hulett is VP of Silicon Engineering for NetEffect, Inc., an Austin, Texas-based company offering iWARP ECAs for 1 GbE and 10 GbE fabrics.
Interview: Appro CEO Shares HPC Vision
Appro CEO Daniel Kim provides a glimpse into Appro's vision and opportunities for its supercomputer and high-performance cluster solutions.
Minnesota-based North Star Imaging, a firm that specializes in industrial X-rays for nondestructive testing and analysis, is employing NVIDIA GPUs to accelerate 3D renderings in their CT (computed tomography) software. Julien Noel, the company's CT product manager, says the exceptional computational power afforded by CUDA and Tesla hardware is increasing customer productivity and transforming their workflow.
Read More...
For the humanities scholar who may have only recently mastered library and archival finding aids beyond the archaic card catalog, the possibility of retrieving source materials at the flash of a keystroke (well maybe a few...) is very heady stuff.
Read More...
The "cloud" model of exporting user workload and services to remote, distributed and virtual environments is emerging as a powerful computing paradigm. Yet, one domain that challenges this model in its characteristics and needs is high performance computing.
Read More...
Nov 28 | People's Daily Online | Currently under development, the Dawning 6000 HPC system will be based on the Chinese-made "Loongson" microprocessor. Read more...
Nov 27 | Computerworld | The use of supercomputers to increase the industrial might of the U.S. has amounted to little more than an asterisk from a financial standpoint in both the federal budget and the economy as a whole. Read more...
Nov 26 | Science Business | IBM is getting ready to set up a supercomputing research “collaboratory” in Dublin, Ireland. Read more...
Nov 25 | The Register | A Rice University professor believes that his proposed graphene arrays could be many times denser and faster than existing storage tech, and they'd be more reliable too. Read more...
Nov 24 | The New York Times | Server maker Super Micro Computer lives by two principles: give customers what they want, and do it as fast as humanly possible. Read more...
BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.
Get updates and insights on the High Productivity Computing industry delivered driectly to your inbox.