InfiniBand Delivers Best Return on Investment

May 29, 2017

Higher return on investment of up to 250 percent demonstrated on various high-performance computing applications; InfiniBand delivers up to 55 percent higher performance vs. Omni-Path using half the infrastructure

 

The latest revolution in high-performance computing (HPC) is the move to a co-design architecture: a collaborative effort among industry thought leaders, academia, and manufacturers to reach Exascale performance through a holistic, system-level approach to fundamental performance improvements. A co-design architecture improves system efficiency and optimizes performance by creating synergies between the hardware and the software, as well as among the different hardware elements within the data center.

Industry-wide, it is recognized that the CPU has reached the limits of its scalability. This has created a need for the intelligent network to act as a “co-processor,” sharing the responsibility for handling and accelerating application workloads. By placing computation for data-related algorithms on an intelligent network, it is possible to dramatically improve data center and application performance and scalability.

The new generation of smart interconnect solutions is based on a data-centric architecture, which can offload all network functions from the CPU to the network and perform computation in transit, freeing up CPU cycles and subsequently increasing the system’s efficiency. With this new architecture, the interconnect supports the management and execution of more data algorithms within the network, allowing users to run algorithms on data as it moves through the system interconnect rather than waiting for the data to reach the CPU. Smart interconnect solutions can now deliver both In-Network Computing and In-Network Memory, representing the industry’s most advanced approach to achieving performance and scalability for high-performance cluster systems.
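
To make the pattern concrete, here is a minimal MPI sketch in C (an illustration added for this discussion, not code from the article). It shows the overlap that in-network computing enables: a non-blocking allreduce whose reduction a capable fabric can carry out inside the network while the host keeps computing. The buffer size and the local_work() step are illustrative placeholders.

```c
/* Minimal MPI sketch (C): a non-blocking allreduce whose reduction an
 * in-network-computing fabric can execute inside the switches while the
 * host continues computing. The offload itself is transparent to the
 * application; this only shows the overlap pattern that benefits from it.
 */
#include <mpi.h>
#include <stdio.h>

#define N 1024

static void local_work(double *buf, int n) {
    /* Placeholder for computation that overlaps the in-flight reduction. */
    for (int i = 0; i < n; i++)
        buf[i] *= 1.0001;
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    double send[N], recv[N], other[N];
    for (int i = 0; i < N; i++) { send[i] = rank + i; other[i] = i; }

    MPI_Request req;
    /* Start the reduction; with in-network computing the aggregation
       proceeds in the fabric rather than on the host CPUs. */
    MPI_Iallreduce(send, recv, N, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD, &req);

    local_work(other, N);              /* CPU cycles freed for application work */

    MPI_Wait(&req, MPI_STATUS_IGNORE); /* reduction result now in recv */

    if (rank == 0)
        printf("recv[0] = %f\n", recv[0]);

    MPI_Finalize();
    return 0;
}
```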

Mellanox hardware-based acceleration technologies such as SHARP (Scalable Hierarchical Aggregation and Reduction Protocol) for offloading data reduction and data aggregation protocols, hardware-based MPI tag matching, and MPI rendezvous offload are just a few of the solutions that work together to offload a significant amount of inter-process communication-related computation, enabling algorithms to process data as it moves.
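
As a sketch of what hardware tag matching and rendezvous offload act on, consider the following minimal MPI example in C (again an illustration added here, not code from the article). Receives for large, rendezvous-sized messages are pre-posted with distinct tags; matching arriving messages against those tags and driving the large-message handshake is exactly the work such adapters can take over from the CPU. The message size and tag values are arbitrary.

```c
/* Minimal MPI sketch (C): pre-posted, tagged receives for large,
 * rendezvous-sized messages. Hardware MPI tag matching can match each
 * arriving message against the posted list, and rendezvous offload can
 * drive the large-message handshake, both without host CPU involvement.
 * The offloads are transparent to this code. Run with at least two
 * ranks, e.g. `mpirun -np 2 ./a.out`.
 */
#include <mpi.h>
#include <stdlib.h>

#define COUNT (1 << 20)   /* 8 MB of doubles: large enough for rendezvous */
#define TAG_A 100
#define TAG_B 200

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    if (size < 2) {
        MPI_Finalize();
        return 1;
    }

    double *a = malloc(COUNT * sizeof(double));
    double *b = malloc(COUNT * sizeof(double));

    if (rank == 1) {
        /* Post receives first so the adapter can match incoming tags in
           hardware instead of buffering unexpected messages. */
        MPI_Request reqs[2];
        MPI_Irecv(a, COUNT, MPI_DOUBLE, 0, TAG_A, MPI_COMM_WORLD, &reqs[0]);
        MPI_Irecv(b, COUNT, MPI_DOUBLE, 0, TAG_B, MPI_COMM_WORLD, &reqs[1]);
        MPI_Waitall(2, reqs, MPI_STATUSES_IGNORE);
    } else if (rank == 0) {
        for (int i = 0; i < COUNT; i++) { a[i] = i; b[i] = -i; }
        MPI_Send(a, COUNT, MPI_DOUBLE, 1, TAG_A, MPI_COMM_WORLD);
        MPI_Send(b, COUNT, MPI_DOUBLE, 1, TAG_B, MPI_COMM_WORLD);
    }

    free(a);
    free(b);
    MPI_Finalize();
    return 0;
}
```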

Figure 1 – Data-Centric Architecture: Transition from CPU-Centric to Data-Centric to Overcome Latency Bottlenecks

The performance and scalability advantages of Mellanox interconnect solutions over Intel’s Omni-Path based solutions have been demonstrated across various applications. Testing was conducted at different sites on production systems, comparing an InfiniBand EDR cluster to an Omni-Path connected cluster. The InfiniBand cluster includes servers with dual-socket, 16-core Intel Xeon E5-2697A v4 CPUs at 2.60 GHz. The Omni-Path cluster includes servers with dual-socket, 18-core Intel Xeon E5-2697 v4 CPUs at 2.30 GHz. Despite the small difference in CPU frequencies, the scaling performance of the two clusters can reasonably be compared. As the following two cases clearly demonstrate, InfiniBand offers dramatically higher performance and a lower total cost of ownership.

Case I: NAMD

NAMD is a molecular dynamics application for chemistry and chemical biology. Figure 2 below shows test results for the standard ApoA1 benchmark of NAMD. As can be seen, a 64-node InfiniBand cluster delivered an impressive 250 percent higher performance than a 64-node Omni-Path cluster. Furthermore, when the same benchmark was run on an InfiniBand cluster with half the number of servers (32 nodes), the InfiniBand cluster still delivered 55 percent higher performance than the 64-node Omni-Path cluster.

Figure 2 – InfiniBand vs. Omni-Path Performance Comparison over NAMD

Case II: GROMACS

GROMACS is a molecular dynamics package used for simulations of proteins, lipids, and nucleic acids. Figure 3 below shows test results for an industry-standard benchmark simulation of lignocellulose. As can be seen, a 128-node InfiniBand cluster delivered 136 percent higher performance than a 128-node Omni-Path cluster. Furthermore, when the same benchmark was run on an InfiniBand cluster with half the number of servers (64 nodes), the InfiniBand cluster still delivered 33 percent higher performance than the 128-node Omni-Path cluster.

Figure 3 – InfiniBand vs. Omni-Path Performance Comparison over GROMACS

Both applications require fast and efficient inter-process communication. The ability of InfiniBand to run a large portion of the MPI communication layer within the network greatly boosts the performance and scalability attainable from the HPC infrastructure. In both test cases, InfiniBand delivered higher performance than Omni-Path for the same-sized cluster job (250 percent higher for NAMD, and 136 percent higher for GROMACS). Equally important, in both cases InfiniBand delivered higher performance with only half the number of servers: for NAMD, a 32-node InfiniBand cluster delivered 55 percent higher performance than a 64-node Omni-Path cluster; and for GROMACS, a 64-node InfiniBand cluster delivered 33 percent higher performance than a 128-node Omni-Path cluster.
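
As a quick aid to reading these figures, the short sketch below (in C, with hypothetical throughput numbers rather than the measured results) shows the convention the percentages follow: “N percent higher performance” corresponds to a throughput ratio of 1 + N/100.

```c
/* How "N percent higher performance" relates two throughput figures:
 * N = (perf_a / perf_b - 1) * 100, i.e. a ratio of 1 + N/100.
 * The ns/day values below are hypothetical, not the measured results.
 */
#include <stdio.h>

static double percent_higher(double perf_a, double perf_b) {
    return (perf_a / perf_b - 1.0) * 100.0;
}

int main(void) {
    double ib = 3.5, opa = 1.0;  /* hypothetical ns/day throughputs */
    printf("%.0f percent higher\n", percent_higher(ib, opa));  /* prints 250 */
    return 0;
}
```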

Mellanox has more than 17 years of experience designing high-speed communication fabrics. Today, Mellanox is a leading supplier of end-to-end Ethernet and InfiniBand intelligent interconnect solutions and services for servers, storage, and hyper-converged infrastructure. Mellanox intelligent interconnect solutions increase data center efficiency by providing the highest throughput and lowest latency, delivering data faster to applications and unlocking system performance. For more information, please visit: http://www.mellanox.com/solutions/hpc/.
