Still on the InfiniBandwagon

By Michael Feldman

March 26, 2009

In the realm of datacenter interconnects, much of the IT industry continues to be focused on the rollout of 10 Gigabit Ethernet offerings, with a raft of switches, adapters and other 10GigE paraphernalia having made its way into the marketplace over the past 18 months. Cisco’s recent foray into the datacenter, for example, will be premised on 10GigE-based blades. This next generation of Ethernet products will not only bring higher bandwidth and lower latencies, but also lossless fabrics suitable for both compute and storage interconnects.

But despite all the hoopla over 10GigE, InfiniBand continues to be the interconnect that excites the HPC crowd. The majority of new HPC systems of note all seem to be InfiniBand-based. The most prominent example of an Ethernet-based system is the ATLAS cluster at the Max Planck Institute for Gravitational Physics in Germany, which we reported on last year. From a performance standpoint, the choice between Ethernet and InfiniBand is not so much a bandwidth issue — multiple 10GigE links can always be aggregated to achieve InfiniBand-like bandwidth — as a latency one. Today, even the most capable 10GigE implementations have higher latencies than InfiniBand, and it is this attribute that many HPC workloads find indispensable.

A recent market study by Tabor Research points to InfiniBand’s continued popularity in the HPC space. Citing an August 2008 site survey, the Taborites found that 60 percent of HPC systems installed since the start of 2007 were employing InfiniBand as a system interconnect. That’s a much bigger percentage than you see on the latest TOP500 list, where only 28 percent are InfiniBand-based versus 56 percent for Ethernet — the remainder being a smattering of proprietary interconnects. In fact, it’s probable that the majority of these really big Ethernet-connected clusters are running loosely-coupled parallel applications, rather than latency-sensitive HPC workloads. It’s notable that as of November 2008, no TOP500 systems were using 10GigE.

More importantly, InfiniBand usage in HPC is growing. According to the same Tabor Research survey, in 2006 the proportion of HPC systems employing InfiniBand and Ethernet were about equal. It was in 2007 that InfiniBand jumped into the lead. With QDR IB (40 Gbps) expected to hit its stride in 2009, InfiniBand should consolidate its lead in the HPC interconnect market. InfiniBand has also made some inroads into more traditional enterprise applications, most notably in the HP-Oracle database machine. Time will tell whether this is just an outlier or the beginning of a wider trend.

Mellanox continues to be the dominant vendor in the InfiniBand marketplace, having recently added switches and gateways to its adapter and silicon business. But with QLogic now offering home-grown InfiniBand ASICs alongside its own switches and HCAs, HPC system vendors will have a wider choice of interconnect options. Although this introduces an element of competition, Tabor Research believes that the InfiniBand market is now big enough for two vendors to succeed. Considering that Mellanox enjoyed record revenues through the front end of the recession — $107.7 million in FY2008 — this seems like a fair assessment.

InfiniBand’s success in HPC doesn’t seem to quiet the naysayers, though. The Ethernet drumbeat that pervades the industry invariably leads to press coverage that casts InfiniBand as an endangered technology. Chris Mellor’s recent piece in The Register, titled InfiniBand: Caught in the Ethernet meatgrinder, sounds ominous, but the main thrust of that article is actually about fabric convergence and how Ethernet and InfiniBand are learning to co-exist.

In fact, converged fabrics are likely to be the real story of datacenter interconnects over the next several years, as vendors look to accommodate multiple networking, clustering and storage communication protocols on top of lossless communication technologies like InfiniBand and RDMA Ethernet. It’s not surprising that the major InfiniBand vendors — Mellanox, QLogic and Voltaire — have developed converged fabric offerings in various flavors, and Ethernet vendors are layering protocols like Fibre Channel on top of lossless Ethernet.

The whole process resembles the convergence of RISC and CISC technologies in the microprocessor arena. There, instead of one architecture killing off the other one, Intel was able to maintain the dominance of its legacy x86 CISC ISA by incorporating a RISC-like core underneath the covers. Meanwhile, true RISC processors found other markets to play in. Ethernet and InfiniBand look like they’re on a similar trajectory.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Glimpses of Today’s Total Solar Eclipse

August 21, 2017

Here are a few arresting images posted by NASA of today’s total solar eclipse. Such astronomical events have always captured our imagination and it’s not hard to understand why such occurrences were often greeted wit Read more…

By John Russell

Tech Giants Outline Battle Plans for Future HPC Market

August 21, 2017

Four companies engaged in a cage fight for leadership in the emerging HPC market of the 2020s are, despite deep differences in some areas, in violent agreement on at least one thing: the power consumption and latency pen Read more…

By Doug Black

Geospatial Data Research Leverages GPUs

August 17, 2017

MapD Technologies, the GPU-accelerated database specialist, said it is working with university researchers on leveraging graphics processors to advance geospatial analytics. The San Francisco-based company is collabor Read more…

By George Leopold

HPE Extreme Performance Solutions

Leveraging Deep Learning for Fraud Detection

Advancements in computing technologies and the expanding use of e-commerce platforms have dramatically increased the risk of fraud for financial services companies and their customers. Read more…

Intel, NERSC and University Partners Launch New Big Data Center

August 17, 2017

A collaboration between the Department of Energy’s National Energy Research Scientific Computing Center (NERSC), Intel and five Intel Parallel Computing Centers (IPCCs) has resulted in a new Big Data Center (BDC) that Read more…

By Linda Barney

Tech Giants Outline Battle Plans for Future HPC Market

August 21, 2017

Four companies engaged in a cage fight for leadership in the emerging HPC market of the 2020s are, despite deep differences in some areas, in violent agreement Read more…

By Doug Black

Microsoft Bolsters Azure With Cloud HPC Deal

August 15, 2017

Microsoft has acquired cloud computing software vendor Cycle Computing in a move designed to bring orchestration tools along with high-end computing access capabilities to the cloud. Terms of the acquisition were not disclosed. Read more…

By George Leopold

HPE Ships Supercomputer to Space Station, Final Destination Mars

August 14, 2017

With a manned mission to Mars on the horizon, the demand for space-based supercomputing is at hand. Today HPE and NASA sent the first off-the-shelf HPC system i Read more…

By Tiffany Trader

AMD EPYC Video Takes Aim at Intel’s Broadwell

August 14, 2017

Let the benchmarking begin. Last week, AMD posted a YouTube video in which one of its EPYC-based systems outperformed a ‘comparable’ Intel Broadwell-based s Read more…

By John Russell

Deep Learning Thrives in Cancer Moonshot

August 8, 2017

The U.S. War on Cancer, certainly a worthy cause, is a collection of programs stretching back more than 40 years and abiding under many banners. The latest is t Read more…

By John Russell

IBM Raises the Bar for Distributed Deep Learning

August 8, 2017

IBM is announcing today an enhancement to its PowerAI software platform aimed at facilitating the practical scaling of AI models on today’s fastest GPUs. Scal Read more…

By Tiffany Trader

IBM Storage Breakthrough Paves Way for 330TB Tape Cartridges

August 3, 2017

IBM announced yesterday a new record for magnetic tape storage that it says will keep tape storage density on a Moore's law-like path far into the next decade. Read more…

By Tiffany Trader

AMD Stuffs a Petaflops of Machine Intelligence into 20-Node Rack

August 1, 2017

With its Radeon “Vega” Instinct datacenter GPUs and EPYC “Naples” server chips entering the market this summer, AMD has positioned itself for a two-head Read more…

By Tiffany Trader

How ‘Knights Mill’ Gets Its Deep Learning Flops

June 22, 2017

Intel, the subject of much speculation regarding the delayed, rewritten or potentially canceled “Aurora” contract (the Argonne Lab part of the CORAL “ Read more…

By Tiffany Trader

Nvidia’s Mammoth Volta GPU Aims High for AI, HPC

May 10, 2017

At Nvidia's GPU Technology Conference (GTC17) in San Jose, Calif., this morning, CEO Jensen Huang announced the company's much-anticipated Volta architecture a Read more…

By Tiffany Trader

Reinders: “AVX-512 May Be a Hidden Gem” in Intel Xeon Scalable Processors

June 29, 2017

Imagine if we could use vector processing on something other than just floating point problems.  Today, GPUs and CPUs work tirelessly to accelerate algorithms Read more…

By James Reinders

Russian Researchers Claim First Quantum-Safe Blockchain

May 25, 2017

The Russian Quantum Center today announced it has overcome the threat of quantum cryptography by creating the first quantum-safe blockchain, securing cryptocurrencies like Bitcoin, along with classified government communications and other sensitive digital transfers. Read more…

By Doug Black

Quantum Bits: D-Wave and VW; Google Quantum Lab; IBM Expands Access

March 21, 2017

For a technology that’s usually characterized as far off and in a distant galaxy, quantum computing has been steadily picking up steam. Just how close real-wo Read more…

By John Russell

Nvidia Responds to Google TPU Benchmarking

April 10, 2017

Nvidia highlights strengths of its newest GPU silicon in response to Google's report on the performance and energy advantages of its custom tensor processor. Read more…

By Tiffany Trader

Groq This: New AI Chips to Give GPUs a Run for Deep Learning Money

April 24, 2017

CPUs and GPUs, move over. Thanks to recent revelations surrounding Google’s new Tensor Processing Unit (TPU), the computing world appears to be on the cusp of Read more…

By Alex Woodie

HPC Compiler Company PathScale Seeks Life Raft

March 23, 2017

HPCwire has learned that HPC compiler company PathScale has fallen on difficult times and is asking the community for help or actively seeking a buyer for its a Read more…

By Tiffany Trader

Leading Solution Providers

Trump Budget Targets NIH, DOE, and EPA; No Mention of NSF

March 16, 2017

President Trump’s proposed U.S. fiscal 2018 budget issued today sharply cuts science spending while bolstering military spending as he promised during the cam Read more…

By John Russell

Google Debuts TPU v2 and will Add to Google Cloud

May 25, 2017

Not long after stirring attention in the deep learning/AI community by revealing the details of its Tensor Processing Unit (TPU), Google last week announced the Read more…

By John Russell

CPU-based Visualization Positions for Exascale Supercomputing

March 16, 2017

In this contributed perspective piece, Intel’s Jim Jeffers makes the case that CPU-based visualization is now widely adopted and as such is no longer a contrarian view, but is rather an exascale requirement. Read more…

By Jim Jeffers, Principal Engineer and Engineering Leader, Intel

Six Exascale PathForward Vendors Selected; DoE Providing $258M

June 15, 2017

The much-anticipated PathForward awards for hardware R&D in support of the Exascale Computing Project were announced today with six vendors selected – AMD Read more…

By John Russell

Top500 Results: Latest List Trends and What’s in Store

June 19, 2017

Greetings from Frankfurt and the 2017 International Supercomputing Conference where the latest Top500 list has just been revealed. Although there were no major Read more…

By Tiffany Trader

IBM Clears Path to 5nm with Silicon Nanosheets

June 5, 2017

Two years since announcing the industry’s first 7nm node test chip, IBM and its research alliance partners GlobalFoundries and Samsung have developed a proces Read more…

By Tiffany Trader

Messina Update: The US Path to Exascale in 16 Slides

April 26, 2017

Paul Messina, director of the U.S. Exascale Computing Project, provided a wide-ranging review of ECP’s evolving plans last week at the HPC User Forum. Read more…

By John Russell

Graphcore Readies Launch of 16nm Colossus-IPU Chip

July 20, 2017

A second $30 million funding round for U.K. AI chip developer Graphcore sets up the company to go to market with its “intelligent processing unit” (IPU) in Read more…

By Tiffany Trader

  • arrow
  • Click Here for More Headlines
  • arrow
Share This