Eurotech Hive Takes The Sting Out Of Density

By Timothy Prickett Morgan

November 21, 2014

Back at the International Supercomputing Conference in June, supercomputer maker Eurotech dropped some hints about its future water-cooled Aurora systems that would employ a mix of ARM processors and Nvidia Tesla GPU accelerators in a dense form. At the SC14 conference this week, these machines have now been officially launched as the Aurora Hive systems, and it turns out that the systems will also allow customers to build massively parallel machines based on Intel Xeon processors and Xeon Phi coprocessors.

The Hive systems use a modular enclosure that that is based on a cubic shape rather than a hexagonal one, but the concept of densely stacking compute elements while isolating them from each other, as a beehive does, holds true. The system crams up to 128 nodes (which are called bricks) into a single rack – 64 nodes in the front and another 64 nodes in the back, which is something you can do when you use water cooling on the components of the nodes because you do not have to worry about airflow from cold to hot aisles through each rack.eurotech-aurora-hive-cross-section

The Hive system makes use of a second generation of direct hot water cooling from the Aurora line, which Fabio Gallo, Eurotech HPC business unit managing director, tells HPCwire can cool a system with 50 degree Celsius (122 degrees Fahrenheit) inlet water temperature. The new water cooling is lighter and more compact, allowing for more compute and cooling to be crammed into the same space. The water distribution system is built right into the Aurora Hive rack, and there are dripless connectors for inlet cold (relatively speaking) and outlet hot water coming off each node. Being able to take the heat away quickly and efficiently is vital because a fully configured Hive rack draws 166 kilowatts of juice.

“You can free cool this machine nearly anywhere on earth,” says Gallo. By Eurotech’s math, customers using the Aurora Hive should be able to attain a power usage effectiveness of 1.05, which is about as good as the hyperscale datacenter operators are getting. (PUE, as this metric is abbreviated, is the ratio of the power consumed by a datacenter divided by the power consumed by the compute, storage, and network components of the datacenter. Getting as close as possible to 1 is the goal.)

eurotech-hive-block-exposedThe Hive nodes are 3U high, and you can put them into a rack four across and sixteen high. (Each node is 130 mm high by 105 mm deep by 325 mm deep.) Each node has a system board that includes risers for a compute module and five coprocessor modules; this system board also includes a PCI-Express 3.0 switch from PLX Technology (now part of Avago Technologies) that links the compute and coprocessor elements to each other. The PCI-Express switch also has hooks out to network adapters, in this case a two-port FDR InfiniBand adapter from Mellanox Technologies. All of the PCI-Express slots have the full bandwidth of an x16 slot, which means Nvidia Tesla GPU and Intel Xeon Phi coprocessors can find a place.

Eurotech’s first Hive system will have a CPU compute element that is based on Intel’s “Haswell” Xeon E3-1200 v3 processors. This family of chips has four cores and clock speeds that range from 3.1 GHz to 3.7 GHz in standard versions. The Intel E3-1200 v3 compute node has 32 GB of memory welded onto it for low clearance and also has a 256 GB half-height 1.8-inch solid state disk drive. You can use any E3-1200 v3 chip that has a thermal design point of 84 watts or lower.

The compute brick allows for up to four coprocessors to be fitted with cold plates for sucking the heat off their components and linked to each one of the cores over the PCI-Express switch and into the PCI-Express controllers on the E3-1200 processors. Gallo tells HPCwire that it will ship the Xeon E3-1200 plus Xeon Phi configuration in a few weeks to initial customers, and that a few months after that the combination of the Xeon E3 processor and Nvidia’s Tesla K40 coprocessor will be supported. The Xeon Phi 7120X is rated at 1.2 teraflops doing double precision floating point math, while the Tesla K40 card has a base performance of 1.43 teraflops that can rise to 1.66 teraflops with GPU Boost overclocking turned on. That works out to 614 teraflops per rack with Xeon Phis and 732 teraflops per rack with the Tesla K40s (not counting the extra performance from GPU Boost).

eurotech-hive-rack_openBack in June at ISC, Eurotech was talking up the Hive system (which did not yet have that name) by saying that it would be delivering a variant of the system that would marry a 64-bit ARM processor from Applied Micro with Tesla GPU coprocessors, and you might have gotten the impression that this would come out first. While Applied Micro is shipping its “Storm” X-Gene 1 chip now, it is readying the much-better “Shadowcat” X-Gene 2 processor, which has been sampling since August. This chip will support the RDMA over Converged Ethernet (RoCE) protocol over its integrated Ethernet network interface cards, simplifying the components that go into an ARM server node. The X-Gene 1 and 2 chips have two 10 Gb/sec Ethernet ports on the die, and these can be hooked eight into adapter ports. That, in theory, leaves more room for other peripherals in the complex. The plan is to ship the X-Gene 2 as the ARM option for the CPU side of the hybrid node, along with the Tesla K40 cards as coprocessors, sometime around the second quarter of 2015.

Incidentally, Eurotech is able to get its hands on a modified Tesla K40 card with its thermal plates modified so it fits into the super-skinny Hive module. The new Tesla K80 coprocessor card, announced this week at SC14, will be a bit tricky to add to the Aurora Hive system, explains Gallo, because this dual-GPU card has some of its power connectors across the top of the card. This does not work with the very tight tolerances in the Hive module, which are necessitated by the thermal conduction plates. With the Tesla K80 offering a base 1.87 teraflops of double precision math with a GPU Boost of up to 2.91 teraflops, you can bet some customers will want this. Gallo says that there is enough thermal capacity to pull the heat off this 300 watt part, if the connectors can be sorted. Being able to double the flops in the box is a pretty strong motivator to solve this engineering problem.

Generally speaking, the X86 processor option plus either the Xeon Phi or Tesla GPU accelerators draws about 1,500 watts per node, which works out to around 5 gigaflops per watt. The top machines on the Green500 ranking of supercomputers are in the range of 4 gigaflops per watt.

Gallo is tight lipped about what other processing components it might add to the Aurora Hive system, but obviously next year’s “Knights Landing” Xeon Phi, which will be sold as a standalone processor as well as a PCI-Express coprocessor, will slide right into this system. At 3 teraflops of double-precision floating point performance, and with the ability to put in five cards, this will be a radical increase in the math capabilities. And for dense-packed, CPU only workloads that used low-speed Ethernet, Eurotech could make Hive bricks that are just based on Xeon E3 or various ARM processors which sport their own networking on the chip. If you take out the network card, that leaves room for six CPU-only compute cards per module, or 768 processors per rack. Another option would be to add cards that have flash drives with the high-speed, low-latency NVM Express protocol linking into that PCI-Express switch. You could also swap out some of the flash drives and put in GPU cards for visualization to do visualization in the same nodes where the data is stored. Eurotech has lots of options with the Aurora Hive architecture, and that is so by design.

But initially at least, Eurotech is going after the workloads that have been accelerated. “There are markets where accelerated application have become the norm instead of an exotic thing,” says Gallo. “Geosciences, particularly reverse time migration reservoir analysis, is a good example. In general, signal processing will be interesting on this system, as well be machine learning, analytics, and some computer-aided engineering tools that have been modified for accelerators.”

The Aurora Hive comes preconfigured with the CentOS 6.X variant of Linux and support from Eurotech for this distribution, but customers can deploy other Linux operating systems on the machine as needed. Scientific Linux, Red Hat Enterprise Linux, SUSE Linux Enterprise Server, and Canonical Ubuntu Server are all supported. The Aurora software stack includes support for Intel Cluster Studio, Nvidia CUDA, MPSS, and the GCC compilers as well as the Intel MPI, Open MPI, and MVAPICH2 communication libraries.

Pricing for the Aurora Hive system was not available, and the question is what kind of premium can Eurotech charge for density and hot water cooling. The combination of the two should allow Eurotech to command a premium for its systems over plain vanilla clusters based on rack or blade servers, but it is a question as to how much. The market will decide.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

ISC 2019 Student Cluster Competition: Meet the Teams!

June 25, 2019

Finally! The videos have been rendered, the statistics compiled, and the story lines set. It’s time to share with you the incredible event that was the ISC 2019 Student Cluster Competition. So what’s a Student Clu Read more…

By Dan Olds

What’s New in HPC Research: Rock Art, Protein Design, Genome Assembly & More

June 25, 2019

In this bimonthly feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From parallel programming to exascale to quantum computing, the details are here. Read more…

By Oliver Peckham

Azure Benchmarks HC-series Across 20,000 cores for HPC

June 25, 2019

Cloud provider Microsoft Azure’s push into HPC continues to gain momentum. In a blog last week, Evan Burness, principal program manager, Azure HPC, announced HC-series Virtual Machine are now available in West US 2 and Read more…

By John Russell

HPE Extreme Performance Solutions

HPE and Intel® Omni-Path Architecture: How to Power a Cloud

Learn how HPE and Intel® Omni-Path Architecture provide critical infrastructure for leading Nordic HPC provider’s HPCFLOW cloud service.

For decades, HPE has been at the forefront of high-performance computing, and we’ve powered some of the fastest and most robust supercomputers in the world. Read more…

IBM Accelerated Insights

Rediscovering the Value of the Past

Some people would like to forget their past, perhaps for good reasons. But for business or research organizations, preserving institutional memory can be the key to thriving in the future. Read more…

MLPerf Expands Toolset; Launches Inferencing Suite

June 24, 2019

MLPerf today launched a benchmark suite for inferencing, v0.5, which joins the MLPerf training suite launched a little over a year ago. The new inferencing benchmark, which has been anticipated, covers models applicable Read more…

By John Russell

ISC 2019 Student Cluster Competition: Meet the Teams!

June 25, 2019

Finally! The videos have been rendered, the statistics compiled, and the story lines set. It’s time to share with you the incredible event that was the ISC 20 Read more…

By Dan Olds

MLPerf Expands Toolset; Launches Inferencing Suite

June 24, 2019

MLPerf today launched a benchmark suite for inferencing, v0.5, which joins the MLPerf training suite launched a little over a year ago. The new inferencing benc Read more…

By John Russell

Is Weather and Climate Prediction the Perfect ‘Pilot’ for Exascale?

June 21, 2019

At ISC 2019 this week, Peter Bauer – deputy director of research for the European Centre for Medium-Range Weather Forecasts (ECMWF) – outlined an ambitious Read more…

By Oliver Peckham

ISC Keynote: Thomas Sterling’s Take on Whither HPC

June 20, 2019

Entertaining, insightful, and unafraid to launch the occasional verbal ICBM, HPC pioneer Thomas Sterling delivered his 16th annual closing keynote at ISC yesterday. He explored, among other things: exascale machinations; quantum’s bubbling money pot; Arm’s new HPC viability; Europe’s... Read more…

By John Russell

IBM Claims No. 1 Commercial Supercomputer with Total Oil & Gas System 

June 20, 2019

IBM can now boast not only the two most powerful supercomputers in the world, it also has claimed the top spot for a supercomputer used in a commercial setting. Read more…

By Staff Report

HPC on Pace for 5-Year 6.8% CAGR; Guess Which Hyperscaler Spent $10B on IT Last Year?

June 20, 2019

In the neck-and-neck horse race for HPC server market share, HPE has hung on to a slim, shrinking lead over Dell EMC – but if server and storage market shares Read more…

By Doug Black

ISC 2019 Research Paper Award Winners Announced

June 19, 2019

At the 2019 International Supercomputing Conference (ISC) in Frankfurt this week, the ISC committee awarded the event's top prizes for outstanding research pape Read more…

By Oliver Peckham

ISC Keynote: The Algorithms of Life – Scientific Computing for Systems Biology

June 19, 2019

Systems biology has existed loosely under many definitions for a couple of decades. It’s the notion of describing living systems using first-principle physics Read more…

By John Russell

High Performance (Potato) Chips

May 5, 2006

In this article, we focus on how Procter & Gamble is using high performance computing to create some common, everyday supermarket products. Tom Lange, a 27-year veteran of the company, tells us how P&G models products, processes and production systems for the betterment of consumer package goods. Read more…

By Michael Feldman

Cray, AMD to Extend DOE’s Exascale Frontier

May 7, 2019

Cray and AMD are coming back to Oak Ridge National Laboratory to partner on the world’s largest and most expensive supercomputer. The Department of Energy’s Read more…

By Tiffany Trader

Graphene Surprises Again, This Time for Quantum Computing

May 8, 2019

Graphene is fascinating stuff with promise for use in a seeming endless number of applications. This month researchers from the University of Vienna and Institu Read more…

By John Russell

Why Nvidia Bought Mellanox: ‘Future Datacenters Will Be…Like High Performance Computers’

March 14, 2019

“Future datacenters of all kinds will be built like high performance computers,” said Nvidia CEO Jensen Huang during a phone briefing on Monday after Nvidia revealed scooping up the high performance networking company Mellanox for $6.9 billion. Read more…

By Tiffany Trader

AMD Verifies Its Largest 7nm Chip Design in Ten Hours

June 5, 2019

AMD announced last week that its engineers had successfully executed the first physical verification of its largest 7nm chip design – in just ten hours. The AMD Radeon Instinct Vega20 – which boasts 13.2 billion transistors – was tested using a TSMC-certified Calibre nmDRC software platform from Mentor. Read more…

By Oliver Peckham

It’s Official: Aurora on Track to Be First US Exascale Computer in 2021

March 18, 2019

The U.S. Department of Energy along with Intel and Cray confirmed today that an Intel/Cray supercomputer, "Aurora," capable of sustained performance of one exaf Read more…

By Tiffany Trader

Deep Learning Competitors Stalk Nvidia

May 14, 2019

There is no shortage of processing architectures emerging to accelerate deep learning workloads, with two more options emerging this week to challenge GPU leader Nvidia. First, Intel researchers claimed a new deep learning record for image classification on the ResNet-50 convolutional neural network. Separately, Israeli AI chip startup Hailo.ai... Read more…

By George Leopold

TSMC and Samsung Moving to 5nm; Whither Moore’s Law?

June 12, 2019

With reports that Taiwan Semiconductor Manufacturing Co. (TMSC) and Samsung are moving quickly to 5nm manufacturing, it’s a good time to again ponder whither goes the venerable Moore’s law. Shrinking feature size has of course been the primary hallmark of achieving Moore’s law... Read more…

By John Russell

Leading Solution Providers

ISC 2019 Virtual Booth Video Tour

CRAY
CRAY
DDN
DDN
DELL EMC
DELL EMC
ONE STOP SYSTEMS
ONE STOP SYSTEMS
PANASAS
PANASAS
VERNE GLOBAL
VERNE GLOBAL

Nvidia Embraces Arm, Declares Intent to Accelerate All CPU Architectures

June 17, 2019

As the Top500 list was being announced at ISC in Frankfurt today with an upgraded petascale Arm supercomputer in the top third of the list, Nvidia announced its Read more…

By Tiffany Trader

The Case Against ‘The Case Against Quantum Computing’

January 9, 2019

It’s not easy to be a physicist. Richard Feynman (basically the Jimi Hendrix of physicists) once said: “The first principle is that you must not fool yourse Read more…

By Ben Criger

Top500 Purely Petaflops; US Maintains Performance Lead

June 17, 2019

With the kick-off of the International Supercomputing Conference (ISC) in Frankfurt this morning, the 53rd Top500 list made its debut, and this one's for petafl Read more…

By Tiffany Trader

Cray – and the Cray Brand – to Be Positioned at Tip of HPE’s HPC Spear

May 22, 2019

More so than with most acquisitions of this kind, HPE’s purchase of Cray for $1.3 billion, announced last week, seems to have elements of that overused, often Read more…

By Doug Black and Tiffany Trader

Intel Launches Cascade Lake Xeons with Up to 56 Cores

April 2, 2019

At Intel's Data-Centric Innovation Day in San Francisco (April 2), the company unveiled its second-generation Xeon Scalable (Cascade Lake) family and debuted it Read more…

By Tiffany Trader

Announcing four new HPC capabilities in Google Cloud Platform

April 15, 2019

When you’re running compute-bound or memory-bound applications for high performance computing or large, data-dependent machine learning training workloads on Read more…

By Wyatt Gorman, HPC Specialist, Google Cloud; Brad Calder, VP of Engineering, Google Cloud; Bart Sano, VP of Platforms, Google Cloud

In Wake of Nvidia-Mellanox: Xilinx to Acquire Solarflare

April 25, 2019

With echoes of Nvidia’s recent acquisition of Mellanox, FPGA maker Xilinx has announced a definitive agreement to acquire Solarflare Communications, provider Read more…

By Doug Black

Nvidia Claims 6000x Speed-Up for Stock Trading Backtest Benchmark

May 13, 2019

A stock trading backtesting algorithm used by hedge funds to simulate trading variants has received a massive, GPU-based performance boost, according to Nvidia, Read more…

By Doug Black

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This