Eurotech Hive Takes The Sting Out Of Density

By Timothy Prickett Morgan

November 21, 2014

Back at the International Supercomputing Conference in June, supercomputer maker Eurotech dropped some hints about its future water-cooled Aurora systems that would employ a mix of ARM processors and Nvidia Tesla GPU accelerators in a dense form. At the SC14 conference this week, these machines have now been officially launched as the Aurora Hive systems, and it turns out that the systems will also allow customers to build massively parallel machines based on Intel Xeon processors and Xeon Phi coprocessors.

The Hive systems use a modular enclosure that is based on a cubic shape rather than a hexagonal one, but the concept of densely stacking compute elements while isolating them from each other, as a beehive does, holds true. The system crams up to 128 nodes (which are called bricks) into a single rack: 64 nodes in the front and another 64 nodes in the back. That is something you can do when you use water cooling on the components of the nodes, because you do not have to worry about airflow from cold to hot aisles through each rack.

The Hive system makes use of a second generation of direct hot water cooling from the Aurora line, which Fabio Gallo, Eurotech HPC business unit managing director, tells HPCwire can cool a system with 50 degree Celsius (122 degrees Fahrenheit) inlet water temperature. The new water cooling is lighter and more compact, allowing for more compute and cooling to be crammed into the same space. The water distribution system is built right into the Aurora Hive rack, and there are dripless connectors for inlet cold (relatively speaking) and outlet hot water coming off each node. Being able to take the heat away quickly and efficiently is vital because a fully configured Hive rack draws 166 kilowatts of juice.

“You can free cool this machine nearly anywhere on earth,” says Gallo. By Eurotech’s math, customers using the Aurora Hive should be able to attain a power usage effectiveness of 1.05, which is about as good as the hyperscale datacenter operators are getting. (PUE, as this metric is abbreviated, is the total power consumed by a datacenter divided by the power consumed by its compute, storage, and network components. Getting as close as possible to 1 is the goal.)
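To make the PUE definition concrete, here is a minimal Python sketch. It assumes, purely for illustration, that the 166 kilowatt rack figure quoted above is entirely IT load; the function names are hypothetical and are not part of any Eurotech tooling.

```python
def pue(total_facility_kw: float, it_equipment_kw: float) -> float:
    """Power usage effectiveness: total facility power over IT power."""
    if it_equipment_kw <= 0:
        raise ValueError("IT equipment power must be positive")
    return total_facility_kw / it_equipment_kw


def overhead_kw(it_equipment_kw: float, target_pue: float) -> float:
    """Non-IT power (cooling, power distribution) implied by a given PUE."""
    return it_equipment_kw * (target_pue - 1.0)


# Assumption: one fully configured Hive rack at 166 kW counts wholly as IT load.
RACK_IT_KW = 166.0
TARGET_PUE = 1.05

extra = overhead_kw(RACK_IT_KW, TARGET_PUE)
print(f"Overhead at PUE {TARGET_PUE}: about {extra:.1f} kW per rack")
print(f"Check: PUE = {pue(RACK_IT_KW + extra, RACK_IT_KW):.2f}")
```

Under that assumption, a PUE of 1.05 leaves only about 8.3 kilowatts per rack for cooling and power distribution overhead.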

The Hive nodes are 3U high, and you can put them into a rack four across and sixteen high. (Each node is 130 mm high by 105 mm wide by 325 mm deep.) Each node has a system board that includes risers for a compute module and five coprocessor modules; this system board also includes a PCI-Express 3.0 switch from PLX Technology (now part of Avago Technologies) that links the compute and coprocessor elements to each other. The PCI-Express switch also has hooks out to network adapters, in this case a two-port FDR InfiniBand adapter from Mellanox Technologies. All of the PCI-Express slots have the full bandwidth of an x16 slot, which means Nvidia Tesla GPU and Intel Xeon Phi coprocessors can find a place.
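The rack geometry described above can be checked with a few lines of arithmetic. This is only a sketch built from the figures in the article (four across, sixteen high, bricks on both faces, 3U per brick); the constant and function names are illustrative.

```python
BRICKS_ACROSS = 4     # bricks side by side on one face of the rack
BRICKS_HIGH = 16      # bricks stacked vertically on one face
RACK_FACES = 2        # bricks are loaded from both the front and the back
BRICK_HEIGHT_U = 3    # each brick is 3U high


def bricks_per_face() -> int:
    """Bricks on one face of the rack."""
    return BRICKS_ACROSS * BRICKS_HIGH


def bricks_per_rack() -> int:
    """Bricks on the front and back faces combined."""
    return bricks_per_face() * RACK_FACES


def stacked_height_u() -> int:
    """Vertical rack units taken up by one column of bricks."""
    return BRICKS_HIGH * BRICK_HEIGHT_U


print(bricks_per_face())   # 64
print(bricks_per_rack())   # 128
print(stacked_height_u())  # 48
```

The result matches the 64 front plus 64 back nodes that Eurotech quotes, with each column of bricks filling 48U of vertical space.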

Eurotech’s first Hive system will have a CPU compute element that is based on Intel’s “Haswell” Xeon E3-1200 v3 processors. This family of chips has four cores and clock speeds that range from 3.1 GHz to 3.7 GHz in standard versions. The Xeon E3-1200 v3 compute node has 32 GB of memory soldered onto it for low clearance and also has a 256 GB half-height 1.8-inch solid state disk drive. You can use any E3-1200 v3 chip that has a thermal design power of 84 watts or lower.

The compute brick allows for up to four coprocessors to be fitted with cold plates for sucking the heat off their components and linked to each one of the cores over the PCI-Express switch and into the PCI-Express controllers on the E3-1200 processors. Gallo tells HPCwire that Eurotech will ship the Xeon E3-1200 plus Xeon Phi configuration in a few weeks to initial customers, and that a few months after that the combination of the Xeon E3 processor and Nvidia’s Tesla K40 coprocessor will be supported. The Xeon Phi 7120X is rated at 1.2 teraflops doing double precision floating point math, while the Tesla K40 card has a base performance of 1.43 teraflops that can rise to 1.66 teraflops with GPU Boost overclocking turned on. With 128 nodes and four coprocessors per node, that works out to 614 teraflops per rack with Xeon Phis and 732 teraflops per rack with the Tesla K40s (not counting the extra performance from GPU Boost).
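The per-rack totals follow directly from the node count, the four coprocessors per node, and the per-card ratings quoted above. A minimal sketch of that arithmetic follows; it counts coprocessor peak only and ignores the Xeon E3 host CPUs, and the names are illustrative.

```python
NODES_PER_RACK = 128
COPROCESSORS_PER_NODE = 4

# Peak double-precision teraflops per card, as quoted in the article.
CARD_PEAK_TFLOPS = {
    "Xeon Phi 7120X": 1.2,
    "Tesla K40 (base)": 1.43,
    "Tesla K40 (GPU Boost)": 1.66,
}


def rack_peak_tflops(card_tflops: float,
                     nodes: int = NODES_PER_RACK,
                     cards_per_node: int = COPROCESSORS_PER_NODE) -> float:
    """Aggregate coprocessor peak for one rack, host CPUs not counted."""
    return nodes * cards_per_node * card_tflops


for name, tflops in CARD_PEAK_TFLOPS.items():
    print(f"{name}: {rack_peak_tflops(tflops):.1f} teraflops per rack")
```

This reproduces the 614 and 732 teraflops figures and shows that GPU Boost would lift a K40 rack to roughly 850 teraflops at peak.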

Back in June at ISC, Eurotech was talking up the Hive system (which did not yet have that name) by saying that it would be delivering a variant of the system that would marry a 64-bit ARM processor from Applied Micro with Tesla GPU coprocessors, and you might have gotten the impression that this would come out first. While Applied Micro is shipping its “Storm” X-Gene 1 chip now, it is readying the much-better “Shadowcat” X-Gene 2 processor, which has been sampling since August. This chip will support the RDMA over Converged Ethernet (RoCE) protocol over its integrated Ethernet network interface cards, simplifying the components that go into an ARM server node. The X-Gene 1 and 2 chips have two 10 Gb/sec Ethernet ports on the die, and these can be hooked right into adapter ports. That, in theory, leaves more room for other peripherals in the complex. The plan is to ship the X-Gene 2 as the ARM option for the CPU side of the hybrid node, along with the Tesla K40 cards as coprocessors, sometime around the second quarter of 2015.

Incidentally, Eurotech is able to get its hands on a Tesla K40 card with modified thermal plates so it fits into the super-skinny Hive module. The new Tesla K80 coprocessor card, announced this week at SC14, will be a bit tricky to add to the Aurora Hive system, explains Gallo, because this dual-GPU card has some of its power connectors across the top of the card. This does not work with the very tight tolerances in the Hive module, which are necessitated by the thermal conduction plates. With the Tesla K80 offering a base 1.87 teraflops of double precision math and up to 2.91 teraflops with GPU Boost, you can bet some customers will want this. Gallo says that there is enough thermal capacity to pull the heat off this 300 watt part, if the connectors can be sorted. Being able to double the flops in the box is a pretty strong motivator to solve this engineering problem.

Generally speaking, the X86 processor option plus either the Xeon Phi or Tesla GPU accelerators draws about 1,500 watts per node, which works out to around 5 gigaflops per watt. The top machines on the Green500 ranking of supercomputers are in the range of 4 gigaflops per watt.

Gallo is tight-lipped about what other processing components Eurotech might add to the Aurora Hive system, but obviously next year’s “Knights Landing” Xeon Phi, which will be sold as a standalone processor as well as a PCI-Express coprocessor, will slide right into this system. At 3 teraflops of double-precision floating point performance, and with the ability to put in five cards, this will be a radical increase in the math capabilities. And for dense-packed, CPU-only workloads that use low-speed Ethernet, Eurotech could make Hive bricks that are just based on Xeon E3 or various ARM processors that sport their own networking on the chip. If you take out the network card, that leaves room for six CPU-only compute cards per module, or 768 processors per rack. Another option would be to add cards that have flash drives with the high-speed, low-latency NVM Express protocol linking into that PCI-Express switch. You could also swap out some of the flash drives and put in GPU cards to do visualization in the same nodes where the data is stored. Eurotech has lots of options with the Aurora Hive architecture, and that is so by design.
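The density figures for these possible configurations are simple multiples of the 128 bricks in a rack. The sketch below works through two of them using only numbers quoted in the article; both configurations are speculative, and the Knights Landing total is a peak projection rather than an announced product spec.

```python
NODES_PER_RACK = 128


def processors_per_rack(cpu_cards_per_node: int,
                        nodes: int = NODES_PER_RACK) -> int:
    """CPU-only density: every slot in a brick holds a compute card."""
    return cpu_cards_per_node * nodes


def projected_rack_tflops(cards_per_node: int, tflops_per_card: float,
                          nodes: int = NODES_PER_RACK) -> float:
    """Peak coprocessor teraflops per rack for a hypothetical card mix."""
    return cards_per_node * tflops_per_card * nodes


# Six CPU-only cards per brick once the network card is removed.
print(processors_per_rack(6))            # 768

# Five Knights Landing cards per brick at 3 teraflops each (hypothetical).
print(projected_rack_tflops(5, 3.0))     # 1920.0
```

On those assumptions, a rack full of five-card Knights Landing bricks would land near 1.9 petaflops of peak double-precision math, which is the "radical increase" the paragraph above alludes to.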

But initially at least, Eurotech is going after the workloads that have been accelerated. “There are markets where accelerated applications have become the norm instead of an exotic thing,” says Gallo. “Geosciences, particularly reverse time migration reservoir analysis, is a good example. In general, signal processing will be interesting on this system, as will be machine learning, analytics, and some computer-aided engineering tools that have been modified for accelerators.”

The Aurora Hive comes preconfigured with the CentOS 6.X variant of Linux and support from Eurotech for this distribution, but customers can deploy other Linux operating systems on the machine as needed. Scientific Linux, Red Hat Enterprise Linux, SUSE Linux Enterprise Server, and Canonical Ubuntu Server are all supported. The Aurora software stack includes support for Intel Cluster Studio, Nvidia CUDA, MPSS, and the GCC compilers as well as the Intel MPI, Open MPI, and MVAPICH2 communication libraries.

Pricing for the Aurora Hive system was not available. The combination of density and hot water cooling should allow Eurotech to command a premium for its systems over plain vanilla clusters based on rack or blade servers, but how much of a premium is an open question. The market will decide.
