IBM Debuts Power8 Chip with NVLink and Three New Systems

By John Russell

September 8, 2016

Not long after revealing more details about its next-gen Power9 chip due in 2017, IBM today rolled out three new Power8-based Linux servers and a new version of its Power8 chip featuring Nvidia’s NVLink interconnect. One of the servers – Power S822LC for High Performance Computing (codenamed “Minsky”) – uses the new chip (Power8 with NVLink) to communicate with P100 Pascal GPUs, NVIDIA’s most recent and highest performing GPU.

The other servers – the Power S821LC and the Power S822LC for Big Data – also leverage GPU acceleration technology (K80 or P100) via PCIe interface and have IBM’s Coherent Accelerator Processor Interface (CAPI) for use with Flash storage and FPGAs. All three servers are standard two-socket additions to IBM’s Linux line.

These introductions, said Sumit Gupta, vice president, High Performance Computing and Analytics, IBM, should be seen as proof of IBM’s ongoing commitment to its vision of accelerated computing as the new paradigm, and of cognitive computing (writ large) and big data analytics as the major drivers (See HPCwire article, Think Fast: IBM Talks Acceleration in HPC and the Enterprise).

Also noteworthy is that the new systems are manufactured by partners. “All three of these OpenPOWER systems leverage the strengths and expertise of OpenPOWER partners, from acceleration capabilities to strengths in design and manufacturing. In the spirit of open we hope that our Industry partners who are manufacturing these systems, Wistron, an OpenPOWER partner, and Supermicro, will deliver Power-based servers to their clients through their routes to market in order to proliferate the OpenPOWER ecosystem,” said Gupta.

IBM.LC Server

IBM says the S822LC server with NVLink embedded at the silicon level “enables data to flow 5x faster” than on a comparable x86-based system. It also substantially reduces the programming barrier to aggressive use of GPUs according to Gupta who has written a more detailed blog on the introductions. This is the first Power8-based system delivered with NVLink according to IBM.

“Moving data from the CPU to the GPU [has been the] bottleneck because with most systems most of it is going through this thin pipe, PCIe. With NVLink the GPU has access to up to half a terabyte of memory that sits on the CPU side of that interconnect,” Gupta told HPCWire. NVLink allows improved transfer of data between both processors which fundamentally makes it easier to program.

“When an application starts, all the data is sitting in the system memory, and you’ve got to move chunks of it over to the GPU,” Gupta continued. “NVLink does three things. It improves the performance because we’ve enabled a fatter pipe between the processors. It enables you to move smaller functions. And it makes programming accelerators easier because you have to do less data management.”

IBM.POWER8.NVLINK

The new Power8 with NVLink processor features 10 cores running up to 3.26 GHz. POWER8 processors in this server have higher memory bandwidth than x86 CPUs, at 115 GB/s and can have as much as ½ a terabyte of system memory per socket.   There are larger caches per core inside the POWER8 processor, and this coupled with the faster cores and memory bandwidth leads to higher application performance and throughput.

The new NVIDIA Tesla P100 GPU accelerator increases floating point performance, delivering 21 teraflops of half-precision, 10.6 teraflops of single-precision, and 5.3 teraflops of double-precision performance. The accelerator includes 16 gigabytes of the HBM2 stacked memory with an on-GPU memory bandwidth of 720 gigabytes per sec (GB/s). The NVIDIA Tesla P100 with NVLink GPU in the SXM2 form factor delivers 14 percent more raw compute performance than the PCIe variant.

Using NVLink, writes Gupta in his blog, provides three major advantages to application acceleration:

  1. Performance: The new Power8 with NVLink processor and the new Tesla P100 GPU have four NVLink interfaces that enable “5x faster communication than a PCIe x16 Gen3 connection used in other systems.” This enables faster data exchange and application performance, by overcoming the limitation of narrow PCIe data pipe into the GPU.
  2. Programmability: The CUDA 8 software and the Page Migration Engine in Tesla P100 enable a unified memory space with automated data management between the system memory connected to the CPU and the GPU memory. Coupled with NVLink, unified memory makes programming GPU accelerators much easier for developers. Applications can be easily accelerated with GPUs by incrementally moving functions from the CPU to the GPU, without having to deal with data management.
  3. More application acceleration: Since NVLink reduces the communication time between the CPU and GPU, it enables smaller pieces of work to be moved to the GPU for acceleration. This means that more parts of an application can be GPU accelerated.

In making the announcements, IBM continued ratcheting up its ‘we’re-better-than-Intel’ rhetoric. Its broad application targets encompass all things big data and analytics, as well as deep learning and cognitive computing.

“The big advantage we are seeing for Power8 in the market has been around data analytics, databases, and high performance computing for machine learning and deep learning, and artificial intelligence,” said Gupta. “Because we have faster cores, we see much better performance, [for example], on databases compared to Intel-based systems. Applications like kinetica, which is an accelerated (GPU optimized) database for deep learning and machine learning, gets the value of NVLink high speed data connection between CPU and GPU.”

Recognizing the uphill battle in winning x86 market share, IBM in the past has emphasized efforts to penetrate hyperscalers as pivotal to its success (see HPCwire article, Handicapping IBM/OpenPOWER’s Odds for Success).

According to the IBM release: “Early testing with one of the world’s largest Internet service providers (Tencent) based in China has shown that a large cluster of the new Power S822LC for Big Data servers was able to run a data-intensive workload three times faster than its former x86-based infrastructure.  Moreover, this result was achieved while reducing the total number of servers used by two-thirds. Given the significant cost benefits of using fewer servers to deliver faster performance, the company is now integrating the new LC servers into its hyperscale data center for big data workloads.”

Sumit Gupta, IBM
Sumit Gupta, IBM

Gupta maintains there’s a big appetite for new Power8-based servers despite the advancing Power9. “Several businesses, research organizations and government bodies have pre-tested early systems and placed their orders. Among those first in line to receive shipments are a large multinational retail corporation and the U.S. Department of Energy’s Oak Ridge National Laboratory (ORNL),” according to the IBM release.

ORNL will use the new systems as a development platform for optimizing applications to take advantage of the built-in NVLink interface technology. The systems will serve as an early-generation test bed for developing demanding applications for Summit, ORNL’s next generation supercomputer that IBM will deliver in 2017 and which will use the Power9 chip. Arthur S. (Buddy) Bland, OLCF project director, is quoted in the press release saying, “As a long-time user of GPUs, we believe that this will improve the performance of our applications and make it easier for the users to deliver great science.”

Building an ecosystem is hard. For IBM and OpenPOWER, many of the diverse pieces needed are seemingly falling into place. Time will tell.

Link to Gupta’s blog: www.ibm.com/blogs/systems/ibm-nvidia-present-nvlink-server-youve-waiting

Image source: IBM

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Supercomputer Simulations Validate NASA Crash Testing

February 17, 2020

Car crash simulation is already a challenging supercomputing task, requiring pinpoint estimation of how hundreds of components interact with turbulent forces and human bodies. Spacecraft crash simulation is far more diff Read more…

By Oliver Peckham

What’s New in HPC Research: Quantum Clouds, Interatomic Models, Genetic Algorithms & More

February 14, 2020

In this bimonthly feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From parallel programming to exascale to quantum computing, the details are here. Read more…

By Oliver Peckham

The Massive GPU Cloudburst Experiment Plays a Smaller, More Productive Encore

February 13, 2020

In November, researchers at the San Diego Supercomputer Center (SDSC) and the IceCube Particle Astrophysics Center (WIPAC) set out to break the internet – or at least, pull off the cloud HPC equivalent. As part of thei Read more…

By Oliver Peckham

ORNL Team Develops AI-based Cancer Text Mining Tool on Summit

February 13, 2020

A group of Oak Ridge National Laboratory researchers working on the Summit supercomputer has developed a new neural network tool for fast extraction of information from cancer pathology reports to speed research and clin Read more…

By John Russell

Nature Serves up Another Challenge to Quantum Computing?

February 13, 2020

Just when you thought it was safe to assume quantum computing – though distant – would eventually succumb to clever technology, another potentially confounding factor pops up. It’s the Heisenberg Limit (HL), close Read more…

By John Russell

AWS Solution Channel

Challenging the barriers to High Performance Computing in the Cloud

Cloud computing helps democratize High Performance Computing by placing powerful computational capabilities in the hands of more researchers, engineers, and organizations who may lack access to sufficient on-premises infrastructure. Read more…

IBM Accelerated Insights

Intelligent HPC – Keeping Hard Work at Bay(es)

Since the dawn of time, humans have looked for ways to make their lives easier. Over the centuries human ingenuity has given us inventions such as the wheel and simple machines – which help greatly with tasks that would otherwise be extremely laborious. Read more…

Researchers Enlist Three Supercomputers to Apply Deep Learning to Extreme Weather

February 12, 2020

When it comes to extreme weather, an errant forecast can have serious effects. While advance warning can give people time to prepare for the weather as it did with the polar vortex last year, the absence of accurate adva Read more…

By Oliver Peckham

The Massive GPU Cloudburst Experiment Plays a Smaller, More Productive Encore

February 13, 2020

In November, researchers at the San Diego Supercomputer Center (SDSC) and the IceCube Particle Astrophysics Center (WIPAC) set out to break the internet – or Read more…

By Oliver Peckham

Eni to Retake Industry HPC Crown with Launch of HPC5

February 12, 2020

With the launch of its Dell-built HPC5 system, Italian energy company Eni regains its position atop the industrial supercomputing leaderboard. At 52-petaflops p Read more…

By Tiffany Trader

Trump Budget Proposal Again Slashes Science Spending

February 11, 2020

President Donald Trump’s FY2021 U.S. Budget, submitted to Congress this week, again slashes science spending. It’s a $4.8 trillion statement of priorities, Read more…

By John Russell

Policy: Republicans Eye Bigger Science Budgets; NSF Celebrates 70th, Names Idea Machine Winners

February 5, 2020

It’s a busy week for science policy. Yesterday, the National Science Foundation announced winners of its 2026 Idea Machine contest seeking directions for futu Read more…

By John Russell

Fujitsu A64FX Supercomputer to Be Deployed at Nagoya University This Summer

February 3, 2020

Japanese tech giant Fujitsu announced today that it will supply Nagoya University Information Technology Center with the first commercial supercomputer powered Read more…

By Tiffany Trader

Intel Stopping Nervana Development to Focus on Habana AI Chips

February 3, 2020

Just two months after acquiring Israeli AI chip start-up Habana Labs for $2 billion, Intel is stopping development of its existing Nervana neural network proces Read more…

By John Russell

Lise Supercomputer, Part of HLRN-IV, Begins Operations

January 29, 2020

The second phase of the build-out of HLRN-IV – the planned 16 peak-petaflops supercomputer serving the North-German Supercomputing Alliance (HLRN) – is unde Read more…

By Staff report

IBM Debuts IC922 Power Server for AI Inferencing and Data Management

January 28, 2020

IBM today launched a Power9-based inference server – the IC922 – that features up to six Nvidia T4 GPUs, PCIe Gen 4 and OpenCAPI connectivity, and can accom Read more…

By John Russell

Julia Programming’s Dramatic Rise in HPC and Elsewhere

January 14, 2020

Back in 2012 a paper by four computer scientists including Alan Edelman of MIT introduced Julia, A Fast Dynamic Language for Technical Computing. At the time, t Read more…

By John Russell

Cray, Fujitsu Both Bringing Fujitsu A64FX-based Supercomputers to Market in 2020

November 12, 2019

The number of top-tier HPC systems makers has shrunk due to a steady march of M&A activity, but there is increased diversity and choice of processing compon Read more…

By Tiffany Trader

SC19: IBM Changes Its HPC-AI Game Plan

November 25, 2019

It’s probably fair to say IBM is known for big bets. Summit supercomputer – a big win. Red Hat acquisition – looking like a big win. OpenPOWER and Power processors – jury’s out? At SC19, long-time IBMer Dave Turek sketched out a different kind of bet for Big Blue – a small ball strategy, if you’ll forgive the baseball analogy... Read more…

By John Russell

Intel Debuts New GPU – Ponte Vecchio – and Outlines Aspirations for oneAPI

November 17, 2019

Intel today revealed a few more details about its forthcoming Xe line of GPUs – the top SKU is named Ponte Vecchio and will be used in Aurora, the first plann Read more…

By John Russell

Dell Ramps Up HPC Testing of AMD Rome Processors

October 21, 2019

Dell Technologies is wading deeper into the AMD-based systems market with a growing evaluation program for the latest Epyc (Rome) microprocessors from AMD. In a Read more…

By John Russell

IBM Unveils Latest Achievements in AI Hardware

December 13, 2019

“The increased capabilities of contemporary AI models provide unprecedented recognition accuracy, but often at the expense of larger computational and energet Read more…

By Oliver Peckham

SC19: Welcome to Denver

November 17, 2019

A significant swath of the HPC community has come to Denver for SC19, which began today (Sunday) with a rich technical program. As is customary, the ribbon cutt Read more…

By Tiffany Trader

D-Wave’s Path to 5000 Qubits; Google’s Quantum Supremacy Claim

September 24, 2019

On the heels of IBM’s quantum news last week come two more quantum items. D-Wave Systems today announced the name of its forthcoming 5000-qubit system, Advantage (yes the name choice isn’t serendipity), at its user conference being held this week in Newport, RI. Read more…

By John Russell

Leading Solution Providers

SC 2019 Virtual Booth Video Tour

AMD
AMD
ASROCK RACK
ASROCK RACK
AWS
AWS
CEJN
CJEN
CRAY
CRAY
DDN
DDN
DELL EMC
DELL EMC
IBM
IBM
MELLANOX
MELLANOX
ONE STOP SYSTEMS
ONE STOP SYSTEMS
PANASAS
PANASAS
SIX NINES IT
SIX NINES IT
VERNE GLOBAL
VERNE GLOBAL
WEKAIO
WEKAIO

Jensen Huang’s SC19 – Fast Cars, a Strong Arm, and Aiming for the Cloud(s)

November 20, 2019

We’ve come to expect Nvidia CEO Jensen Huang’s annual SC keynote to contain stunning graphics and lively bravado (with plenty of examples) in support of GPU Read more…

By John Russell

51,000 Cloud GPUs Converge to Power Neutrino Discovery at the South Pole

November 22, 2019

At the dead center of the South Pole, thousands of sensors spanning a cubic kilometer are buried thousands of meters beneath the ice. The sensors are part of Ic Read more…

By Oliver Peckham

Fujitsu A64FX Supercomputer to Be Deployed at Nagoya University This Summer

February 3, 2020

Japanese tech giant Fujitsu announced today that it will supply Nagoya University Information Technology Center with the first commercial supercomputer powered Read more…

By Tiffany Trader

Top500: US Maintains Performance Lead; Arm Tops Green500

November 18, 2019

The 54th Top500, revealed today at SC19, is a familiar list: the U.S. Summit (ORNL) and Sierra (LLNL) machines, offering 148.6 and 94.6 petaflops respectively, Read more…

By Tiffany Trader

Azure Cloud First with AMD Epyc Rome Processors

November 6, 2019

At Ignite 2019 this week, Microsoft's Azure cloud team and AMD announced an expansion of their partnership that began in 2017 when Azure debuted Epyc-backed instances for storage workloads. The fourth-generation Azure D-series and E-series virtual machines previewed at the Rome launch in August are now generally available. Read more…

By Tiffany Trader

Intel’s New Hyderabad Design Center Targets Exascale Era Technologies

December 3, 2019

Intel's Raja Koduri was in India this week to help launch a new 300,000 square foot design and engineering center in Hyderabad, which will focus on advanced com Read more…

By Tiffany Trader

In Memoriam: Steve Tuecke, Globus Co-founder

November 4, 2019

HPCwire is deeply saddened to report that Steve Tuecke, longtime scientist at Argonne National Lab and University of Chicago, has passed away at age 52. Tuecke Read more…

By Tiffany Trader

Cray Debuts ClusterStor E1000 Finishing Remake of Portfolio for ‘Exascale Era’

October 30, 2019

Cray, now owned by HPE, today introduced the ClusterStor E1000 storage platform, which leverages Cray software and mixes hard disk drives (HDD) and flash memory Read more…

By John Russell

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This