AMD Sets Up for Epyc Epoch

By Tiffany Trader

November 16, 2018

It’s been a good two weeks, AMD’s Gary Silcott and Andy Parma told me on the last day of SC18 in Dallas at the restaurant where we met to discuss their show news and recent successes.

Heck, it’s been a good year. AMD Epyc, delivered to the market in 2017, has been winning – convincingly winning – the price/performance bake-off against Xeon. Wall Street has fallen in love with AMD stock (though with some slackening of late). And the company moved off the market share dime for high-end server chips; in fact, AMD stands by its expectation of hitting the “mid-single digits” for market share by the end of the year. And we’re seeing Epyc pop up in some major system wins, such as HPE’s announcement at SC18 that it will deliver the fastest supercomputer in the world for industrial production, an Epyc-powered system (more on that below).

There’s no arguing: AMD has shined in 2018. But it still has a hill to climb after backing out of the datacenter server processor and HPC markets over the last half-decade or so.

AMD will tell you the company dropped out of those markets for a server processor re-set, the result of which is Epyc. All well and good, but there’s a price to pay for a market drop out: a credibility hit. When you ask managers of IT organizations and HPC centers to bet against Intel, you’re asking them to take a risk. Now, with a processor technology buyers can love, trust remains the missing piece.

This is why everyone at AMD, starting with Lisa Su, is laser-focused on executing the AMD roadmap and touting that execution to anyone who’ll listen. AMD believes hitting milestones not only reestablishes credibility but also stands in contrast to the market leader, which has had significant roadmap challenges over the past 12 months.

“We set out to hit our milestones and deliver to our customers and we did it,” the CEO stated in her keynote at the AMD Next Horizon event in San Francisco last week (Nov. 6). Yes, that was the week leading into SC18 – it’s been a busy time for AMD and the reporters who cover them.

Putting the next-gen Epyc and Radeon GPU launch the week before the biggest HPC conference of the year gave AMD a strong hand coming into SC18. With up to 64 Zen 2 cores, the 7nm Rome chip provides at least two times higher performance per socket than first-generation Epyc chips. The revamped Radeon Instinct cards, the MI60 and MI50, are the world’s first 7nm datacenter GPUs and feature flexible mixed-precision capabilities. Both families support PCIe gen4.

Momentum for all three generations of Epyc (Naples, Rome and future-gen Milan) got a shot in the arm in the last few weeks, with five major system wins showcasing AMD’s growing traction as well as the diversity of its partners and end users. [Editor’s note: article has been updated to include Finland’s next-gen system.]

In late October, the DOE in tandem with Berkeley Lab revealed Perlmutter (code-name: NERSC-9) would be built by Cray on its next-gen Shasta architecture with future-generation Epycs (Milan) plus future-generation Nvidia GPUs. Perlmutter, due out in 2020 with a planned 100 petaflops (peak) performance, will put AMD solidly back on the leadership supercomputing map, a position it hasn’t enjoyed since the circa-2012 NCSA Blue Waters supercomputer, which is set to retire next year after getting a one-year reprieve (sys admins are currently ferrying data from Blue Waters to Frontera, the next NSF tier 1 system, deployed at TACC).

The High-Performance Computing Center of the University of Stuttgart (HLRS) revealed at SC18 that it is using the next-generation AMD Epyc Rome processor to power what is anticipated to be the largest supercomputer in Europe and the world’s fastest supercomputer for industrial production. That system, named Hawk, will be built by HPE as a 5,000-node cluster with a theoretical peak performance of 24 petaflops. It is set to debut in 2019.

Lawrence Livermore has signed on with system-builder Penguin Computing for an all-AMD heterogeneous machine, what AMD calls A+A. That system, named Corona, will have 170 nodes incorporating more than 300 AMD Epyc 7401 processors and 300 AMD Radeon Instinct MI25 GPUs. Because it uses these current-gen technologies, Corona will be delivered soon and is expected to come online for limited use by the end of the year. Lawrence Livermore reports that the unclassified computing cluster will provide unique capabilities for lab researchers and industry partners to explore data science, machine learning and big data analytics.

On Tuesday (Nov. 13) the Finnish IT Center for Science Ltd (CSC) announced that it had selected AMD future-gen Rome Epycs to power the second phase of its next-generation supercomputer and data management systems, to be supplied by Atos. The planned system, due to come online in spring 2020, will comprise BullSequana XH2000 servers with Rome processors and Mellanox HDR InfiniBand, plus tightly integrated storage. The phase two partition will extend the compute capacity of the phase one Intel Xeon Cascade Lake-equipped system, expected to be available to CSC researchers in summer 2019. CSC is the first customer of Atos’ BullSequana XH2000, which was launched on Monday at SC18.

Moving back over to the commercial sector, the Haas F1 racing team has deployed current-generation 7000-series Epyc processors as part of a Cray CS500 cluster that it will use for CFD simulation. “The combination of the system’s design and ability to handle the most demanding simulation and analytics workloads and the high-performance computing expertise of Cray helps the Haas F1 team optimize aerodynamics and more accurately predict and reduce drag, downforce and flow patterns around its race cars,” said Cray.

In addition, the University of Notre Dame Center for Research Computing, Oregon State University and the National Institute for Nuclear Physics in Italy have all deployed AMD Epyc-based systems.

“One of the questions we got asked this week,” said Parma, Epyc HPC product/segment marketing leader at AMD, “was ‘do you have a favorite customer, favorite preferred customer?’ The answer is no, right now we are having a lot of success with a lot of different customers which is a great place to be right now.”

Other highlights from AMD announced this week include:

  • A preview of a new HB instance debuted on Microsoft Azure this week. HB VMs feature 60 AMD Epyc 7551 processor cores, 4 GB of RAM per CPU core, and no hyperthreading, according to Microsoft.
  • High-Frequency Epyc processor — The new AMD Epyc 7371 processor, introduced this week, was designed for workloads such as EDA and high-frequency trading. The new SKU is on track for production availability in the first quarter of 2019.
  • ROCm 2.0 Open Software Platform — AMD also showcased the newest version of AMD ROCm, designed for scale and for deploying high-performance, energy-efficient heterogeneous computing systems in an open environment.

Reached for comment, HPC market watcher Addison Snell, CEO of Intersect360 Research, said, “We see a lot of potential interest in AMD Epyc in the HPC arena, at a time when many organizations are ready to reevaluate what architectures they build on moving forward. AMD offers a compelling x86 option to Intel. With little to lose, it’s not really a question of whether AMD will gain share, but rather how much share they’ll gain.”

Company leaders, though, are not taking any chances and continue to emphasize AMD’s commitment, dedication and follow-through.

“Focus, execution and delivery — that’s what makes AMD a bankable supplier,” said Mark Papermaster, chief technology officer and senior vice president of technology and engineering, in his keynote address at AMD’s media and analyst event on Nov. 6. “When we make the effort to go back into high performance, equally important with coming out with that first set of products in 2017 was the roadmap behind it because that’s what enterprise customers need. They want to know that you are delivering value today and they need full confidence in the roadmap going forward. That’s what we are doing that’s a game changer for AMD and for riding the acceptance of these new products in the market.”

Doug Black contributed to this report.
