Top500 Purely Petaflops; US Maintains Performance Lead

By Tiffany Trader

June 17, 2019

Editor’s note: In preparing this article, I noticed an irregularity with the Top500 reporting in that Sierra and two other NNSA/LLNL systems (Lassen and Ansel) were omitted from IBM’s totals (by system share and by performance share), which incorrectly put IBM in second place by aggregate list performance, instead of first place. An earlier version of this article pointed out that irregularity. On Monday (June 17) Top500 coauthor Erich Strohmaier confirmed to HPCwire that this was an aggregation error, triggered by the manufacturer field being listed as “IBM/Nvidia/Mellanox” for these three machines. The article has been amended to reflect the correct information.

With the kick-off of the International Supercomputing Conference (ISC) in Frankfurt this morning, the 53rd Top500 list made its debut, and this one’s for petafloppers only. The entry point for the new list is 1.022 petaflops, and the total aggregate performance of all 500 systems for the first time exceeds 1.5 exaflops. The top new entry to the list is Frontera, the NSF-funded Dell EMC system installed at the Texas Advanced Computing Center (TACC), at number five with 23.5 Linpack petaflops.

China claims the most systems (219) with 208 of these pretty evenly split among Lenovo (74), Inspur (71) and Sugon (63). Lenovo, which acquired IBM’s x86 business in 2014, maintains its lead as the number one Top500 vendor by number of systems (173). As we continue to note, the majority of these are not HPC systems or supercomputers; in fact, over half the full list (about 55 percent) is now comprised of web-scale/cloud systems.

The US (with 116 systems) trails China by nearly half in terms of system share, but when it comes to aggregate performance share, the US leads with 38.4 percent of list flops compared with China’s 29.9 percent. Summit and Sierra–still at number one and two respectively–enjoy 243.2 Linpack petaflops between them, accounting for nearly a sixth of the list’s aggregate capacity. Oak Ridge National Laboratory has optimized Summit’s Linpack performance, such that it now delivers 148.6 petaflops on the HPL benchmark, versus 143.5 petaflops six months ago.

The prime manufacturer behind Summit and Sierra, IBM, is in first place by aggregate list performance with 321 petaflops, ahead of Lenovo, which touts 302 total petaflops. Cray has the next highest aggregate list performance share (183.4 petaflops), followed by HPE (120.1 petaflops) and Sugon (96 petaflops). In anticipation of HPE purchasing Cray, we note that a combined union on this list would deliver a total of 303.5 aggregate petaflops–still shy of IBM’s total, and a smidgen ahead of Lenovo’s.

Restricting our analysis to the top 100 systems to better gauge the HPC/supercomputing picture (an imperfect but quick reckoning), the top five countries by system share are US with 37, Japan with 15, China and Germany tied with 9, followed by France with six systems (including two Jean Zay systems, an accelerated machine in 42nd position and a CPU-only one in 72nd). By aggregate performance, US leads the top 100 pack with 49 percent of the flops, China with 17 percent, Japan with 10 percent, and Germany and France both with 5 percent. The top vendors by system share in the top 100 grouping are Cray with 25, HPE with 18, IBM with eight [Ed. this is the corrected number], Fujitsu with seven, and then a three-way tie between Dell EMC, Nvidia and Sugon (all with three systems). By aggregate performance share, the order changes to IBM, Cray, HPE, Fujitsu and Dell EMC.

June 2019 Top 12 – Click to Expand (Source: Top500)

View from the Top: Bienvenida Frontera

Frontera, the Dell C6420 system installed at the TACC, makes its list debut at number five, achieving 23.5 Linpack petaflops (38.7 peak petaflops) using 448,448 Intel Xeon cores. Frontera’s 8,000 nodes are hooked together using Mellanox HDR InfiniBand. This is the debut list for HDR, which connects three other systems (#68, #166, and #264) in addition to Frontera.

Lassen–Sierra’s unclassified “companion” system at LLNL that was built by IBM, Nvidia and Mellanox–moves up into the number 10 spot with an upgraded capacity of 18.2 Linpack petaflops.

Titan, ORNL’s Cray XK7 system, which had a long reign as the United States’ fastest supercomputer since its debut as an upgrade to Jaguar in November 2012 until Summit was deployed in June 2018, has shifted out of the top 10, as has Sequoia, the last BlueGene/Q system at LLNL.

Taking Lassen’s previous spot in 11th place is another IBM system, a new entrant to the list and the fastest commercial supercomputer: Pangea III. Measured at 17.9 Linpack petaflops (rPeak 25 petaflops), the system was built by IBM for Total to help the energy company more accurately locate new oil reservoirs and better assess the potential associated revenue opportunities.

One vendor missing from the top 10 pack is HPE, an observation made by market analyst Addison Snell, president of Intersect360 Research. “IBM, Cray, Fujitsu, Lenovo, and Dell EMC all have top ten systems. The biggest HPE system is no slouch, at number 17, the HPC4 system at Italian energy company Eni. And of course, HPE will soon acquire Cray, inheriting the plans for the first two exascale systems in the US.”

How the Chips Fall

While there are more CPU options appearing in the general server market, the trend has not had much of an impact on the Top500 yet. Intel chips still dominate in so far as they appear in 95.6 percent of all systems, although in some cases, much or most of the heavy lifting is owed to accelerators or coprocessors. IBM Power processors are in seven systems, followed by AMD CPUs, which are used in three systems. A single supercomputer on the list, Astra at Sandia National Labs, is powered by (Marvell/Cavium) Arm processors. Built by HPE, Astra debuted on the November 2018 list at 204 with 1.5 petaflops. An upgrade goosed Linpack to 1.76 petaflops, elevating the system into 156th position.

Accelerated Computing

There are 133 systems that use accelerator or co-processor technology, dipping slightly from 138 six months ago. Overall the trend toward acceleration remains strong with Nvidia seeing 25 percent growth in the number of Top500 systems that employ its GPUs in the last year. The total number of Nvidia GPU-accelerated systems has gone from 98 last June 2018 to 125 total systems on the current list.

Nvidia’s DGX SuperPOD

With AMD and Intel server GPUs coming into the market, or preparing to in Intel’s case–including already announced big wins (Aurora, Frontier)–this will be an interesting trend to follow.

Nvidia has two DGX-2 systems on the list, Circe and DGX SuperPod, which integrate Xeon Skylake Platinum 8174 processors, Nvidia V100 GPUs, NVLink technology and Mellanox EDR InfiniBand. Both are used internally for research and company workloads.

Circe, a cluster of 36 DGX-2H boxes that ranks 71 with 3.1 Linpack petaflops, entered the list last November. DGX SuperPod, comprising 96 DGX-2H nodes, joins the list today at number 22 with a Linpack score of 9.4 petaflops. Nvidia used an earlier version of this system for its MLPerf benchmarking.

“This is our system that we use to develop most notably the brains for our autonomous vehicle platform,” said Ian Buck, general manager and vice president of accelerated computing at Nvidia, referring to the DGX SuperPod. He noted that the 96 DGX systems are highly connected. “We have more than 10 Mellanox EDR InfiniBand NICs per node connected together with multiple direction level switches, connecting more than 1,500 tensor core GPUs.”

Hyperscale Takeover Continues

A tipping point has been reached, where the number of IT/web-scale machines on the Top500 now outnumber true HPC clusters and supercomputers. Claims of list-stuffing were raised last year at ISC due to an increase in hyperscale systems being creatively benchmarked for the list (which to be clear is not breaking any rules and is not a new practice; it dates back to at least 2010). The renewed enthusiasm for running the HPL benchmark on cloud systems that will be deployed and leveraged in a disaggregrated manner is partly behind China’s ascendance on the list as Chinese vendors Sugon, Inspur and Lenovo have made a concerted effort to get their systems on the list. Somebody has to lead these efforts; they don’t just happen.

In-line with this trend, Gigabit Ethernet (GE) continues to expand its hold on the list, connecting 271 systems (a 54.2 share). By aggregate flops, GE carries just 25 percent of list share. The other 75 percent of flops were conducted via InfiniBand, OmniPath, custom and proprietary interconnects.

New Entrant Profile

There are 94 new entrants to the list, bringing in 223 petaflops with China backing 35 percent of these flops and US responsible for 30 percent. Of the 94 additions, 45 are Chinese and 17 are US. Roughly two-thirds of the new systems are cloud/web-scale machines.

Green500 & HPCG

The Green500, which tracks the most energy-efficient Top500 machines, has been integrated into the Top500 reporting process and the High Performance Conjugate Gradients (HPCG) benchmark is also included in the list.

Maintaining their positions from six months ago, the top two Green500 systems are Shoubu system B and DGX SaturnV Volta ranked 471 and 469 on the Top500, respectively. Shoubu system B, made by PEZY/Exascalar and located at RIKEN, achieves 17.6 gigaflops/watt; while DGX SaturnV Volta, made by Nvidia for Nvidia, delivers 15.1 gigaflops/watt.

Top500 champ Summit, located at Oak Ridge National Laboratory in Tennessee, is in third place on the Green500 with a slightly improved result of 14.71 gigaflops/watt.

At number four is the AI Bridging Cloud Infrastructure (ABCI) system installed at the National Institute of Advanced Industrial Science and Technology (AIST) in Japan. Number seven on the Top500, ABCI delivers 14.4 gigaflops/watt.

The MareNostrum P9 CTE, moves up three spots into fifth place with a refreshed result of 14.1 gigaflops/watt. Built by IBM for Barcelona Supercomputing Center, the machine delivers 1.14 Linpack petaflops for a 393rd ranking on the Top500.

TSUBAME 3.0 drops from fifth to sixth place on the Green500 with a result of 13.7 gigaflops/watt. The 8.1 petaflops supercomputer, 25th on the current Top500 list, was made by HPE for GSIC Center, Tokyo Institute of Technology.

Total’s new supercomputer, Pangea III, new to the Top500 list in 11th place, nabs the seventh position on the Green500, achieving 13.07 gigaflops/watt.

Sierra, LLNL’s flagship system that is number two on the Top500, moves to eighth place, maintaining its efficiency score of 12.72 gigaflops/watt.

Sugon’s Advanced Computing System (PreE) and Taiwania 2 (installed at Taiwan’s National Center for High Performance Computing) captured the nine and ten positions on the Green500, respectively. The PreE system—number 43 on the Top500 with 4.3 petaflops—is powered by the Dhyana CPU, a Hygon-licensed implementation of AMD’s Epyc processor. Built by Quanta Computer, Taiwania 2 occupies spot number 23 on the Top500 with a Linpack rating of 9 petaflops.

All of the top 10 greenest supercomputers employ accelerators (primarily Nvidia GPUs) in a hybrid system design. In fact, the highest-ranking Green500 CPU-only machine is Sunway TaihuLight in 25th place. China’s TaihuLight, the pride of Wuxi, home to the computer’s eponymous Taihu Lake, maintains its third-spot ranking on the Top500 with 93 Linpack petaflops.

One of the encouraging things about the Green500 is the diversity of international representation. Japan, the United States, Spain, France, China, Taiwan, the United Kingdom, Italy, Canada, Switzerland and Brazil all have machines in the top 25.

On the HPCG, Summit and Sierra hold the top positions with Japan’s K computer at number three. Respectively, they achieved 2.93 HPCG-petaflops, 1.80 HPCG-petaflops, and .60 HPCG-petaflops. As the list authors note, “Summit and Sierra remain the only two systems to exceed a petaflop on the HPCG benchmark…. The average HPCG result on the current list is 213.3 teraflops, a marginal increase from 211.2 six months ago.”

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Red Hat’s Disruption of CentOS Unleashes Storm of Dissent

January 22, 2021

Five weeks after angering much of the CentOS Linux developer community by unveiling controversial changes to the no-cost CentOS operating system, Red Hat has unveiled alternatives for affected users that give them severa Read more…

By Todd R. Weiss

China Unveils First 7nm Chip: Big Island

January 22, 2021

Shanghai Tianshu Zhaoxin Semiconductor Co. is claiming China’s first 7-nanometer chip, described as a leading-edge, general-purpose cloud computing chip based on a proprietary GPU architecture. Dubbed “Big Island Read more…

By George Leopold

HiPEAC Keynote: In-Memory Computing Steps Closer to Practical Reality

January 21, 2021

Pursuit of in-memory computing has long been an active area with recent progress showing promise. Just how in-memory computing works, how close it is to practical application, and what are some of the key opportunities a Read more…

By John Russell

HiPEAC’s Vision for a New Cyber Era, a ‘Continuum of Computing’

January 21, 2021

Earlier this week (Jan. 19), HiPEAC — the European Network on High Performance and Embedded Architecture and Compilation — published the 8th edition of the HiPEAC Vision, detailing an increasingly interconnected computing landscape where complex tasks are carried out across multiple... Read more…

By Tiffany Trader

Supercomputers Assist Hunt for Mysterious Axion Particle

January 21, 2021

In the 1970s, scientists theorized the existence of axions: particles born in the hearts of stars that, when exposed to a magnetic field, become light particles, and which may even comprise dark matter. To date, however, Read more…

By Oliver Peckham

AWS Solution Channel

Fire Dynamics Simulation CFD workflow on AWS

Modeling fires is key for many industries, from the design of new buildings, defining evacuation procedures for trains, planes and ships, and even the spread of wildfires. Read more…

Intel® HPC + AI Pavilion

Intel Keynote Address

Intel is the foundation of HPC – from the workstation to the cloud to the backbone of the Top500. At SC20, Intel’s Trish Damkroger, VP and GM of high performance computing, addresses the audience to show how Intel and its partners are building the future of HPC today, through hardware and software technologies that accelerate the broad deployment of advanced HPC systems. Read more…

Researchers Train Fluid Dynamics Neural Networks on Supercomputers

January 21, 2021

Fluid dynamics simulations are critical for applications ranging from wind turbine design to aircraft optimization. Running these simulations through direct numerical simulations, however, is computationally costly. Many Read more…

By Oliver Peckham

Red Hat’s Disruption of CentOS Unleashes Storm of Dissent

January 22, 2021

Five weeks after angering much of the CentOS Linux developer community by unveiling controversial changes to the no-cost CentOS operating system, Red Hat has un Read more…

By Todd R. Weiss

HiPEAC Keynote: In-Memory Computing Steps Closer to Practical Reality

January 21, 2021

Pursuit of in-memory computing has long been an active area with recent progress showing promise. Just how in-memory computing works, how close it is to practic Read more…

By John Russell

HiPEAC’s Vision for a New Cyber Era, a ‘Continuum of Computing’

January 21, 2021

Earlier this week (Jan. 19), HiPEAC — the European Network on High Performance and Embedded Architecture and Compilation — published the 8th edition of the HiPEAC Vision, detailing an increasingly interconnected computing landscape where complex tasks are carried out across multiple... Read more…

By Tiffany Trader

Saudi Aramco Unveils Dammam 7, Its New Top Ten Supercomputer

January 21, 2021

By revenue, oil and gas giant Saudi Aramco is one of the largest companies in the world, and it has historically employed commensurate amounts of supercomputing Read more…

By Oliver Peckham

President-elect Biden Taps Eric Lander and Deep Team on Science Policy

January 19, 2021

Last Friday U.S. President-elect Joe Biden named The Broad Institute founding director and president Eric Lander as his science advisor and as director of the Office of Science and Technology Policy. Lander, 63, is a mathematician by training and distinguished life sciences... Read more…

By John Russell

Pat Gelsinger Returns to Intel as CEO

January 14, 2021

The Intel board of directors has appointed a new CEO. Intel alum Pat Gelsinger is leaving his post as CEO of VMware to rejoin the company that he parted ways with 11 years ago. Gelsinger will succeed Bob Swan, who will remain CEO until Feb. 15. Gelsinger previously spent 30 years... Read more…

By Tiffany Trader

Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?

January 13, 2021

The rapid adoption of Julia, the open source, high level programing language with roots at MIT, shows no sign of slowing according to data from Julialang.org. I Read more…

By John Russell

Intel ‘Ice Lake’ Server Chips in Production, Set for Volume Ramp This Quarter

January 12, 2021

Intel Corp. used this week’s virtual CES 2021 event to reassert its dominance of the datacenter with the formal roll out of its next-generation server chip, the 10nm Xeon Scalable processor that targets AI and HPC workloads. The third-generation “Ice Lake” family... Read more…

By George Leopold

Esperanto Unveils ML Chip with Nearly 1,100 RISC-V Cores

December 8, 2020

At the RISC-V Summit today, Art Swift, CEO of Esperanto Technologies, announced a new, RISC-V based chip aimed at machine learning and containing nearly 1,100 low-power cores based on the open-source RISC-V architecture. Esperanto Technologies, headquartered in... Read more…

By Oliver Peckham

Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?

January 13, 2021

The rapid adoption of Julia, the open source, high level programing language with roots at MIT, shows no sign of slowing according to data from Julialang.org. I Read more…

By John Russell

Azure Scaled to Record 86,400 Cores for Molecular Dynamics

November 20, 2020

A new record for HPC scaling on the public cloud has been achieved on Microsoft Azure. Led by Dr. Jer-Ming Chia, the cloud provider partnered with the Beckman I Read more…

By Oliver Peckham

NICS Unleashes ‘Kraken’ Supercomputer

April 4, 2008

A Cray XT4 supercomputer, dubbed Kraken, is scheduled to come online in mid-summer at the National Institute for Computational Sciences (NICS). The soon-to-be petascale system, and the resulting NICS organization, are the result of an NSF Track II award of $65 million to the University of Tennessee and its partners to provide next-generation supercomputing for the nation's science community. Read more…

Is the Nvidia A100 GPU Performance Worth a Hardware Upgrade?

October 16, 2020

Over the last decade, accelerators have seen an increasing rate of adoption in high-performance computing (HPC) platforms, and in the June 2020 Top500 list, eig Read more…

By Hartwig Anzt, Ahmad Abdelfattah and Jack Dongarra

Aurora’s Troubles Move Frontier into Pole Exascale Position

October 1, 2020

Intel’s 7nm node delay has raised questions about the status of the Aurora supercomputer that was scheduled to be stood up at Argonne National Laboratory next year. Aurora was in the running to be the United States’ first exascale supercomputer although it was on a contemporaneous timeline with... Read more…

By Tiffany Trader

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

By Doug Black

Programming the Soon-to-Be World’s Fastest Supercomputer, Frontier

January 5, 2021

What’s it like designing an app for the world’s fastest supercomputer, set to come online in the United States in 2021? The University of Delaware’s Sunita Chandrasekaran is leading an elite international team in just that task. Chandrasekaran, assistant professor of computer and information sciences, recently was named... Read more…

By Tracey Bryant

Leading Solution Providers

Contributors

Top500: Fugaku Keeps Crown, Nvidia’s Selene Climbs to #5

November 16, 2020

With the publication of the 56th Top500 list today from SC20's virtual proceedings, Japan's Fugaku supercomputer – now fully deployed – notches another win, Read more…

By Tiffany Trader

Texas A&M Announces Flagship ‘Grace’ Supercomputer

November 9, 2020

Texas A&M University has announced its next flagship system: Grace. The new supercomputer, named for legendary programming pioneer Grace Hopper, is replacing the Ada system (itself named for mathematician Ada Lovelace) as the primary workhorse for Texas A&M’s High Performance Research Computing (HPRC). Read more…

By Oliver Peckham

At Oak Ridge, ‘End of Life’ Sometimes Isn’t

October 31, 2020

Sometimes, the old dog actually does go live on a farm. HPC systems are often cursed with short lifespans, as they are continually supplanted by the latest and Read more…

By Oliver Peckham

Gordon Bell Special Prize Goes to Massive SARS-CoV-2 Simulations

November 19, 2020

2020 has proven a harrowing year – but it has produced remarkable heroes. To that end, this year, the Association for Computing Machinery (ACM) introduced the Read more…

By Oliver Peckham

Nvidia and EuroHPC Team for Four Supercomputers, Including Massive ‘Leonardo’ System

October 15, 2020

The EuroHPC Joint Undertaking (JU) serves as Europe’s concerted supercomputing play, currently comprising 32 member states and billions of euros in funding. I Read more…

By Oliver Peckham

Intel Xe-HP GPU Deployed for Aurora Exascale Development

November 17, 2020

At SC20, Intel announced that it is making its Xe-HP high performance discrete GPUs available to early access developers. Notably, the new chips have been deplo Read more…

By Tiffany Trader

Nvidia-Arm Deal a Boon for RISC-V?

October 26, 2020

The $40 billion blockbuster acquisition deal that will bring chipmaker Arm into the Nvidia corporate family could provide a boost for the competing RISC-V architecture. As regulators in the U.S., China and the European Union begin scrutinizing the impact of the blockbuster deal on semiconductor industry competition and innovation, the deal has at the very least... Read more…

By George Leopold

HPE, AMD and EuroHPC Partner for Pre-Exascale LUMI Supercomputer

October 21, 2020

Not even a week after Nvidia announced that it would be providing hardware for the first four of the eight planned EuroHPC systems, HPE and AMD are announcing a Read more…

By Oliver Peckham

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This