Top500 Purely Petaflops; US Maintains Performance Lead

By Tiffany Trader

June 17, 2019

Editor’s note: In preparing this article, I noticed an irregularity with the Top500 reporting in that Sierra and two other NNSA/LLNL systems (Lassen and Ansel) were omitted from IBM’s totals (by system share and by performance share), which incorrectly put IBM in second place by aggregate list performance, instead of first place. An earlier version of this article pointed out that irregularity. On Monday (June 17) Top500 coauthor Erich Strohmaier confirmed to HPCwire that this was an aggregation error, triggered by the manufacturer field being listed as “IBM/Nvidia/Mellanox” for these three machines. The article has been amended to reflect the correct information.

With the kick-off of the International Supercomputing Conference (ISC) in Frankfurt this morning, the 53rd Top500 list made its debut, and this one’s for petafloppers only. The entry point for the new list is 1.022 petaflops, and the total aggregate performance of all 500 systems for the first time exceeds 1.5 exaflops. The top new entry to the list is Frontera, the NSF-funded Dell EMC system installed at the Texas Advanced Computing Center (TACC), at number five with 23.5 Linpack petaflops.

China claims the most systems (219) with 208 of these pretty evenly split among Lenovo (74), Inspur (71) and Sugon (63). Lenovo, which acquired IBM’s x86 business in 2014, maintains its lead as the number one Top500 vendor by number of systems (173). As we continue to note, the majority of these are not HPC systems or supercomputers; in fact, over half the full list (about 55 percent) is now comprised of web-scale/cloud systems.

The US (with 116 systems) trails China by nearly half in terms of system share, but when it comes to aggregate performance share, the US leads with 38.4 percent of list flops compared with China’s 29.9 percent. Summit and Sierra — still at number one and two respectively — enjoy 243.2 Linpack petaflops between them, accounting for nearly a sixth of the list’s aggregate capacity. Oak Ridge National Laboratory has optimized Summit’s Linpack performance, such that it now delivers 148.6 petaflops on the HPL benchmark, versus 143.5 petaflops six months ago.

The prime manufacturer behind Summit and Sierra, IBM, is in first place by aggregate list performance with 321 petaflops, ahead of Lenovo, which touts 302 total petaflops. Cray has the next highest aggregate list performance share (183.4 petaflops), followed by HPE (120.1 petaflops) and Sugon (96 petaflops). In anticipation of HPE purchasing Cray, we note that a combined union on this list would deliver a total of 303.5 aggregate petaflops — still shy of IBM’s total, and a smidgen ahead of Lenovo’s.

Restricting our analysis to the top 100 systems to better gauge the HPC/supercomputing picture (an imperfect but quick reckoning), the top five countries by system share are US with 37, Japan with 15, China and Germany tied with 9, followed by France with six systems (including two Jean Zay systems, an accelerated machine in 42nd position and a CPU-only one in 72nd). By aggregate performance, US leads the top 100 pack with 49 percent of the flops, China with 17 percent, Japan with 10 percent, and Germany and France both with 5 percent. The top vendors by system share in the top 100 grouping are Cray with 25, HPE with 18, IBM with eight [Ed. this is the corrected number], Fujitsu with seven, and then a three-way tie between Dell EMC, Nvidia and Sugon (all with three systems). By aggregate performance share, the order changes to IBM, Cray, HPE, Fujitsu and Dell EMC.

June 2019 Top 12 – Click to Expand (Source: Top500)

View from the Top: Bienvenida Frontera

Frontera, the Dell C6420 system installed at the TACC, makes its list debut at number five, achieving 23.5 Linpack petaflops (38.7 peak petaflops) using 448,448 Intel Xeon cores. Frontera’s 8,000 nodes are hooked together using Mellanox HDR InfiniBand. This is the debut list for HDR, which connects three other systems (#68, #166, and #264) in addition to Frontera.

Lassen — Sierra’s unclassified “companion” system at LLNL that was built by IBM, Nvidia and Mellanox — moves up into the number 10 spot with an upgraded capacity of 18.2 Linpack petaflops.

Titan, ORNL’s Cray XK7 system, which had a long reign as the United States’ fastest supercomputer since its debut as an upgrade to Jaguar in November 2012 until Summit was deployed in June 2018, has shifted out of the top 10, as has Sequoia, the last BlueGene/Q system at LLNL.

Taking Lassen’s previous spot in 11th place is another IBM system, a new entrant to the list and the fastest commercial supercomputer: Pangea III. Measured at 17.9 Linpack petaflops (rPeak 25 petaflops), the system was built by IBM for Total to help the energy company more accurately locate new oil reservoirs and better assess the potential associated revenue opportunities.

One vendor missing from the top 10 pack is HPE, an observation made by market analyst Addison Snell, CEO of Intersect360 Research. “IBM, Cray, Fujitsu, Lenovo, and Dell EMC all have top ten systems. The biggest HPE system is no slouch, at number 17, the HPC4 system at Italian energy company Eni. And of course, HPE will soon acquire Cray, inheriting the plans for the first two exascale systems in the US,” Snell commented to HPCwire.

How the Chips Fall

While there are more CPU options appearing in the general server market, the trend has not had much of an impact on the Top500 yet. Intel chips still dominate in so far as they appear in 95.6 percent of all systems, although in some cases, much or most of the heavy lifting is owed to accelerators or coprocessors. IBM Power processors are in seven systems, followed by AMD CPUs, which are used in three systems. A single supercomputer on the list, Astra at Sandia National Labs, is powered by (Marvell/Cavium) Arm processors. Built by HPE, Astra debuted on the November 2018 list at 204 with 1.5 petaflops. An upgrade goosed Linpack to 1.76 petaflops, elevating the system into 156th position.

Accelerated Computing

There are 133 systems that use accelerator or co-processor technology, dipping slightly from 138 six months ago. Overall the trend toward acceleration remains strong with Nvidia seeing 25 percent growth in the number of Top500 systems that employ its GPUs in the last year. The total number of Nvidia GPU-accelerated systems has gone from 98 last June 2018 to 125 total systems on the current list.

Nvidia’s DGX SuperPOD

With AMD and Intel server GPUs coming into the market, or preparing to in Intel’s case — including already announced big wins (Aurora, Frontier) — this will be an interesting trend to follow.

Nvidia has two DGX-2 systems on the list, Circe and DGX SuperPod, which integrate Xeon Skylake Platinum 8174 processors, Nvidia V100 GPUs, NVLink technology and Mellanox EDR InfiniBand. Both are used internally for research and company workloads.

Circe, a cluster of 36 DGX-2H boxes that ranks 71 with 3.1 Linpack petaflops, entered the list last November. DGX SuperPod, comprising 96 DGX-2H nodes, joins the list today at number 22 with a Linpack score of 9.4 petaflops. Nvidia used an earlier version of this system for its MLPerf benchmarking.

“This is our system that we use to develop most notably the brains for our autonomous vehicle platform,” said Ian Buck, general manager and vice president of accelerated computing at Nvidia, referring to the DGX SuperPod. He noted that the 96 DGX systems are highly connected. “We have more than 10 Mellanox EDR InfiniBand NICs per node connected together with multiple direction level switches, connecting more than 1,500 tensor core GPUs.”

Hyperscale Takeover Continues

A tipping point has been reached, where the number of IT/web-scale machines on the Top500 now outnumber true HPC clusters and supercomputers. Claims of list-stuffing were raised last year at ISC due to an increase in hyperscale systems being creatively benchmarked for the list (which to be clear is not breaking any rules and is not a new practice; it dates back to at least 2010). The renewed enthusiasm for running the HPL benchmark on cloud systems that will be deployed and leveraged in a disaggregrated manner is partly behind China’s ascendance on the list as Chinese vendors Sugon, Inspur and Lenovo have made a concerted effort to get their systems on the list. Somebody has to lead these efforts; they don’t just happen.

In-line with this trend, Gigabit Ethernet (GE) continues to expand its hold on the list, connecting 271 systems (a 54.2 share). By aggregate flops, GE carries just 25 percent of list share. The other 75 percent of flops were conducted via InfiniBand, OmniPath, custom and proprietary interconnects.

New Entrant Profile

There are 94 new entrants to the list, bringing in 223 petaflops with China backing 35 percent of these flops and US responsible for 30 percent. Of the 94 additions, 45 are Chinese and 17 are US. Roughly two-thirds of the new systems are cloud/web-scale machines.

Green500 & HPCG

The Green500, which tracks the most energy-efficient Top500 machines, has been integrated into the Top500 reporting process and the High Performance Conjugate Gradients (HPCG) benchmark is also included in the list.

Maintaining their positions from six months ago, the top two Green500 systems are Shoubu system B and DGX SaturnV Volta ranked 471 and 469 on the Top500, respectively. Shoubu system B, made by PEZY/Exascalar and located at RIKEN, achieves 17.6 gigaflops/watt; while DGX SaturnV Volta, made by Nvidia for Nvidia, delivers 15.1 gigaflops/watt.

Top500 champ Summit, located at Oak Ridge National Laboratory in Tennessee, is in third place on the Green500 with a slightly improved result of 14.71 gigaflops/watt.

At number four is the AI Bridging Cloud Infrastructure (ABCI) system installed at the National Institute of Advanced Industrial Science and Technology (AIST) in Japan. Number seven on the Top500, ABCI delivers 14.4 gigaflops/watt.

The MareNostrum P9 CTE, moves up three spots into fifth place with a refreshed result of 14.1 gigaflops/watt. Built by IBM for Barcelona Supercomputing Center, the machine delivers 1.14 Linpack petaflops for a 393rd ranking on the Top500.

TSUBAME 3.0 drops from fifth to sixth place on the Green500 with a result of 13.7 gigaflops/watt. The 8.1 petaflops supercomputer, 25th on the current Top500 list, was made by HPE for GSIC Center, Tokyo Institute of Technology.

Total’s new supercomputer, Pangea III, new to the Top500 list in 11th place, nabs the seventh position on the Green500, achieving 13.07 gigaflops/watt.

Sierra, LLNL’s flagship system that is number two on the Top500, moves to eighth place, maintaining its efficiency score of 12.72 gigaflops/watt.

Sugon’s Advanced Computing System (PreE) and Taiwania 2 (installed at Taiwan’s National Center for High Performance Computing) captured the nine and ten positions on the Green500, respectively. The PreE system—number 43 on the Top500 with 4.3 petaflops—is powered by the Dhyana CPU, a Hygon-licensed implementation of AMD’s Epyc processor. Built by Quanta Computer, Taiwania 2 occupies spot number 23 on the Top500 with a Linpack rating of 9 petaflops.

All of the top 10 greenest supercomputers employ accelerators (primarily Nvidia GPUs) in a hybrid system design. In fact, the highest-ranking Green500 CPU-only machine is Sunway TaihuLight in 25th place. China’s TaihuLight, the pride of Wuxi, home to the computer’s eponymous Taihu Lake, maintains its third-spot ranking on the Top500 with 93 Linpack petaflops.

One of the encouraging things about the Green500 is the diversity of international representation. Japan, the United States, Spain, France, China, Taiwan, the United Kingdom, Italy, Canada, Switzerland and Brazil all have machines in the top 25.

On the HPCG, Summit and Sierra hold the top positions with Japan’s K computer at number three. Respectively, they achieved 2.93 HPCG-petaflops, 1.80 HPCG-petaflops, and .60 HPCG-petaflops. As the list authors note, “Summit and Sierra remain the only two systems to exceed a petaflop on the HPCG benchmark…. The average HPCG result on the current list is 213.3 teraflops, a marginal increase from 211.2 six months ago.”

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

What’s New in HPC Research: ROBE, OpenMP Automated Scheduling, SCALSALE, & More

October 6, 2022

In this regular feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From parallel programming to exascale to quantum computing, the details are here. Read more…

China Expands Quantum Computing Development Push says Report

October 6, 2022

The drumbeat around development of quantum computing continues to grow in mainstream media, as evidenced by a report in today’s Wall Street Journal (China Seeks a Quantum Leap in Computing). While timelines for practic Read more…

Intel Is Opening up Its Chip Factories to Academia

October 6, 2022

Intel is opening up its fabs for academic institutions so researchers can get their hands on physical versions of its chips, with the end goal of boosting semiconductor research and development. The effort, called the Read more…

NCSA’s Delta System Enters Full Production

October 5, 2022

In June 2020, the NSF awarded the National Center for Supercomputing Applications (NCSA) $10 million for its post-Blue Waters “Delta” supercomputer. Now, that funding has come to fruition: NCSA has announced that Del Read more…

GM, National Labs Pave the Way for Next-Gen Vehicles

October 5, 2022

For the better part of a century, General Motors (GM) was the biggest automaker in the world. Now, amid a paradigm shift toward smarter, electrified vehicles, the leading American automaker is working to meet the moment Read more…

AWS Solution Channel

Shutterstock 242193979

Last Chance to Vote – AWS for Best Use of HPC in the 2022 Readers’ Choice Awards

AWS Once Again Showcasing its Reach Across Several Areas in HPC

Ten categories feature Amazon Web Services (AWS) in the 2022 HPCwire Readers’ Choice Awards. Read more…

Microsoft/NVIDIA Solution Channel

Shutterstock 1166887495

Improving Insurance Fraud Detection using AI Running on Cloud-based GPU-Accelerated Systems

Insurance is a highly regulated industry that is evolving as the industry faces changing customer expectations, massive amounts of data, and increased regulations. A major issue facing the industry is tracking insurance fraud. Read more…

Quantum Computing Needs More Public-Private Collaboration Says QED-C

October 4, 2022

Last week the Quantum Economic Development Consortium (QED-C) released a new report – Public-Private Partnerships in Quantum Computing – that calls for increased government-commercial collaboration, broadly describes Read more…

Intel Is Opening up Its Chip Factories to Academia

October 6, 2022

Intel is opening up its fabs for academic institutions so researchers can get their hands on physical versions of its chips, with the end goal of boosting semic Read more…

GM, National Labs Pave the Way for Next-Gen Vehicles

October 5, 2022

For the better part of a century, General Motors (GM) was the biggest automaker in the world. Now, amid a paradigm shift toward smarter, electrified vehicles, t Read more…

Quantum Computing Needs More Public-Private Collaboration Says QED-C

October 4, 2022

Last week the Quantum Economic Development Consortium (QED-C) released a new report – Public-Private Partnerships in Quantum Computing – that calls for incr Read more…

How Intel Plans to Rebuild Its Manufacturing Supply Chain

September 29, 2022

Intel's engineering roots saw a revival at this week's Innovation, with attendees recalling the show’s resemblance to Intel Developer Forum, the company's ann Read more…

Intel Labs Launches Neuromorphic ‘Kapoho Point’ Board

September 28, 2022

Over the past five years, Intel has been iterating on its neuromorphic chips and systems, aiming to create devices (and software for those devices) that closely Read more…

HPE to Build 100+ Petaflops Shaheen III Supercomputer

September 27, 2022

The King Abdullah University of Science and Technology (KAUST) in Saudi Arabia has announced that HPE has won the bid to build the Shaheen III supercomputer. Sh Read more…

Intel’s New Programmable Chips Next Year to Replace Aging Products

September 27, 2022

Intel shared its latest roadmap of programmable chips, and doesn't want to dig itself into a hole by following AMD's strategy in the area.  "We're thankfully not matching their strategy," said Shannon Poulin, corporate vice president for the datacenter and AI group at Intel, in response to a question posed by HPCwire during a press briefing. The updated roadmap pieces together Intel's strategy for FPGAs... Read more…

Intel Ships Sapphire Rapids – to Its Cloud

September 27, 2022

Intel has had trouble getting its chips in the hands of customers on time, but is providing the next best thing – to try out those chips in the cloud. Delayed chips such as Sapphire Rapids server processors and Habana Gaudi 2 AI chip will be available on a platform called the Intel Developer Cloud, which was announced at the Intel Innovation event being held in San Jose, California. Read more…

Nvidia Shuts Out RISC-V Software Support for GPUs 

September 23, 2022

Nvidia is not interested in bringing software support to its GPUs for the RISC-V architecture despite being an early adopter of the open-source technology in its GPU controllers. Nvidia has no plans to add RISC-V support for CUDA, which is the proprietary GPU software platform, a company representative... Read more…

AWS Takes the Short and Long View of Quantum Computing

August 30, 2022

It is perhaps not surprising that the big cloud providers – a poor term really – have jumped into quantum computing. Amazon, Microsoft Azure, Google, and th Read more…

US Senate Passes CHIPS Act Temperature Check, but Challenges Linger

July 19, 2022

The U.S. Senate on Tuesday passed a major hurdle that will open up close to $52 billion in grants for the semiconductor industry to boost manufacturing, supply chain and research and development. U.S. senators voted 64-34 in favor of advancing the CHIPS Act, which sets the stage for the final consideration... Read more…

Chinese Startup Biren Details BR100 GPU

August 22, 2022

Amid the high-performance GPU turf tussle between AMD and Nvidia (and soon, Intel), a new, China-based player is emerging: Biren Technology, founded in 2019 and headquartered in Shanghai. At Hot Chips 34, Biren co-founder and president Lingjie Xu and Biren CTO Mike Hong took the (virtual) stage to detail the company’s inaugural product: the Biren BR100 general-purpose GPU (GPGPU). “It is my honor to present... Read more…

Tesla Bulks Up Its GPU-Powered AI Super – Is Dojo Next?

August 16, 2022

Tesla has revealed that its biggest in-house AI supercomputer – which we wrote about last year – now has a total of 7,360 A100 GPUs, a nearly 28 percent uplift from its previous total of 5,760 GPUs. That’s enough GPU oomph for a top seven spot on the Top500, although the tech company best known for its electric vehicles has not publicly benchmarked the system. If it had, it would... Read more…

AMD’s MI300 APUs to Power Exascale El Capitan Supercomputer

June 21, 2022

Additional details of the architecture of the exascale El Capitan supercomputer were disclosed today by Lawrence Livermore National Laboratory’s (LLNL) Terri Read more…

Exclusive Inside Look at First US Exascale Supercomputer

July 1, 2022

HPCwire takes you inside the Frontier datacenter at DOE's Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tenn., for an interview with Frontier Project Direc Read more…

AMD Opens Up Chip Design to the Outside for Custom Future

June 15, 2022

AMD is getting personal with chips as it sets sail to make products more to the liking of its customers. The chipmaker detailed a modular chip future in which customers can mix and match non-AMD processors in a custom chip package. "We are focused on making it easier to implement chips with more flexibility," said Mark Papermaster, chief technology officer at AMD during the analyst day meeting late last week. Read more…

Leading Solution Providers

Contributors

Nvidia, Intel to Power Atos-Built MareNostrum 5 Supercomputer

June 16, 2022

The long-troubled, hotly anticipated MareNostrum 5 supercomputer finally has a vendor: Atos, which will be supplying a system that includes both Nvidia and Inte Read more…

UCIe Consortium Incorporates, Nvidia and Alibaba Round Out Board

August 2, 2022

The Universal Chiplet Interconnect Express (UCIe) consortium is moving ahead with its effort to standardize a universal interconnect at the package level. The c Read more…

Using Exascale Supercomputers to Make Clean Fusion Energy Possible

September 2, 2022

Fusion, the nuclear reaction that powers the Sun and the stars, has incredible potential as a source of safe, carbon-free and essentially limitless energy. But Read more…

Is Time Running Out for Compromise on America COMPETES/USICA Act?

June 22, 2022

You may recall that efforts proposed in 2020 to remake the National Science Foundation (Endless Frontier Act) have since expanded and morphed into two gigantic bills, the America COMPETES Act in the U.S. House of Representatives and the U.S. Innovation and Competition Act in the U.S. Senate. So far, efforts to reconcile the two pieces of legislation have snagged and recent reports... Read more…

Nvidia, Qualcomm Shine in MLPerf Inference; Intel’s Sapphire Rapids Makes an Appearance.

September 8, 2022

The steady maturation of MLCommons/MLPerf as an AI benchmarking tool was apparent in today’s release of MLPerf v2.1 Inference results. Twenty-one organization Read more…

Not Just Cash for Chips – The New Chips and Science Act Boosts NSF, DOE, NIST

August 3, 2022

After two-plus years of contentious debate, several different names, and final passage by the House (243-187) and Senate (64-33) last week, the Chips and Science Act will soon become law. Besides the $54.2 billion provided to boost US-based chip manufacturing, the act reshapes US science policy in meaningful ways. NSF’s proposed budget... Read more…

AMD Lines Up Alternate Chips as It Eyes a ‘Post-exaflops’ Future

June 10, 2022

Close to a decade ago, AMD was in turmoil. The company was playing second fiddle to Intel in PCs and datacenters, and its road to profitability hinged mostly on Read more…

RISC-V Opens the Door on 48-bit Computing

July 7, 2022

There’s a growing interest among silicon providers backing RISC-V to introduce 48-bit computing in custom chips to meet their specific requirements. The 48-bit long instructions focus is more as a middle ground between 32-bit and 64-bit, which has largely been the focus of chips and instruction sets until now. Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire