ISC 2020 Keynote: Hope for the Future, Praise for Fugaku and HPC’s Pandemic Response

By John Russell

June 24, 2020

In stark contrast to past years Thomas Sterling’s ISC20 keynote today struck a more somber note with the COVID-19 pandemic as the central character in Sterling’s annual review of worldwide trends in HPC. Better known for his engaging manner and occasional willingness to poke prickly egos, Sterling instead strode through the numbing statistics associated with the pandemic and then turned to optimistically spotlighting the worldwide HPC community’s rapid pivot to attack the pandemic.

Perhaps this was appropriate. The full-fledged ISC20 conference scheduled to happen in Frankfurt, Germany, has been transformed into a digital-only event with a slimmed down agenda delivered by livestreaming and prerecorded sessions. Sterling’s keynote – his 17th – was prerecorded. The pandemic has wreaked global damage on health, economies, and disrupted lifestyles.

Thomas Sterling, Indiana University

“This was not easy for this year because this year is like no other year, certainly in my lifetime, and I suspect most of yours. I chose ‘a world in hope’ (as his theme for the talk) because wherever you are, I think we are experiencing the same sense of need for a future that is better than our present. Sadly, many have lost their lives since I last presented on this virtual stage. And there are more than one kind of worldwide dilemma, crisis, really. It is a health crisis. It is an economic crisis. And for those of us in the US, it is a crisis of conscience, long overdue,” said Sterling.

There was some core HPC technology discussion too. Sterling, professor of engineering and director, AI Computing Systems Laboratory, Indiana University, lauded the innovation and performance of Fugaku at the RIKEN Center for Computational Science (R-CCS) in Kobe, Japan, which finished atop the latest Top500 by a wide margin.

“This is the machine that I would claim was redesigned and pretty much from scratch now. Now that’s not 100% true, it does use an ARM CPU and a manycore [approach] for that matter. But this is on a chip and a framework that provide very high bandwidth and very low latency at least among the local parts, and a very high bandwidth and rapid switching network on the chip and scalability. In addition, it has a custom-made accelerator, a processor sort of GPU-like, the SVE, designed by Fujitsu, that is both speeding up integer and floating point at different speeds including its own cache and very tight packaging, as well as massive memory bandwidth in and out of the accelerator.

“This machine is not just for numeric computing. It is not just for graph computing. It is designed in particular to be able to advance the capabilities in artificial intelligence, including but not limited to machine learning. And one exceptional note is that because of the COVID-19 crisis, [Satoshi] Matsuoka at RIKEN and his colleagues expedited the deployment. Originally it was supposed to be deployed next year in 2021. But part of the machine, at least has been deployed now and is running science applications targeting the COVID virus.”

During his talk, Sterling also dug into the long-term structure of the Top500 and noted the passing of a few prominent members of the HPC community as well as recent award winners (more on both below) but he spent most of his time discussing the pandemic and HPC efforts to collaborate in the fight against it. It was a different sort of talk.

Here are two sobering slides and one hopeful one.

 

“You can observe the red curve that shows the monotonically increasing number of cases. The other bars are the additional cases added per day in the different geographical areas. Yellow is the European. Green is the Americas. And I believe that includes not just the US, UK, Canada, Mexico and South America and the gray covering the myriad other countries That either have or are now experiencing the same virus. And in that area, you can see that this represents the trend, the transmission of that disease to many other areas parts of the world,” said Sterling.

“This is more sobering, if anything, this is not the number of cases this is the number of confirmed deaths on a per daily basis and the black line is the world, the world the death rate, where we see that something on the order today of 3000 4000 people dying every day. I guess the good news here is that number was more like 8000 on a daily basis. And has somewhat subsided,” he said.

He commented on the fact that various regions have taken different approaches to controlling the pandemic, noting for example that Sweden is relying on development of “herd immunity” over time which implies taking less active measures. He left open the question of how effective that strategy would be. He’s encouraged by some efforts, in particular New York City, which is where he was born. See slide below.

Sterling, “This chart is from New York City and it is a number of deaths on a daily basis for New York. And, clearly the take home message is that New York suffered greatly at its peak, and through stringent measures, and I credit both the governor of New York Cuomo and the mayor of New York City de Blasio and they didn’t always get along with each other but by together applying control methods with the engagement of the populace, mostly, we see that the number of deaths per day diminished to not quite, but almost zero. This is this is wonderful. I can’t help but stare at it.

Veering away from the somber tone used to review pandemic statistics, Sterling perked up, enthusiastically pointing to the worldwide HPC community’s massive pivot to engage in pandemic-fighting research. The HPC world came together, “not purely by intent or organization, but by natural inclination internationally to address the challenges of this unique and painful crisis around the world. I just want to show you a few examples and my apologies ahead of time, if I don’t do adequate representation of all of the places are working to this,” according to Sterling.

It’s best to watch his presentation for the details. Here are just three of his slides singling out efforts representative of the HPC community.

In looking at the Top500 constituents over time, Sterling presented what may be familiar data to some contending the Top500 is comprised of three distinct communities – Three Worlds of Supercomputing – in which the top ten or so systems have always accounted for the bulk of the performance and where the number of system in each category has remained roughly the same.

“We recognized that, in fact, there isn’t a continuum of capabilities across those two orders of magnitude, which is roughly what it is between number 500 and number one, but rather, it’s highly polarized where as you can see on this chart, the Top500 rank is on the horizontal axis and the machines are ordered from right to left in the lowest performance up to the highest performance. And what you see here is that or at least we assert that there’s kind of a three worlds model. There’s what we call the mainstream, which sits at almost flat horizontal line that covers most of the graph. It’s leadership meet leadership machines, which is that essentially vertical line way at the left. Then there is this small number of additional machines that kind of fit between there.

“But the big take home from this is almost all of the computing that almost all of us get to do – it’s along that horizontal law line and is mainstream and while we put a lot of focus attention on that vertical line, it represents a very small percentage in numbers in counts within the top 500. It does consume a disproportionately large amount of the total aggregate performance. If we take that data and we look at it a little different, we see that the mainstream computing, the number of machines on the mainstream computing is almost flat. And this is just over the last decade. This doesn’t go all the way back.

“You also see that the leadership machines, the number is well within the top 10. And this has been continuously so the top 10 machines are where the leadership computers reside, and they certainly don’t reach out further than that. Then somewhat varying is, is the middle area that goes roughly between a factor of 10 and maybe something like 70 or 80 across the years. I also point out to you that on the right-hand side of this, for all three, the numbers are really flatlining, which is interesting,” said Sterling.

Further analysis, said Sterling, reveals “It takes about eight years for a machine that would be number one on the list to slowly migrate down to number 500. And you can do that exercise yourself. So the lifespan of a machine is about eight years. Many machines don’t survive that long because while they do provide that performance or other normalizing factors, not the least of which is operational cost and the value of floor space, which could hold a lot more performance, so we tend to roll them out faster than that.”

He singled out recent HPC award winners – Geoffrey C. Fox (Kennedy Award), Edwin Catmull and Pat Hanrahan (ACM Turning Award), David Kirk (Seymour Cray Award), and Chris Johnson (Sidney Fernbach Award). Sterling also paid tribute to a number of HPC community members. Among those who have recently passed away he singled out Rich Bruekner (InsideHPC), Anne Redelfs (UCSD), Steve Tuecke (ANL and Globus founder), and Lucy Nowell (DoE, NSF).

Sterling’s slides about the winners and the recent losses, with a few details about each, are presented at the end of this article.

Here’s a link to his full keynote: https://2020.isc-program.com/presentation/?id=int_spe102&sess=sess272

AWARD WINNERS

IN MEMORIAM

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Intel’s Silicon Brain System a Blueprint for Future AI Computing Architectures

April 24, 2024

Intel is releasing a whole arsenal of AI chips and systems hoping something will stick in the market. Its latest entry is a neuromorphic system called Hala Point. The system includes Intel's research chip called Loihi 2, Read more…

Anders Dam Jensen on HPC Sovereignty, Sustainability, and JU Progress

April 23, 2024

The recent 2024 EuroHPC Summit meeting took place in Antwerp, with attendance substantially up since 2023 to 750 participants. HPCwire asked Intersect360 Research senior analyst Steve Conway, who closely tracks HPC, AI, Read more…

AI Saves the Planet this Earth Day

April 22, 2024

Earth Day was originally conceived as a day of reflection. Our planet’s life-sustaining properties are unlike any other celestial body that we’ve observed, and this day of contemplation is meant to provide all of us Read more…

Intel Announces Hala Point – World’s Largest Neuromorphic System for Sustainable AI

April 22, 2024

As we find ourselves on the brink of a technological revolution, the need for efficient and sustainable computing solutions has never been more critical.  A computer system that can mimic the way humans process and s Read more…

Empowering High-Performance Computing for Artificial Intelligence

April 19, 2024

Artificial intelligence (AI) presents some of the most challenging demands in information technology, especially concerning computing power and data movement. As a result of these challenges, high-performance computing Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that have occurred about once a decade. With this in mind, the ISC Read more…

Intel’s Silicon Brain System a Blueprint for Future AI Computing Architectures

April 24, 2024

Intel is releasing a whole arsenal of AI chips and systems hoping something will stick in the market. Its latest entry is a neuromorphic system called Hala Poin Read more…

Anders Dam Jensen on HPC Sovereignty, Sustainability, and JU Progress

April 23, 2024

The recent 2024 EuroHPC Summit meeting took place in Antwerp, with attendance substantially up since 2023 to 750 participants. HPCwire asked Intersect360 Resear Read more…

AI Saves the Planet this Earth Day

April 22, 2024

Earth Day was originally conceived as a day of reflection. Our planet’s life-sustaining properties are unlike any other celestial body that we’ve observed, Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that ha Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use o Read more…

MLCommons Launches New AI Safety Benchmark Initiative

April 16, 2024

MLCommons, organizer of the popular MLPerf benchmarking exercises (training and inference), is starting a new effort to benchmark AI Safety, one of the most pre Read more…

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

April 15, 2024

As the AI revolution marches on, it is vital to continually reassess how this technology is reshaping our world. To that end, researchers at Stanford’s Instit Read more…

Intel’s Vision Advantage: Chips Are Available Off-the-Shelf

April 11, 2024

The chip market is facing a crisis: chip development is now concentrated in the hands of the few. A confluence of events this week reminded us how few chips Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Baidu Exits Quantum, Closely Following Alibaba’s Earlier Move

January 5, 2024

Reuters reported this week that Baidu, China’s giant e-commerce and services provider, is exiting the quantum computing development arena. Reuters reported � Read more…

Shutterstock 1179408610

Google Addresses the Mysteries of Its Hypercomputer 

December 28, 2023

When Google launched its Hypercomputer earlier this month (December 2023), the first reaction was, "Say what?" It turns out that the Hypercomputer is Google's t Read more…

AMD MI3000A

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Leading Solution Providers

Contributors

Shutterstock 1606064203

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

China Is All In on a RISC-V Future

January 8, 2024

The state of RISC-V in China was discussed in a recent report released by the Jamestown Foundation, a Washington, D.C.-based think tank. The report, entitled "E Read more…

Shutterstock 1285747942

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

Eyes on the Quantum Prize – D-Wave Says its Time is Now

January 30, 2024

Early quantum computing pioneer D-Wave again asserted – that at least for D-Wave – the commercial quantum era has begun. Speaking at its first in-person Ana Read more…

GenAI Having Major Impact on Data Culture, Survey Says

February 21, 2024

While 2023 was the year of GenAI, the adoption rates for GenAI did not match expectations. Most organizations are continuing to invest in GenAI but are yet to Read more…

The GenAI Datacenter Squeeze Is Here

February 1, 2024

The immediate effect of the GenAI GPU Squeeze was to reduce availability, either direct purchase or cloud access, increase cost, and push demand through the roof. A secondary issue has been developing over the last several years. Even though your organization secured several racks... Read more…

Intel’s Xeon General Manager Talks about Server Chips 

January 2, 2024

Intel is talking data-center growth and is done digging graves for its dead enterprise products, including GPUs, storage, and networking products, which fell to Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire