Thomas Sterling’s ISC 2016 Closing Keynote

By John Russell

June 23, 2016

Capturing the sparkle, wit, and selective skewering in Thomas Sterling’s annual closing ISC keynote is challenging. This year was his 13th, which perhaps conveys the engaging manner and substantive content he delivers. Like many in the room, Sterling is an HPC pioneer as well as the director of CREST, the Center for Research in Extreme Scale Technologies, Indiana University. In his ISC talk, Sterling holds up a mirror to the HPC world, shares what he sees, and invites all to look in as well and see what they may.

Sterling
Sterling

It’s perhaps no surprise that China was high on Sterling’s list this year. He also paid homage to Marvin Minsky, provided encouragement and gentle chiding for the National Strategic Computing Initiative (NSCI) and there were shout-outs to Intel, Data Vortex, and Africa CHPCS. He also suggested the supercomputing world may resemble a great iceberg where the public systems we see and worry about are actually just the just the tip of the iceberg while hidden below are huge commercial and government system whose capabilities perhaps dwarf those of the Top500.

You get the picture. His view is expansive. Presented here is a very brief (apologies to Prof. Sterling) random walk through Sterling’s talk and few slides from his ISC deck. Certainly not everything is covered but there’s plenty to chew on. (If you bump into Sterling you might ask of his interesting theory why dinosaurs never made it onto The Ark, which didn’t make it into the article.)

ISC.Sterling.Trends

The Elephant in the Room

There’s no way around it, said Sterling, the Chinese now dominate supercomputing big time (see HPCwire article, China Debuts 93-Petaflops ‘Sunway’ with Homegrown Processors). The Sunway TaihuLight system, achieved 93 petaflops out of a theoretical peak of 125 petaflops, giving it an efficiency of 74.51 percent. The new king of the Top500 solidifies China’s position.

Instead of hand wringing, Sterling said, “There’s a wow factor here and let’s sit back and really enjoy this for a moment. For me even more impressive is the fact that this is a China homegrown system. They designed the architecture, the structure and instruction set, they designed and deployed the electronics, and they did the systems integration. There are some parts contributed from outside but I’ll tell you all of our big machines back in the states also have some memory that wasn’t home grown.” Sunway’s power budget of only 15MW is extraordinary, he said.

There’s been a fair amount of comment around Sunway’s perceived shortcoming. Duly noted, agreed Sterling. “In terms of memory capacity, it’s pretty light weight with 125 PF peak performance it had about 1/100th that amount in terms of memory capacity at 1.3PB. Its bandwidth also, in terms of the rate with which information is delivered from the memory versus the peak rate of computing isn’t impressive. Its HPSG benchmark – which is proving more realistic in exercising more part of the machine – is below one percent.”

ISC.Sterling.Sunway pic

That said the world is changing. By Sterling’s count we are now in our sixth computing epoch. Technology is forcing us into a new class of optimization, which “we are reluctant, in some sense intransigent” in pursuing, he said.

“As we have moved into the era of [multicore/manycore] with an strong emphasis on heterogeneous computing we’ve all known that we’ve moved away from the more conventional and frankly more comfortable model,” he said. Floating point efficiency, a key driver of architecture, “is not the critical objective function it was. Today it is among the easiest things to manufacture and to deploy and so we should be making a lot of them everywhere.”

Instead, said Sterling, it’s the instruction issue rate that counts and that should prompt changes in thinking. “We routinely waste die area by building an enormous memory hierarchies and then add speculative execution in many different forms including pre-fetching, speculative fetching, branch prediction and any number mechanisms and architecture to do what? To keep the ALU busy. [We] need to clean up all that mess. That’s what the Chinese machine is attempting to do. Perhaps we should be valuing the FPU availability not the utilization. This machine does that,” he said.

ISC.Sterling.Sunway

Some critics have said that’s all well and good but the new machine can’t do anything useful. Sterling noted that Sunway runs three Gordon Bell prize finalists – clearly excellent applications – and there are tasks it can handle quite well. He also noted China now has the largest number of deployed systems on the Top500 – 167 surpassing the US’s 165 deployed.

“While the numbers don’t matter this indicates a keen commitment, a commitment not just to having the high stature machine but rather to having a large number of medium scale systems doing a lot of the heavy lifting of high performance computing,” Sterling noted. The supercomputer game is hardly over, but the current round has gone to China.

Marvelous Marvin Minsky

“This year we lost Marvin Minsky,” said Sterling. “His name is synonymous with the term of artificial intelligence in the same way Seymour Cray is synonymous with the word supercomputer. In the early years of the 50s and 60s Marvin laid [computer science] foundations and won essentially every award one can win including the Alan Turing Award in 1969.”

ISC.Sterling.Minsky

Minsky was the co-founder of the MIT Artificial Intelligence Laboratory, who passed away in January. Part of what makes the passing notable, said Sterling is the palpable change in the industry around deep learning, machine learning, cognitive computing none. AI may become the one of the biggest application for supercomputing said Sterling. But rather than dense floating point or even large data processing to do pattern recognition and clustering, it will require, “symbolic computing, the creation of machines that think, more importantly, the creation of machines that understand.”

To Sterling that still seems far away. “Let’s acknowledged the fact the today machines don’t learn anything. We learn. If they learned they wouldn’t just have data converted to information in pretty pictures. They would have data converted to knowledge and be able to manipulate that.”

Maybe that’s good. Minsky believed intelligent machines would take over. He once joked if humans we’re lucky the machines would keep us keep us as pets, said Sterling. In a slightly more serious moment, Minsky offered “yes intelligent machines would take over but they will be our children,” said Sterling.

However the AI adventure plays out, Minsky was seminal force in developing basic ideas around the path to AI and the computing approaches necessary to achieve it.

Quo Vadis NSCI

By now most of the HPC community is familiar with the NSCI program, the ambitious “whole of government” program to promote U.S. leadership in HPC. (see HPCWire article, White House Launches National HPC Strategy) Launched by Presidential Executive Order late last July, there has been some confusion in the community over the slowness of the program to take shape. A detailed implementation plan was due before the end of last year – there is a draft but few have seen it.

“One of the objectives and it is highlighted in red [see slide] is to keep the US at the forefront of HPC capabilities. That’s what it says. Of course there’s an underlying assumption there which I’ll leave to you…” said Sterling. “I struggled to get a phrase that would let me get a positive approach to this and still [do] to be honest. The US was (and is) in launch mode towards exascale, and shortly after ISC 15, NCSI was declared.”

ISC.Sterling.NSCI

The Exacale Computing Project (ECP), which predates NSCI but was subsumed under it, is still progressing largely as planned. “DOE is very sensitive about the term project, [which is] not synonymous with program [and] not synonymous with initiative; project means a very specific rigorous carefully regulated highly professional activity to lead to the end result,” he said, noting that Paul Messina of Argonne National Lab is the project’s leader had run a “very lucid” session on the ECP earlier in ISC.

Shout-Outs to Data Vortex, and CHPCS

A big part of ISC this week surrounded Intel’s official release of Xeon Phi/KNL to general availability and a building chorus OEM announcing plans rapidly introduce product incorporating the chip and Omni-Path Architecture. Sterling had good things to say about both KNL and OPA. He also had optimism for new technology from Data Vortex and praise for South Africa’s supercomputing progress:

  • Data Vortex. “The Data Vortex computer is a very different animal. It is created around a new and very different class of communication. [It’s] focused on lightweight fine grained messages, in fact very fine grained, each payload is only 64 bits with an address space available of 64 bits so that’s quite on the edge. Its high-rate, high bandwidth network [has] both low latency and its contention free,” said Sterling who suggests Data Vortex may be most useful as a domain specific or a domain narrow machine as there are a wide range of applications that are particularly well suited for its architecture.
  • Lengau (Cheetah). Last month the Centre for High Performance Computing (CHPC) at the Council for Scientific and Industrial Research (CSIR) unveiled the fastest computer in Africa, the Lengau system. Sterling applauded the accomplishment noting, “This is no small accomplishments 15X the last CHPC system. Its rmax is a 785 TF,” said Sterling, It was a timely piece of praise as Team South Africa had taken to the stage  short time before to collect its third HPCAC-ISC Student Cluster Competition championship.

How Big and How Fast

Towards the end of his presentation, Sterling said, “I’d like to leave us with a question and I’d like to talk about supercomputing in the shadows. We are the HPC community but do we in fact represent [all of ] HPC? I am a big supporter of HPL (LINPACK benchmark).” But many systems are unranked or poorly structured for the LINPACK bechmark. “There’s a whole world of high performance computing hidden in the cloud, of course I am being metaphorical here, Amazon, Google, Microsoft. Some of you may know how big those systems are. I don’t. But they are enormous and we don’t see them through our methods of evaluation.

“Sometimes they are hidden by intent, [such as] the intelligence community, and I’ll just say this – I have been in the basement,” he said drawing a laugh, and adding there are financial and banking systems that are “enormous, quite possible dwarfing what we view. Finally there are special purpose devices being used such as the Anton machine, for n body problems in molecular dynamics. The SKA (Square Kilometer Area project) is building a very complex configuration of multiple computer systems to process all the data. How do we run a LINPACK on a quantum computer?

“So I really leave you with this question. Do we really know how fast computing is, and if not, how should we as a community broaden our reach, our perspective to quantify and evaluate the trends in technology, the trends in architecture, the trends in applications that span the entire set of what high performance computing is and will be?”

 ISC.Sterling.Decade.Themes

 

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

A Beginner’s Guide to the ASC19 Finals

April 22, 2019

Three thousand watts. That's how much power the competitors in the 2019 ASC Student Supercomputer Challenge here in Dalian, China, have to work with. Everybody would like more juice to run compute-intensive HPC simulatio Read more…

By Alex Woodie

Is Data Science the Fourth Pillar of the Scientific Method?

April 18, 2019

Nvidia CEO Jensen Huang revived a decade-old debate last month when he said that modern data science (AI plus HPC) has become the fourth pillar of the scientific method. While some disagree with the notion that statistic Read more…

By Alex Woodie

At ASF 2019: The Virtuous Circle of Big Data, AI and HPC

April 18, 2019

We've entered a new phase in IT -- in the world, really -- where the combination of big data, artificial intelligence, and high performance computing is pushing the bounds of what's possible in business and science, in w Read more…

By Alex Woodie with Doug Black and Tiffany Trader

HPE Extreme Performance Solutions

HPE and Intel® Omni-Path Architecture: How to Power a Cloud

Learn how HPE and Intel® Omni-Path Architecture provide critical infrastructure for leading Nordic HPC provider’s HPCFLOW cloud service.

powercloud_blog.jpgFor decades, HPE has been at the forefront of high-performance computing, and we’ve powered some of the fastest and most robust supercomputers in the world. Read more…

IBM Accelerated Insights

Bridging HPC and Cloud Native Development with Kubernetes

The HPC community has historically developed its own specialized software stack including schedulers, filesystems, developer tools, container technologies tuned for performance and large-scale on-premises deployments. Read more…

Google Open Sources TensorFlow Version of MorphNet DL Tool

April 18, 2019

Designing optimum deep neural networks remains a non-trivial exercise. “Given the large search space of possible architectures, designing a network from scratch for your specific application can be prohibitively expens Read more…

By John Russell

A Beginner’s Guide to the ASC19 Finals

April 22, 2019

Three thousand watts. That's how much power the competitors in the 2019 ASC Student Supercomputer Challenge here in Dalian, China, have to work with. Everybody Read more…

By Alex Woodie

At ASF 2019: The Virtuous Circle of Big Data, AI and HPC

April 18, 2019

We've entered a new phase in IT -- in the world, really -- where the combination of big data, artificial intelligence, and high performance computing is pushing Read more…

By Alex Woodie with Doug Black and Tiffany Trader

Interview with 2019 Person to Watch Michela Taufer

April 18, 2019

Today, as part of our ongoing HPCwire People to Watch focus series, we are highlighting our interview with 2019 Person to Watch Michela Taufer. Michela -- the Read more…

By HPCwire Editorial Team

Intel Gold U-Series SKUs Reveal Single Socket Intentions

April 18, 2019

Intel plans to jump into the single socket market with a portion of its just announced Cascade Lake microprocessor line according to one media report. This isn Read more…

By John Russell

BSC Researchers Shrink Floating Point Formats to Accelerate Deep Neural Network Training

April 15, 2019

Sometimes calculating solutions as precisely as a computer can wastes more CPU resources than is necessary. A case in point is with deep learning. In early stag Read more…

By Ken Strandberg

Intel Extends FPGA Ecosystem with 10nm Agilex

April 11, 2019

The insatiable appetite for higher throughput and lower latency – particularly where edge analytics and AI, network functions, or for a range of datacenter ac Read more…

By Doug Black

Nvidia Doubles Down on Medical AI

April 9, 2019

Nvidia is collaborating with medical groups to push GPU-powered AI tools into clinical settings, including radiology and drug discovery. The GPU leader said Monday it will collaborate with the American College of Radiology (ACR) to provide clinicians with its Clara AI tool kit. The partnership would allow radiologists to leverage AI techniques for diagnostic imaging using their own clinical data. Read more…

By George Leopold

Digging into MLPerf Benchmark Suite to Inform AI Infrastructure Decisions

April 9, 2019

With machine learning and deep learning storming into the datacenter, the new challenge is optimizing infrastructure choices to support diverse ML and DL workfl Read more…

By John Russell

The Case Against ‘The Case Against Quantum Computing’

January 9, 2019

It’s not easy to be a physicist. Richard Feynman (basically the Jimi Hendrix of physicists) once said: “The first principle is that you must not fool yourse Read more…

By Ben Criger

Why Nvidia Bought Mellanox: ‘Future Datacenters Will Be…Like High Performance Computers’

March 14, 2019

“Future datacenters of all kinds will be built like high performance computers,” said Nvidia CEO Jensen Huang during a phone briefing on Monday after Nvidia revealed scooping up the high performance networking company Mellanox for $6.9 billion. Read more…

By Tiffany Trader

ClusterVision in Bankruptcy, Fate Uncertain

February 13, 2019

ClusterVision, European HPC specialists that have built and installed over 20 Top500-ranked systems in their nearly 17-year history, appear to be in the midst o Read more…

By Tiffany Trader

Intel Reportedly in $6B Bid for Mellanox

January 30, 2019

The latest rumors and reports around an acquisition of Mellanox focus on Intel, which has reportedly offered a $6 billion bid for the high performance interconn Read more…

By Doug Black

It’s Official: Aurora on Track to Be First US Exascale Computer in 2021

March 18, 2019

The U.S. Department of Energy along with Intel and Cray confirmed today that an Intel/Cray supercomputer, "Aurora," capable of sustained performance of one exaf Read more…

By Tiffany Trader

Looking for Light Reading? NSF-backed ‘Comic Books’ Tackle Quantum Computing

January 28, 2019

Still baffled by quantum computing? How about turning to comic books (graphic novels for the well-read among you) for some clarity and a little humor on QC. The Read more…

By John Russell

IBM Quantum Update: Q System One Launch, New Collaborators, and QC Center Plans

January 10, 2019

IBM made three significant quantum computing announcements at CES this week. One was introduction of IBM Q System One; it’s really the integration of IBM’s Read more…

By John Russell

Deep500: ETH Researchers Introduce New Deep Learning Benchmark for HPC

February 5, 2019

ETH researchers have developed a new deep learning benchmarking environment – Deep500 – they say is “the first distributed and reproducible benchmarking s Read more…

By John Russell

Leading Solution Providers

SC 18 Virtual Booth Video Tour

Advania @ SC18 AMD @ SC18
ASRock Rack @ SC18
DDN Storage @ SC18
HPE @ SC18
IBM @ SC18
Lenovo @ SC18 Mellanox Technologies @ SC18
NVIDIA @ SC18
One Stop Systems @ SC18
Oracle @ SC18 Panasas @ SC18
Supermicro @ SC18 SUSE @ SC18 TYAN @ SC18
Verne Global @ SC18

IBM Bets $2B Seeking 1000X AI Hardware Performance Boost

February 7, 2019

For now, AI systems are mostly machine learning-based and “narrow” – powerful as they are by today's standards, they're limited to performing a few, narro Read more…

By Doug Black

The Deep500 – Researchers Tackle an HPC Benchmark for Deep Learning

January 7, 2019

How do you know if an HPC system, particularly a larger-scale system, is well-suited for deep learning workloads? Today, that’s not an easy question to answer Read more…

By John Russell

Arm Unveils Neoverse N1 Platform with up to 128-Cores

February 20, 2019

Following on its Neoverse roadmap announcement last October, Arm today revealed its next-gen Neoverse microarchitecture with compute and throughput-optimized si Read more…

By Tiffany Trader

Intel Launches Cascade Lake Xeons with Up to 56 Cores

April 2, 2019

At Intel's Data-Centric Innovation Day in San Francisco (April 2), the company unveiled its second-generation Xeon Scalable (Cascade Lake) family and debuted it Read more…

By Tiffany Trader

France to Deploy AI-Focused Supercomputer: Jean Zay

January 22, 2019

HPE announced today that it won the contract to build a supercomputer that will drive France’s AI and HPC efforts. The computer will be part of GENCI, the Fre Read more…

By Tiffany Trader

Oil and Gas Supercloud Clears Out Remaining Knights Landing Inventory: All 38,000 Wafers

March 13, 2019

The McCloud HPC service being built by Australia’s DownUnder GeoSolutions (DUG) outside Houston is set to become the largest oil and gas cloud in the world th Read more…

By Tiffany Trader

Intel Extends FPGA Ecosystem with 10nm Agilex

April 11, 2019

The insatiable appetite for higher throughput and lower latency – particularly where edge analytics and AI, network functions, or for a range of datacenter ac Read more…

By Doug Black

UC Berkeley Paper Heralds Rise of Serverless Computing in the Cloud – Do You Agree?

February 13, 2019

Almost exactly ten years to the day from publishing of their widely-read, seminal paper on cloud computing, UC Berkeley researchers have issued another ambitious examination of cloud computing - Cloud Programming Simplified: A Berkeley View on Serverless Computing. The new work heralds the rise of ‘serverless computing’ as the next dominant phase of cloud computing. Read more…

By John Russell

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This