HPC Serves as a ‘Rosetta Stone’ for the Information Age

By Warren Froelich

July 12, 2018

Today high-performance computing is at the forefront of a new gold rush, a rush to discovery using an ever-growing flood of information and data. Computing is now essential to science discovery like never before. We are the modern pioneers pushing the bounds of science for the betterment of society. — SC17 General Chair Bernd Mohr, Jülich Supercomputing Centre 

In an age defined and transformed by its data, several large-scale scientific instruments around the globe might be viewed as a mother lode of precious data.

With names seemingly created for a techno-speak glossary, these interferometers, cyclotrons, sequencers, solenoids, satellite altimeters, and cryo-electron microscopes are churning out data in previously unthinkable and seemingly incomprehensible quantities — billions, trillions and quadrillions of bits and bytes of electro-magnetic code.

Like the famed Rosetta Stone that enabled Ancient Egyptian inscriptions to be decoded, high-performance computing transforms 21st digital data into valuable insight. Image credit: Olaf Herrmann

Yet, policy-makers from the National Science Foundation (NSF) and others plotting future directions in science believe that hidden within these veritable mountain-sized mines of information are clues to questions that have confounded humanity since their first thoughts: answers about those bits of glitter in the night sky, the nature of matter, the causes of disease, the origins of life and even why and how we think about such things.

For this reason, the ability to convert this seemingly unintelligible digital data into rapid, meaningful discoveries has taken on added significance. Indeed, one of the NSF’s 10 Big Ideas for the future includes “Harnessing Data for the 21st Century Science and Engineering.”

Enter advanced or high-performance computing (HPC) which sifts and separates waste from valuable digital nuggets and, somewhat like a Rosetta Stone of the information age, decodes and translates this data into valuable insight.

“Advanced computing, along with experts charged with building and making the most of these HPC systems, has been critical to many Nobel Prizes, including work involving traditional modeling and simulation, to projects designed for more data-intensive workloads,” said Michael Norman, director of the San Diego Supercomputer Center (SDSC) at UC San Diego.

As evidence, Norman and others point to several recent Nobel Prizes in chemistry and physics — including international collaborations exploring the dark side of the universe and others delving into the dynamics of proteins critical for tomorrow’s targeted therapies.

Each has relied on the marriage of supercomputing technology and expertise with large-scale scientific instruments to achieve their goals, all connected by faster and faster high-speed communications networks. And each touches on other Big Ideas from the NSF, such as “The Era of Multi-Messenger Astrophysics” that include a collection of approaches to expand our observations and understandings of the universe; a “Quantum Leap” into the understanding the behavior of matter and energy at very small – atomic and subatomic – scales; and “Understanding the Rules of Life”, an initiative that will require convergence of research across biology, computer science, mathematics, behavioral sciences, and engineering.

SDSC’s Petascale Comet Supercomputer. Credit: Ben Tolo, SDSC

Some of this effort is based on the solution of fundamental mathematical equations to create models or simulations using HPC systems now capable of generating quadrillions of calculations per second, such as Comet, funded by the NSF and housed at SDSC. Other HPC research requires the access, analysis, and interpretation of previously unfathomable amounts of data via a modality called high-throughput computing (HTC) being generated from a wide cross-section of sensors and detectors. Simulation and data analysis along with experimentation sometimes complement and even blend with one another for discovery.

“HTC is a way of consuming computer resources, including those we label as HPC,” said Frank Würthwein, professor of physics at UC San Diego and Distributed High-Throughput Computing Lead at SDSC. “The way these large-scale instruments do analysis requires the HTC ‘modality’ of computing. This is distinct from the standard ‘submit a job to the queue’ which is what people traditionally do for simulations.”

An Integrated Data Ecosystem

Those on the technological front line recognize that the challenges to keep up with the data explosion are enormous. Among other things, much of the science requires the integration of computational resources in an ecosystem that includes sophisticated workflow tools to orchestrate complex pathways for scheduling, data transfer, and processing. Massive sets of data collected through these efforts also require tools and techniques for filtering and processing, plus analytical techniques to extract key information. Moreover, the system needs to be effectively automated across different types of resources, including instruments and data archives.

Some suggest that all these components should be orchestrated into what’s being called a “super facility.” The goal, according to the U.S. Department of Energy, is to bring together users at multiple institutions “allowing geographically dispersed collaborators to tap into scientific resources and expertise, and analyze and share data with other users—all in real time and without having to leave the comfort of their office or lab.”

Said Würthwein: “These large-scale scientific instruments depend on large international cyberinfrastructures that a ‘super facility’ must integrate into seamlessly. The HPC system cannot be an island unto itself.”

The NSF concurs. “The grand challenges of today – protecting human health, understanding the food, energy, water nexus; exploring the universe on all scales – will not be solved by one discipline alone,” the agency stated in a 2017 report prepared for Congress. “They require convergence: the merging of ideas, approaches, and technologies from widely diverse fields of knowledge to stimulate innovation and discovery.”

Armed with ever-more powerful large-scale scientific instruments, research teams around the globe – some encompassing a wide variety disciplines – already are converging to build an impressive portfolio of scientific advances and discoveries, with supercomputers serving as critical linchpin for all these investigations.

Cosmic Discoveries

On July 4, 2012, at the CERN laboratory for particle physics outside Geneva, Switzerland, a theory first proposed in 1964 by François Englert and Peter W. Higgs was confirmed with the discovery of a Higgs particle. The theory, which garnered the duo the 2013 Nobel Prize in physics, is a central part of the Standard Model of particle physics that describes how the world is constructed at its most fundamental level, from the intense waves of energy and primordial particles released from the “Big Bang,” to the planet we inhabit, to those glittering specks of light we observe in the night sky.

The Compact Muon Solenoid (CMS) is a general-purpose detector at the Large Hadron Collider (LHC), which is the world’s largest and most powerful particle accelerator. Courtesy CERN.

Under a partnership with UC San Diego physicists and the Open Science Grid (OSG), a multi-disciplinary research partnership funded by the U.S. Department of Energy and the NSF, SDSC’s Gordon supercomputer provided auxiliary computing capacity to process massive raw data generated by the Compact Muon Solenoid (CMS) — one of two general purpose particle detectors at the Large Hadron Collider (LHC). LHC experiments are among the largest ever seen in physics, with each experiment involving collaborations of close to 200 institutions in more than 40 countries, involving in excess of 3,000 scientists and engineers.

“Access to Gordon, and its excellent computing speed due to its flash-based memory, really helped push forward the processing schedule for us,” said Würthwein, a member of the CMS project and executive director of OSG “This was one of the first ever integrations of HTC with a large HPC system and with only a few weeks’ notice, we were able to gain access to Gordon and complete the runs, making the data available for analysis in time to provide crucial input toward the international planning meetings on the future of particle physics.”

In February 2016, an international team representing more than 20 countries announced the first-ever detection of gravitational waves in the universe, based on the tell-tale “chirp” signature of two black holes merging about 1.3 billion years ago. The collision sent what some referred to as a “ripple in the fabric of space time”: gravitational waves, hypothesized by Albert Einstein a century ago. The signal was detected on earth, first by the NSF-funded Laser Interferometer Gravitational Wave Observatory (LIGO) near Livingston, Louisiana; and then seven milliseconds later, and 1,890 miles away, at the second LIGO interferometer in Hanford, Washington. Three members of the team won the 2017 Nobel Prize in Physics for the discovery.

LIGO operates two detector sites — one near Hanford in eastern Washington, and another near Livingston, Louisiana. The Livingston detector site is pictured here. Courtesy LIGO Collaboration.

SDSC’s Comet was one of several supercomputers used by researchers to confirm the landmark discovery.

“LIGO’s discovery of gravitational waves from the binary black hole required large-scale data analysis to validate the discovery claim,” said Duncan Brown, The Charles Brightman Professor of Physics at Syracuse University’s Department of Physics who studies gravitational waveforms for black holes and neutron star binaries. “This includes measuring how significant the signal is compared to noise in the detector, and re-analyzing the data with simulated signals to ensure that we understand the astrophysical sensitivity of the search. Comet’s computer cycles were extremely important for us to complete large-scale simulations and fast validation of the search.”

Less than a year after the first discovery of gravitational waves, in October 2017 researchers announced they had detected gravitational waves generated by the collision of two neutron stars more than 130 light years from earth, via the two LIGO instruments and the Europe-based Virgo interferometer, followed shortly by multiple telescopes and satellites built to capture light from the universe. This combination of observational instruments bears testimony to what’s become known as multi-messenger astronomy (MMA), where multiple instruments — built to detect different forms of electromagnetic radiation – are choreographed with one another, essentially in real time, to view the same patch of sky. Once again, Comet was one of several HPC systems to verify the signal, with allocations from NSF’s Extreme Science and Engineering Discovery Environment (XSEDE) and the OSG.

“The correlation of the three interferometers, 2 from LIGO and one from Virgo significantly shrunk the area in the sky for where to look,” said Würthwein.

Added Syracuse University’s Brown: “Comet’s contribution through the OSG and XSEDE allowed us to rapidly turn around the offline analysis in about a day. That, in turn allowed us to do several one-day runs, as opposed to having to spend several weeks before publishing our findings.”

This image shows a high-energy neutrino event superimposed on a view of the IceCube Lab (ICL) at the South Pole. Courtesy IceCube Collaboration.

Since being postulated in December 1930 by Wolfgang Pauli, cosmologists have been hunting for neutrinos: subatomic particles that lack an electric charge, particles once described as “the most tiny quantity of reality ever imagined by a human being.” For the most part, cosmic neutrinos are believed to have been created about 15 billion years ago, soon after the birth of the universe. Others emerged more recently from some of the most violent actions in the universe, such as exploding stars, gamma ray bursts, black holes and neutron stars. But unlike photons and other charged particles, neutrinos can emerge from their sources and, like cosmological ghosts, pass through the universe unscathed.

To help catch these near-massless messengers from deep space, an international team of researchers funded by the NSF set up IceCube, an observatory containing an array of 5,160 optical sensors deep within a cubic kilometer of ice at the South Pole. Encompassing 300 physicists from 49 institutions in 12 countries, IceCube already has achieved its primary goal of detecting the extraterrestrial flux of very high-energy neutrinos.

Frank Halzen, principal investigator of the IceCube Observatory and physics professor at the University of Wisconsin-Madison, explained the importance of the Comet supercomputer for isolating the signature pattern of neutrinos:  “The IceCube neutrino detector transforms natural Antarctic ice at the South Pole into a particle detector. Progress in understanding the precise optical properties of the ice leads to increasing complexity in simulating the propagation of photons in the instrument and to a better overall performance of the detector.”

“The photon propagation in the ice is very well-suited to run in graphics processing units (GPUs) hardware, such as those on Comet.” Halzen continued. “Pursuing efficient access to a large amount of GPU computing power is therefore of great importance to ensure that future IceCube analysis reaches the maximum precision and that the full scientific potential of the instrument is exploited.”

Stay tuned for Part II

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Why HPC Storage Matters More Now Than Ever: Analyst Q&A

September 17, 2021

With soaring data volumes and insatiable computing driving nearly every facet of economic, social and scientific progress, data storage is seizing the spotlight. Hyperion Research analyst and noted storage expert Mark No Read more…

GigaIO Gets $14.7M in Series B Funding to Expand Its Composable Fabric Technology to Customers

September 16, 2021

Just before the COVID-19 pandemic began in March 2020, GigaIO introduced its Universal Composable Fabric technology, which allows enterprises to bring together any HPC and AI resources and integrate them with networking, Read more…

What’s New in HPC Research: Solar Power, ExaWorks, Optane & More

September 16, 2021

In this regular feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From parallel programming to exascale to quantum computing, the details are here. Read more…

Cerebras Brings Its Wafer-Scale Engine AI System to the Cloud

September 16, 2021

Five months ago, when Cerebras Systems debuted its second-generation wafer-scale silicon system (CS-2), co-founder and CEO Andrew Feldman hinted of the company’s coming cloud plans, and now those plans have come to fruition. Today, Cerebras and Cirrascale Cloud Services are launching... Read more…

AI Hardware Summit: Panel on Memory Looks Forward

September 15, 2021

What will system memory look like in five years? Good question. While Monday's panel, Designing AI Super-Chips at the Speed of Memory, at the AI Hardware Summit, tackled several topics, the panelists also took a brief glimpse into the future. Unlike compute, storage and networking, which... Read more…

AWS Solution Channel

Supporting Climate Model Simulations to Accelerate Climate Science

The Amazon Sustainability Data Initiative (ASDI), AWS is donating cloud resources, technical support, and access to scalable infrastructure and fast networking providing high performance computing (HPC) solutions to support simulations of near-term climate using the National Center for Atmospheric Research (NCAR) Community Earth System Model Version 2 (CESM2) and its Whole Atmosphere Community Climate Model (WACCM). Read more…

ECMWF Opens Bologna Datacenter in Preparation for Atos Supercomputer

September 14, 2021

In January 2020, the European Centre for Medium-Range Weather Forecasts (ECMWF) – a juggernaut in the weather forecasting scene – signed a four-year, $89-million contract with European tech firm Atos to quintuple its supercomputing capacity. With the deal approaching the two-year mark, ECMWF... Read more…

Why HPC Storage Matters More Now Than Ever: Analyst Q&A

September 17, 2021

With soaring data volumes and insatiable computing driving nearly every facet of economic, social and scientific progress, data storage is seizing the spotlight Read more…

Cerebras Brings Its Wafer-Scale Engine AI System to the Cloud

September 16, 2021

Five months ago, when Cerebras Systems debuted its second-generation wafer-scale silicon system (CS-2), co-founder and CEO Andrew Feldman hinted of the company’s coming cloud plans, and now those plans have come to fruition. Today, Cerebras and Cirrascale Cloud Services are launching... Read more…

AI Hardware Summit: Panel on Memory Looks Forward

September 15, 2021

What will system memory look like in five years? Good question. While Monday's panel, Designing AI Super-Chips at the Speed of Memory, at the AI Hardware Summit, tackled several topics, the panelists also took a brief glimpse into the future. Unlike compute, storage and networking, which... Read more…

ECMWF Opens Bologna Datacenter in Preparation for Atos Supercomputer

September 14, 2021

In January 2020, the European Centre for Medium-Range Weather Forecasts (ECMWF) – a juggernaut in the weather forecasting scene – signed a four-year, $89-million contract with European tech firm Atos to quintuple its supercomputing capacity. With the deal approaching the two-year mark, ECMWF... Read more…

Quantum Computer Market Headed to $830M in 2024

September 13, 2021

What is one to make of the quantum computing market? Energized (lots of funding) but still chaotic and advancing in unpredictable ways (e.g. competing qubit tec Read more…

Amazon, NCAR, SilverLining Team for Unprecedented Cloud Climate Simulations

September 10, 2021

Earth’s climate is, to put it mildly, not in a good place. In the wake of a damning report from the Intergovernmental Panel on Climate Change (IPCC), scientis Read more…

After Roadblocks and Renewals, EuroHPC Targets a Bigger, Quantum Future

September 9, 2021

The EuroHPC Joint Undertaking (JU) was formalized in 2018, beginning a new era of European supercomputing that began to bear fruit this year with the launch of several of the first EuroHPC systems. The undertaking, however, has not been without its speed bumps, and the Union faces an uphill... Read more…

How Argonne Is Preparing for Exascale in 2022

September 8, 2021

Additional details came to light on Argonne National Laboratory’s preparation for the 2022 Aurora exascale-class supercomputer, during the HPC User Forum, held virtually this week on account of pandemic. Exascale Computing Project director Doug Kothe reviewed some of the 'early exascale hardware' at Argonne, Oak Ridge and NERSC (Perlmutter), while Ti Leggett, Deputy Project Director & Deputy Director... Read more…

Ahead of ‘Dojo,’ Tesla Reveals Its Massive Precursor Supercomputer

June 22, 2021

In spring 2019, Tesla made cryptic reference to a project called Dojo, a “super-powerful training computer” for video data processing. Then, in summer 2020, Tesla CEO Elon Musk tweeted: “Tesla is developing a [neural network] training computer called Dojo to process truly vast amounts of video data. It’s a beast! … A truly useful exaflop at de facto FP32.” Read more…

Berkeley Lab Debuts Perlmutter, World’s Fastest AI Supercomputer

May 27, 2021

A ribbon-cutting ceremony held virtually at Berkeley Lab's National Energy Research Scientific Computing Center (NERSC) today marked the official launch of Perlmutter – aka NERSC-9 – the GPU-accelerated supercomputer built by HPE in partnership with Nvidia and AMD. Read more…

Esperanto, Silicon in Hand, Champions the Efficiency of Its 1,092-Core RISC-V Chip

August 27, 2021

Esperanto Technologies made waves last December when it announced ET-SoC-1, a new RISC-V-based chip aimed at machine learning that packed nearly 1,100 cores onto a package small enough to fit six times over on a single PCIe card. Now, Esperanto is back, silicon in-hand and taking aim... Read more…

Enter Dojo: Tesla Reveals Design for Modular Supercomputer & D1 Chip

August 20, 2021

Two months ago, Tesla revealed a massive GPU cluster that it said was “roughly the number five supercomputer in the world,” and which was just a precursor to Tesla’s real supercomputing moonshot: the long-rumored, little-detailed Dojo system. “We’ve been scaling our neural network training compute dramatically over the last few years,” said Milan Kovac, Tesla’s director of autopilot engineering. Read more…

CentOS Replacement Rocky Linux Is Now in GA and Under Independent Control

June 21, 2021

The Rocky Enterprise Software Foundation (RESF) is announcing the general availability of Rocky Linux, release 8.4, designed as a drop-in replacement for the soon-to-be discontinued CentOS. The GA release is launching six-and-a-half months after Red Hat deprecated its support for the widely popular, free CentOS server operating system. The Rocky Linux development effort... Read more…

Google Launches TPU v4 AI Chips

May 20, 2021

Google CEO Sundar Pichai spoke for only one minute and 42 seconds about the company’s latest TPU v4 Tensor Processing Units during his keynote at the Google I Read more…

Intel Completes LLVM Adoption; Will End Updates to Classic C/C++ Compilers in Future

August 10, 2021

Intel reported in a blog this week that its adoption of the open source LLVM architecture for Intel’s C/C++ compiler is complete. The transition is part of In Read more…

AMD-Xilinx Deal Gains UK, EU Approvals — China’s Decision Still Pending

July 1, 2021

AMD’s planned acquisition of FPGA maker Xilinx is now in the hands of Chinese regulators after needed antitrust approvals for the $35 billion deal were receiv Read more…

Leading Solution Providers

Contributors

Hot Chips: Here Come the DPUs and IPUs from Arm, Nvidia and Intel

August 25, 2021

The emergence of data processing units (DPU) and infrastructure processing units (IPU) as potentially important pieces in cloud and datacenter architectures was Read more…

Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?

January 13, 2021

The rapid adoption of Julia, the open source, high level programing language with roots at MIT, shows no sign of slowing according to data from Julialang.org. I Read more…

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

HPE Wins $2B GreenLake HPC-as-a-Service Deal with NSA

September 1, 2021

In the heated, oft-contentious, government IT space, HPE has won a massive $2 billion contract to provide HPC and AI services to the United States’ National Security Agency (NSA). Following on the heels of the now-canceled $10 billion JEDI contract (reissued as JWCC) and a $10 billion... Read more…

Quantum Roundup: IBM, Rigetti, Phasecraft, Oxford QC, China, and More

July 13, 2021

IBM yesterday announced a proof for a quantum ML algorithm. A week ago, it unveiled a new topology for its quantum processors. Last Friday, the Technical Univer Read more…

Intel Launches 10nm ‘Ice Lake’ Datacenter CPU with Up to 40 Cores

April 6, 2021

The wait is over. Today Intel officially launched its 10nm datacenter CPU, the third-generation Intel Xeon Scalable processor, codenamed Ice Lake. With up to 40 Read more…

Frontier to Meet 20MW Exascale Power Target Set by DARPA in 2008

July 14, 2021

After more than a decade of planning, the United States’ first exascale computer, Frontier, is set to arrive at Oak Ridge National Laboratory (ORNL) later this year. Crossing this “1,000x” horizon required overcoming four major challenges: power demand, reliability, extreme parallelism and data movement. Read more…

Intel Unveils New Node Names; Sapphire Rapids Is Now an ‘Intel 7’ CPU

July 27, 2021

What's a preeminent chip company to do when its process node technology lags the competition by (roughly) one generation, but outmoded naming conventions make it seem like it's two nodes behind? For Intel, the response was to change how it refers to its nodes with the aim of better reflecting its positioning within the leadership semiconductor manufacturing space. Intel revealed its new node nomenclature, and... Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire