HPC Serves as a ‘Rosetta Stone’ for the Information Age

By Warren Froelich

July 12, 2018

Today high-performance computing is at the forefront of a new gold rush, a rush to discovery using an ever-growing flood of information and data. Computing is now essential to science discovery like never before. We are the modern pioneers pushing the bounds of science for the betterment of society. — SC17 General Chair Bernd Mohr, Jülich Supercomputing Centre 

In an age defined and transformed by its data, several large-scale scientific instruments around the globe might be viewed as a mother lode of precious data.

With names seemingly created for a techno-speak glossary, these interferometers, cyclotrons, sequencers, solenoids, satellite altimeters, and cryo-electron microscopes are churning out data in previously unthinkable and seemingly incomprehensible quantities — billions, trillions and quadrillions of bits and bytes of electro-magnetic code.

Like the famed Rosetta Stone that enabled Ancient Egyptian inscriptions to be decoded, high-performance computing transforms 21st digital data into valuable insight. Image credit: Olaf Herrmann

Yet, policy-makers from the National Science Foundation (NSF) and others plotting future directions in science believe that hidden within these veritable mountain-sized mines of information are clues to questions that have confounded humanity since their first thoughts: answers about those bits of glitter in the night sky, the nature of matter, the causes of disease, the origins of life and even why and how we think about such things.

For this reason, the ability to convert this seemingly unintelligible digital data into rapid, meaningful discoveries has taken on added significance. Indeed, one of the NSF’s 10 Big Ideas for the future includes “Harnessing Data for the 21st Century Science and Engineering.”

Enter advanced or high-performance computing (HPC) which sifts and separates waste from valuable digital nuggets and, somewhat like a Rosetta Stone of the information age, decodes and translates this data into valuable insight.

“Advanced computing, along with experts charged with building and making the most of these HPC systems, has been critical to many Nobel Prizes, including work involving traditional modeling and simulation, to projects designed for more data-intensive workloads,” said Michael Norman, director of the San Diego Supercomputer Center (SDSC) at UC San Diego.

As evidence, Norman and others point to several recent Nobel Prizes in chemistry and physics — including international collaborations exploring the dark side of the universe and others delving into the dynamics of proteins critical for tomorrow’s targeted therapies.

Each has relied on the marriage of supercomputing technology and expertise with large-scale scientific instruments to achieve their goals, all connected by faster and faster high-speed communications networks. And each touches on other Big Ideas from the NSF, such as “The Era of Multi-Messenger Astrophysics” that include a collection of approaches to expand our observations and understandings of the universe; a “Quantum Leap” into the understanding the behavior of matter and energy at very small – atomic and subatomic – scales; and “Understanding the Rules of Life”, an initiative that will require convergence of research across biology, computer science, mathematics, behavioral sciences, and engineering.

SDSC’s Petascale Comet Supercomputer. Credit: Ben Tolo, SDSC

Some of this effort is based on the solution of fundamental mathematical equations to create models or simulations using HPC systems now capable of generating quadrillions of calculations per second, such as Comet, funded by the NSF and housed at SDSC. Other HPC research requires the access, analysis, and interpretation of previously unfathomable amounts of data via a modality called high-throughput computing (HTC) being generated from a wide cross-section of sensors and detectors. Simulation and data analysis along with experimentation sometimes complement and even blend with one another for discovery.

“HTC is a way of consuming computer resources, including those we label as HPC,” said Frank Würthwein, professor of physics at UC San Diego and Distributed High-Throughput Computing Lead at SDSC. “The way these large-scale instruments do analysis requires the HTC ‘modality’ of computing. This is distinct from the standard ‘submit a job to the queue’ which is what people traditionally do for simulations.”

An Integrated Data Ecosystem

Those on the technological front line recognize that the challenges to keep up with the data explosion are enormous. Among other things, much of the science requires the integration of computational resources in an ecosystem that includes sophisticated workflow tools to orchestrate complex pathways for scheduling, data transfer, and processing. Massive sets of data collected through these efforts also require tools and techniques for filtering and processing, plus analytical techniques to extract key information. Moreover, the system needs to be effectively automated across different types of resources, including instruments and data archives.

Some suggest that all these components should be orchestrated into what’s being called a “super facility.” The goal, according to the U.S. Department of Energy, is to bring together users at multiple institutions “allowing geographically dispersed collaborators to tap into scientific resources and expertise, and analyze and share data with other users—all in real time and without having to leave the comfort of their office or lab.”

Said Würthwein: “These large-scale scientific instruments depend on large international cyberinfrastructures that a ‘super facility’ must integrate into seamlessly. The HPC system cannot be an island unto itself.”

The NSF concurs. “The grand challenges of today – protecting human health, understanding the food, energy, water nexus; exploring the universe on all scales – will not be solved by one discipline alone,” the agency stated in a 2017 report prepared for Congress. “They require convergence: the merging of ideas, approaches, and technologies from widely diverse fields of knowledge to stimulate innovation and discovery.”

Armed with ever-more powerful large-scale scientific instruments, research teams around the globe – some encompassing a wide variety disciplines – already are converging to build an impressive portfolio of scientific advances and discoveries, with supercomputers serving as critical linchpin for all these investigations.

Cosmic Discoveries

On July 4, 2012, at the CERN laboratory for particle physics outside Geneva, Switzerland, a theory first proposed in 1964 by François Englert and Peter W. Higgs was confirmed with the discovery of a Higgs particle. The theory, which garnered the duo the 2013 Nobel Prize in physics, is a central part of the Standard Model of particle physics that describes how the world is constructed at its most fundamental level, from the intense waves of energy and primordial particles released from the “Big Bang,” to the planet we inhabit, to those glittering specks of light we observe in the night sky.

The Compact Muon Solenoid (CMS) is a general-purpose detector at the Large Hadron Collider (LHC), which is the world’s largest and most powerful particle accelerator. Courtesy CERN.

Under a partnership with UC San Diego physicists and the Open Science Grid (OSG), a multi-disciplinary research partnership funded by the U.S. Department of Energy and the NSF, SDSC’s Gordon supercomputer provided auxiliary computing capacity to process massive raw data generated by the Compact Muon Solenoid (CMS) — one of two general purpose particle detectors at the Large Hadron Collider (LHC). LHC experiments are among the largest ever seen in physics, with each experiment involving collaborations of close to 200 institutions in more than 40 countries, involving in excess of 3,000 scientists and engineers.

“Access to Gordon, and its excellent computing speed due to its flash-based memory, really helped push forward the processing schedule for us,” said Würthwein, a member of the CMS project and executive director of OSG “This was one of the first ever integrations of HTC with a large HPC system and with only a few weeks’ notice, we were able to gain access to Gordon and complete the runs, making the data available for analysis in time to provide crucial input toward the international planning meetings on the future of particle physics.”

In February 2016, an international team representing more than 20 countries announced the first-ever detection of gravitational waves in the universe, based on the tell-tale “chirp” signature of two black holes merging about 1.3 billion years ago. The collision sent what some referred to as a “ripple in the fabric of space time”: gravitational waves, hypothesized by Albert Einstein a century ago. The signal was detected on earth, first by the NSF-funded Laser Interferometer Gravitational Wave Observatory (LIGO) near Livingston, Louisiana; and then seven milliseconds later, and 1,890 miles away, at the second LIGO interferometer in Hanford, Washington. Three members of the team won the 2017 Nobel Prize in Physics for the discovery.

LIGO operates two detector sites — one near Hanford in eastern Washington, and another near Livingston, Louisiana. The Livingston detector site is pictured here. Courtesy LIGO Collaboration.

SDSC’s Comet was one of several supercomputers used by researchers to confirm the landmark discovery.

“LIGO’s discovery of gravitational waves from the binary black hole required large-scale data analysis to validate the discovery claim,” said Duncan Brown, The Charles Brightman Professor of Physics at Syracuse University’s Department of Physics who studies gravitational waveforms for black holes and neutron star binaries. “This includes measuring how significant the signal is compared to noise in the detector, and re-analyzing the data with simulated signals to ensure that we understand the astrophysical sensitivity of the search. Comet’s computer cycles were extremely important for us to complete large-scale simulations and fast validation of the search.”

Less than a year after the first discovery of gravitational waves, in October 2017 researchers announced they had detected gravitational waves generated by the collision of two neutron stars more than 130 light years from earth, via the two LIGO instruments and the Europe-based Virgo interferometer, followed shortly by multiple telescopes and satellites built to capture light from the universe. This combination of observational instruments bears testimony to what’s become known as multi-messenger astronomy (MMA), where multiple instruments — built to detect different forms of electromagnetic radiation – are choreographed with one another, essentially in real time, to view the same patch of sky. Once again, Comet was one of several HPC systems to verify the signal, with allocations from NSF’s Extreme Science and Engineering Discovery Environment (XSEDE) and the OSG.

“The correlation of the three interferometers, 2 from LIGO and one from Virgo significantly shrunk the area in the sky for where to look,” said Würthwein.

Added Syracuse University’s Brown: “Comet’s contribution through the OSG and XSEDE allowed us to rapidly turn around the offline analysis in about a day. That, in turn allowed us to do several one-day runs, as opposed to having to spend several weeks before publishing our findings.”

This image shows a high-energy neutrino event superimposed on a view of the IceCube Lab (ICL) at the South Pole. Courtesy IceCube Collaboration.

Since being postulated in December 1930 by Wolfgang Pauli, cosmologists have been hunting for neutrinos: subatomic particles that lack an electric charge, particles once described as “the most tiny quantity of reality ever imagined by a human being.” For the most part, cosmic neutrinos are believed to have been created about 15 billion years ago, soon after the birth of the universe. Others emerged more recently from some of the most violent actions in the universe, such as exploding stars, gamma ray bursts, black holes and neutron stars. But unlike photons and other charged particles, neutrinos can emerge from their sources and, like cosmological ghosts, pass through the universe unscathed.

To help catch these near-massless messengers from deep space, an international team of researchers funded by the NSF set up IceCube, an observatory containing an array of 5,160 optical sensors deep within a cubic kilometer of ice at the South Pole. Encompassing 300 physicists from 49 institutions in 12 countries, IceCube already has achieved its primary goal of detecting the extraterrestrial flux of very high-energy neutrinos.

Frank Halzen, principal investigator of the IceCube Observatory and physics professor at the University of Wisconsin-Madison, explained the importance of the Comet supercomputer for isolating the signature pattern of neutrinos:  “The IceCube neutrino detector transforms natural Antarctic ice at the South Pole into a particle detector. Progress in understanding the precise optical properties of the ice leads to increasing complexity in simulating the propagation of photons in the instrument and to a better overall performance of the detector.”

“The photon propagation in the ice is very well-suited to run in graphics processing units (GPUs) hardware, such as those on Comet.” Halzen continued. “Pursuing efficient access to a large amount of GPU computing power is therefore of great importance to ensure that future IceCube analysis reaches the maximum precision and that the full scientific potential of the instrument is exploited.”

Stay tuned for Part II

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Data West Brings Technology Leaders to SDSC

December 6, 2018

Data and technology enthusiasts from around the world descended upon the San Diego Supercomputing Center (SDSC) for the third annual Data West conference, which is taking place this week on the campus of the University o Read more…

By Alex Woodie

Topology Can Help Us Find Patterns in Weather

December 6, 2018

Topology--–the study of shapes-- seems to be all the rage. You could even say that data has shape, and shape matters. Shapes are comfortable and familiar concepts, so it is intriguing to see that many applications are Read more…

By James Reinders

What’s New in HPC Research: Automatic Energy Efficiency, DNA Data Analysis, Post-Exascale & More

December 6, 2018

In this bimonthly feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From exascale to quantum computing, the details are here. Read more…

By Oliver Peckham

HPE Extreme Performance Solutions

AI Can Be Scary. But Choosing the Wrong Partners Can Be Mortifying!

As you continue to dive deeper into AI, you will discover it is more than just deep learning. AI is an extremely complex set of machine learning, deep learning, reinforcement, and analytics algorithms with varying compute, storage, memory, and communications needs. Read more…

IBM Accelerated Insights

Five Steps to Building a Data Strategy for AI

Our data-centric world is driving many organizations to apply advanced analytics that use artificial intelligence (AI). AI provides intelligent answers to challenging business questions. AI also enables highly personalized user experiences, built when data scientists and analysts learn new information from data that would otherwise go undetected using traditional analytics methods. Read more…

Zettascale by 2035? China Thinks So

December 6, 2018

Exascale machines (of at least a 1 exaflops peak) are anticipated to arrive by around 2020, a few years behind original predictions; and given extreme-scale performance challenges are not getting any easier, it makes sense that researchers are already looking ahead to the next big 1,000x performance goal post: zettascale computing. Read more…

By Tiffany Trader

Topology Can Help Us Find Patterns in Weather

December 6, 2018

Topology--–the study of shapes-- seems to be all the rage. You could even say that data has shape, and shape matters. Shapes are comfortable and familiar conc Read more…

By James Reinders

Zettascale by 2035? China Thinks So

December 6, 2018

Exascale machines (of at least a 1 exaflops peak) are anticipated to arrive by around 2020, a few years behind original predictions; and given extreme-scale performance challenges are not getting any easier, it makes sense that researchers are already looking ahead to the next big 1,000x performance goal post: zettascale computing. Read more…

By Tiffany Trader

Robust Quantum Computers Still a Decade Away, Says Nat’l Academies Report

December 5, 2018

The National Academies of Science, Engineering, and Medicine yesterday released a report – Quantum Computing: Progress and Prospects – whose optimism about Read more…

By John Russell

Revisiting the 2008 Exascale Computing Study at SC18

November 29, 2018

A report published a decade ago conveyed the results of a study aimed at determining if it were possible to achieve 1000X the computational power of the the Read more…

By Scott Gibson

AWS Debuts Lustre as a Service, Accelerates Data Transfer

November 28, 2018

From the Amazon re:Invent main stage in Las Vegas today, Amazon Web Services CEO Andy Jassy introduced Amazon FSx for Lustre, citing a growing body of applicati Read more…

By Tiffany Trader

AWS Launches First Arm Cloud Instances

November 28, 2018

AWS, a macrocosm of the emerging high-performance technology landscape, wants to be everywhere you want to be and offer everything you want to use (or at least Read more…

By Doug Black

Move Over Lustre & Spectrum Scale – Here Comes BeeGFS?

November 26, 2018

Is BeeGFS – the parallel file system with European roots – on a path to compete with Lustre and Spectrum Scale worldwide in HPC environments? Frank Herold Read more…

By John Russell

DOE Under Secretary for Science Paul Dabbar Interviewed at SC18

November 21, 2018

During the 30th annual SC conference in Dallas last week, SC18 hosted U.S. Department of Energy Under Secretary for Science Paul M. Dabbar. In attendance Nov. 13-14, Dabbar delivered remarks at the Top500 panel, met with a number of industry stakeholders and toured the show floor. He also met with HPCwire for an interview, where we discussed the role of the DOE in advancing leadership computing. Read more…

By Tiffany Trader

Quantum Computing Will Never Work

November 27, 2018

Amid the gush of money and enthusiastic predictions being thrown at quantum computing comes a proposed cold shower in the form of an essay by physicist Mikhail Read more…

By John Russell

Cray Unveils Shasta, Lands NERSC-9 Contract

October 30, 2018

Cray revealed today the details of its next-gen supercomputing architecture, Shasta, selected to be the next flagship system at NERSC. We've known of the code-name "Shasta" since the Argonne slice of the CORAL project was announced in 2015 and although the details of that plan have changed considerably, Cray didn't slow down its timeline for Shasta. Read more…

By Tiffany Trader

IBM at Hot Chips: What’s Next for Power

August 23, 2018

With processor, memory and networking technologies all racing to fill in for an ailing Moore’s law, the era of the heterogeneous datacenter is well underway, Read more…

By Tiffany Trader

House Passes $1.275B National Quantum Initiative

September 17, 2018

Last Thursday the U.S. House of Representatives passed the National Quantum Initiative Act (NQIA) intended to accelerate quantum computing research and developm Read more…

By John Russell

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learni Read more…

By Rob Farber

Summit Supercomputer is Already Making its Mark on Science

September 20, 2018

Summit, now the fastest supercomputer in the world, is quickly making its mark in science – five of the six finalists just announced for the prestigious 2018 Read more…

By John Russell

AMD Sets Up for Epyc Epoch

November 16, 2018

It’s been a good two weeks, AMD’s Gary Silcott and Andy Parma told me on the last day of SC18 in Dallas at the restaurant where we met to discuss their show news and recent successes. Heck, it’s been a good year. Read more…

By Tiffany Trader

US Leads Supercomputing with #1, #2 Systems & Petascale Arm

November 12, 2018

The 31st Supercomputing Conference (SC) - commemorating 30 years since the first Supercomputing in 1988 - kicked off in Dallas yesterday, taking over the Kay Ba Read more…

By Tiffany Trader

Leading Solution Providers

SC 18 Virtual Booth Video Tour

Advania @ SC18 AMD @ SC18
ASRock Rack @ SC18
DDN Storage @ SC18
HPE @ SC18
IBM @ SC18
Lenovo @ SC18 Mellanox Technologies @ SC18
NVIDIA @ SC18
One Stop Systems @ SC18
Oracle @ SC18 Panasas @ SC18
Supermicro @ SC18 SUSE @ SC18 TYAN @ SC18
Verne Global @ SC18

TACC’s ‘Frontera’ Supercomputer Expands Horizon for Extreme-Scale Science

August 29, 2018

The National Science Foundation and the Texas Advanced Computing Center announced today that a new system, called Frontera, will overtake Stampede 2 as the fast Read more…

By Tiffany Trader

HPE No. 1, IBM Surges, in ‘Bucking Bronco’ High Performance Server Market

September 27, 2018

Riding healthy U.S. and global economies, strong demand for AI-capable hardware and other tailwind trends, the high performance computing server market jumped 28 percent in the second quarter 2018 to $3.7 billion, up from $2.9 billion for the same period last year, according to industry analyst firm Hyperion Research. Read more…

By Doug Black

Nvidia’s Jensen Huang Delivers Vision for the New HPC

November 14, 2018

For nearly two hours on Monday at SC18, Jensen Huang, CEO of Nvidia, presented his expansive view of the future of HPC (and computing in general) as only he can do. Animated. Backstopped by a stream of data charts, product photos, and even a beautiful image of supernovae... Read more…

By John Russell

Germany Celebrates Launch of Two Fastest Supercomputers

September 26, 2018

The new high-performance computer SuperMUC-NG at the Leibniz Supercomputing Center (LRZ) in Garching is the fastest computer in Germany and one of the fastest i Read more…

By Tiffany Trader

Houston to Field Massive, ‘Geophysically Configured’ Cloud Supercomputer

October 11, 2018

Based on some news stories out today, one might get the impression that the next system to crack number one on the Top500 would be an industrial oil and gas mon Read more…

By Tiffany Trader

Intel Confirms 48-Core Cascade Lake-AP for 2019

November 4, 2018

As part of the run-up to SC18, taking place in Dallas next week (Nov. 11-16), Intel is doling out info on its next-gen Cascade Lake family of Xeon processors, specifically the “Advanced Processor” version (Cascade Lake-AP), architected for high-performance computing, artificial intelligence and infrastructure-as-a-service workloads. Read more…

By Tiffany Trader

Google Releases Machine Learning “What-If” Analysis Tool

September 12, 2018

Training machine learning models has long been time-consuming process. Yesterday, Google released a “What-If Tool” for probing how data point changes affect a model’s prediction. The new tool is being launched as a new feature of the open source TensorBoard web application... Read more…

By John Russell

The Convergence of Big Data and Extreme-Scale HPC

August 31, 2018

As we are heading towards extreme-scale HPC coupled with data intensive analytics like machine learning, the necessary integration of big data and HPC is a curr Read more…

By Rob Farber

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This