Behind the Gordon Bell Prize-Winning Spike Protein Simulations

By Oliver Peckham

March 11, 2021

Four months ago, Rommie Amaro and her colleagues were accepting the first-ever Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research. At the time, cases were slowly ramping up in advance of what we now know was to become a devastating winter surge. When Amaro spoke to the National Science Foundation (NSF) last week, on the other hand, the setting was different: we now know the vaccines work, cases are plummeting, and looking back on the pandemic suddenly doesn’t seem like a fanciful notion.

Dr. Rommie Amaro, professor and endowed chair in the Department of Chemistry and Biochemistry at the University of California, San Diego, leads the Amaro Lab.

It’s been over a year since Amaro and her co-authors published the first atomic-level simulation of the full-length spike protein – the now-notorious mechanism that allows SARS-CoV-2 to invade human cells, which is targeted by all of the approved vaccines. Just before the novel coronavirus truly began ramping up, Amaro’s lab at the University of California, San Diego was wrapping up some lengthy research.

“Until about February of last year, my lab … had been focused for a number of years in studying the influenza virus and its glycoproteins,” Amaro said. They published in early 2020 – and then as Italy began to fall to COVID-19, they realized what they had to do. By mid-February, researchers from the University of Texas at Austin and the National Institutes of Health (NIH) had provided the necessary data to get started: the first cryo-electron microscopy structure of the virus’ spike protein. “The day that structure dropped into the bioRxiv … was when we really pivoted our efforts to SARS-CoV-2,” Amaro recalled.

Her lab worked quickly, bringing AI, analytics and HPC to bear with startling speed to produce that first computational model of the spike protein. “If two years ago, you would have said that you – this would have been unimaginable, to think that anybody could have accomplished [it],” Amaro said. “And it was just only possible because of just a tremendous sort of collaborative effort from a number of teams.”

The lab’s computational biology approach allowed them to “see” elements hidden from real-world microscopy. “We build these highly detailed atomic-level models and then we’re approximating that system down to its many atoms,” Amaro explained. “And so all we’re doing is defining a potential function [that] basically describes the interactions that all the atoms in our system have with each other, and then we’re simply integrating Newton’s equation of motion over time[.] And we perform this numerical integration millions and billions and trillions of time[.]”

“We want to sort of give more insight into the bits that they cannot see with the experiments,” she continued. “And this is, I think, the beautiful synergy that exists at this interdisciplinary interface between experimental science and computational science, but also together with physics and chemistry and biology and math.”
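The loop Amaro describes (define a potential function, then integrate Newton's equations of motion step by step) can be sketched in a few lines of Python. This is a toy illustration only, not the team's code: the Lennard-Jones potential, the velocity Verlet integrator, the reduced units and every name below (`lj_forces`, `simulate`, `TRIANGLE`) are assumptions chosen for brevity, whereas the real simulations relied on production molecular dynamics software and detailed biomolecular force fields.

```python
import math

# Hypothetical starting geometry: three atoms on an equilateral triangle
# with side 1.2, in reduced (dimensionless) Lennard-Jones units.
TRIANGLE = [
    [0.0, 0.0, 0.0],
    [1.2, 0.0, 0.0],
    [0.6, 1.2 * math.sqrt(3.0) / 2.0, 0.0],
]


def lj_forces(positions, epsilon=1.0, sigma=1.0):
    """Return (forces, potential_energy) for a pairwise Lennard-Jones potential."""
    n = len(positions)
    forces = [[0.0, 0.0, 0.0] for _ in range(n)]
    energy = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            d = [positions[i][k] - positions[j][k] for k in range(3)]
            r2 = sum(c * c for c in d)
            s6 = (sigma * sigma / r2) ** 3  # (sigma / r)^6
            energy += 4.0 * epsilon * (s6 * s6 - s6)
            # Force on atom i is -dU/dr along d; this factor is already divided by r.
            f_over_r = 24.0 * epsilon * (2.0 * s6 * s6 - s6) / r2
            for k in range(3):
                forces[i][k] += f_over_r * d[k]
                forces[j][k] -= f_over_r * d[k]  # Newton's third law
    return forces, energy


def kinetic_energy(velocities, mass):
    return 0.5 * mass * sum(c * c for v in velocities for c in v)


def simulate(positions, velocities, mass=1.0, dt=0.001, steps=1000):
    """Integrate Newton's equations with velocity Verlet.

    Returns (positions, velocities, energies), where energies holds the
    total energy before the first step and after every step.
    """
    pos = [list(p) for p in positions]  # copy so the inputs are not mutated
    vel = [list(v) for v in velocities]
    forces, potential = lj_forces(pos)
    energies = [potential + kinetic_energy(vel, mass)]
    for _ in range(steps):
        for i in range(len(pos)):
            for k in range(3):
                vel[i][k] += 0.5 * dt * forces[i][k] / mass  # half kick
                pos[i][k] += dt * vel[i][k]                  # drift
        forces, potential = lj_forces(pos)                   # forces at new positions
        for i in range(len(pos)):
            for k in range(3):
                vel[i][k] += 0.5 * dt * forces[i][k] / mass  # second half kick
        energies.append(potential + kinetic_energy(vel, mass))
    return pos, vel, energies


if __name__ == "__main__":
    at_rest = [[0.0, 0.0, 0.0] for _ in TRIANGLE]
    _, _, energies = simulate(TRIANGLE, at_rest)
    print("total energy drift:", energies[-1] - energies[0])
```

Total energy should stay nearly constant over the run, which is a quick sanity check on any integrator of this kind. A production run repeats the same force-then-update cycle for vastly more atoms and steps, which is where the supercomputers come in.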

Into the abyss

With that first model done, the team delved deeper, producing models of the receptor-binding domain (RBD) of the spike protein in its “open” and “closed” conformations, which had been observed experimentally but whose purpose still vexed researchers. As they fleshed out more and more of the spike protein, they arrived at its glycans.

“The proteins get sort of this extra decoration, this extra flourish of sugars, or glycans,” Amaro said. “Literally, if you look at this, it sort of looks like ornaments on a Christmas tree.” These ornaments serve, by and large, to shield the protein from the scrutiny of the human immune system, which sees the sugary coatings as innocuous.

In partnership with a wide range of institutions, the researchers were able to reconstruct these glycans on the spike protein using molecular “recipes” that determine their structure. And, finally, this allowed them to reconstruct the full-length spike protein in excruciating detail: multiple states, all of its glycans, membranes with different lipids and much more.

“And then, we sort of simulate it, right?” Amaro said. “And so we start to see how these atoms move and how they wiggle and jiggle.”

And, it turns out, that wiggling and jiggling was quite revealing. Using the new simulations, the researchers saw that those mysterious open and closed conformations of the spike protein served, in fact, to expose the RBD beyond the glycan shields in preparation for binding with human cells.

“Instead of calling this ‘up’ and ‘down,’ if they had known about the sugars at the time when they were first naming, they would have called it a defending mode and an attacking mode,” Amaro said.

Image courtesy of Rommie Amaro.

This invaluable work won the wide-ranging team the Gordon Bell Special Prize at SC20. Along the way, of course, they used a similarly wide range of supercomputers, including heavy-hitters like Summit at the Oak Ridge Leadership Computing Facility (OLCF) and Frontera at the Texas Advanced Computing Center (TACC).

“This is more than just graphics,” Amaro said. “These are more than just pretty pictures. It’s not a video game. These are molecular dynamics simulations – this is numerical, statistical mechanics. And so what that means is that … this motion that we’re predicting is done in accordance with rigorous theoretical laws – to, of course, some approximation, but what’s powerful about this is that it allows us to extract from these microscopic properties macroscopic, experimentally testable predictions.”

What’s next?

While there is now – at long last – an air of finality and retrospection to discussions of the COVID-19 pandemic, Amaro and her colleagues aren’t hanging up their hats just yet.

“As we keep going, you know, we’ve also been very interested to develop models of the entire virus,” she said, adding that she was “intensely interested” in the airborne transmissibility of the virus as a research topic.

The work also provided fertile ground for some new norms – at least, for now. 

“Scientists, we always hold our cards close to our chest because science rewards people being first,” Amaro said. But, of course, such siloed work was not conducive to ending a pandemic. “And so in March, we drafted a set of principles that nearly every molecular simulation group in the world committed to. This included the use of preprint servers, FAIR data, sharing of systems and … all of that. It led to the creation of the COVID-19 Molecular Structure and Therapeutics Hub, which is another NSF-sponsored investment in molecular simulation. … It’s like a clearinghouse for simulations, data, systems, methods all over the world.”

The dataset Amaro and her colleagues produced on Frontera has already been downloaded more than 4,000 times. This kind of open access to data and software, she said, was crucial to ensure that when “the next thing hits,” researchers will be ready.


The work discussed in this article involved an extensive array of institutions and individuals who produced multiple academic papers. The Gordon Bell Prize-winning paper, titled “AI-Driven Multiscale Simulations Illuminate Mechanisms of SARS-CoV-2 Spike Dynamics,” was authored by Lorenzo Casalino, Abigail Dommer, Zied Gaieb, Emilia P. Barros, Terra Sztain, Surl-Hee Ahn, Anda Trifan, Alexander Brace, Anthony Bogetti, Heng Ma, Hyungro Lee, Matteo Turilli, Syma Khalid, Lillian Chong, Carlos Simmerling, David J. Hardy, Julio D. C. Maia, James C. Phillips, Thorsten Kurth, Abraham Stern, Lei Huang, John McCalpin, Mahidhar Tatineni, Tom Gibbs, John E. Stone, Shantenu Jha, Arvind Ramanathan and Rommie E. Amaro. The paper can be accessed here.

Header image: an image from Rommie Amaro’s acknowledgements slide highlighting some of the collaborators on the research.
