AI-Based ‘Artificial Chemist’ Directs Automated Lab to Create Better MRI Contrast Agents

January 6, 2022

Artificial-intelligence algorithm developed on XSEDE-allocated systems promises better MRI agents with unprecedented speed of discovery

Jan. 6, 2022 — Using XSEDE-allocated supercomputers, scientists at Carnegie Mellon University (CMU) have created an “artificial chemist,” a computer program that mimics the expertise of human chemists. The artificial intelligence (AI) system is capable of directing an automated laboratory to synthesize new contrast agents for medical MRI imaging. The new contrast agents, thanks to the AI, have a ratio of signal-to-noise as much as 50% higher than previous state-of-the-art, human-designed materials. This performance boost offers the possibility of more detailed medical scans of the human body, improving diagnosis.

Thanks to the back-and-forth between the CMU AI’s exploration of improved MRI contrast agents (left) and lab testing of the candidates by a robotic instrument (center), the work produced 397 candidates with improved performance compared with known contrast agents. Surprisingly, while as expected the best-performing of these candidates had a high number of fluorine-19 atoms in their structure, they showed better performance than known contrast agents with more of the element (right).

The CMU team, led by prof. Olexandr Isayev, built their AI using advanced research computers at XSEDE resources the Pittsburgh Supercomputing Center (PSC) and the Texas Advanced Computing Center (TACC). The robotic lab instrument is located at the University of North Carolina at Chapel Hill (UNC). The collaborators plan to develop the software so that it’s capable of more general chemical design for other applications in medicine, chemistry and materials science.

Why It’s Important

New chemical compounds – particularly a type of chemical called a polymer, which is made up of smaller building blocks called monomers – are a mainstay of advancement in chemistry, medicine, computing and other fields. A major limitation in this field, though, is that humans, who learn from each other, may be in “ruts,” working the same way they always have and not seeing promising alternatives. The CMU-led team wanted to create a general-purpose AI chemist that could teach itself how to select combinations of  monomers, avoiding human bias.

“Previous efforts in materials discovery have relied on either luck or human intuition, which both suffer from inherent biases and limitations in knowledge,” said Olexandr Isayev at CMU.

To create an AI chemist that could forge new paths in synthetic chemistry, the scientists used an approach called automated machine learning. The first step in the process is the usual one of selecting the best performer from a large group of possible machine learning models. The scientists then refine the winning model by real-world testing of the resulting contrast-agent candidates in the UNC lab, putting the results of that testing back into the AI. By going back and forth between the computer and the lab, the AI could correct its mistakes and biases.

The overall project posed serious challenges. The first was a truly vast number of possible polymers that could be produced by the automated lab’s reagent set. Unlike working with simulated or historical data, the development of an AI algorithm plus acquiring new data on the fly by real experiments required taking into account the cost and number of experiments. To some extent, the team could control this by giving the AI a limited set of reagents – but even for a small set of six organic monomers, the space of possible experiments was over 50,000. The CMU team would need powerful computing resources. They would also have to refine the AI model in repeated training steps to cover the huge multidimensionality of the problem in search of the best-performing polymers while conducting only a small fraction of possible experiments. To overcome this challenge, they turned to XSEDE.

How XSEDE Helped

XSEDE supplied the group with access to powerful supercomputers containing graphics processing units, or GPUs. These processors were originally designed to create realistic images for computer gaming. But their unique capabilities for “parallel computing” proved to be ideal for AI research. Starting in 2012, a GPU revolution swept the AI field, powering many of the groundbreaking AI tools we now take for granted.

PSC’s XSEDE-allocated Bridges-2 system, as well as the NSF Petascale Computing Resource Frontera at TACC, offered the team powerful new GPUs to “train” the AI, as well as the massive memory (RAM) needed to keep track of the problem’s multidimensionality.

Filipp Gusev, a graduate student in Isayev’s group in the joint CMU-University of Pittsburgh PhD Program in Computational Biology, designed the AI software in a way that didn’t rely on historically biased knowledge as experienced by humans. Instead, in a process called machine learning, or ML, it started with a small “training set” of successful MRI contrast agents to use as a starting point for the model. By acquiring new information in a process called active learning, the AI tested its predictions of what made a polymer a good contrast agent against a “testing set” of polymers whose effectiveness weren’t labeled, correcting itself when it predicted wrongly and requesting new data. Finally, exploring the chemistry of a representative group of synthesized polymers selected by AI without human supervision, it came up with its own set of rules.

The work was enabled by the robotic system built by the Frank Leibfarth group at UNC Chapel Hill. Leibfarth’s group  had built an automated continuous-flow system designed to build polymers that can be used to create plastics, packaging and a number of useful materials. The lab provides the “hands” of the operation. The “brain” was supplied by the Isayev team’s AI. Graduate student Marcus Reis of Leibfarth’s group was the co-first author for the work from UNC.

“Even a small model space leads to intensive computing requirements. Because the calculations are done over a set of classes of machine learning models exhaustively to get maximum information from the data and repeated on each refinement after getting new data, we needed a lot of computing power. [XSEDE] helped us speed up this project,” said Filipp Gusev at CMU.

Through a series of eight refinements, Gusev’s AI was able to narrow a potential 50,000 polymers to a list of only 397 experimentally synthesized. Iterating between the computer and the lab identified the best performing of these candidates. These performed as much as 50% better than current MRI contrast agents.

These winning candidates posed a surprise to the human chemists. Clinical MRI works by detecting changes in a strong magnetic field created by substances in the human body. One family of MRI contrast agents uses the isotope fluorine-19 (19F), which has the ability to interact with dissolved oxygen in body fluids. This interaction can be detected in a strong magnetic field and tells doctors where oxygen is concentrated in living tissues.

Scientists had long thought that more is better in terms of 19F solution concentration – the more 19F atoms that a contrast agent could pack in a smaller space, the better. But 19F also makes the polymer less soluble in water – and if the polymer can’t be dissolved, it can’t be injected.

The leading candidates the AI picked did contain enough 19F to create a strong signal. As they had hoped, the AI had found a “Goldilocks” point of just enough to give a strong signal while still being soluble, a point that humans had not predicted. The result offers hope that AI-guided design can create chemical tools that surpass what human experts can design.

The CMU team reported their results in a paper in the Journal of the American Chemical Society in November, 2021. They would next like to extend the approach toward other types of polymers and organic materials.

O.I. acknowledges support from NSF CHE-1802789 and CHE- 2041108. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562. Specifically, it used the Bridges-2 system, which is supported by NSF award number ACI-1928147, at the Pittsburgh Supercomputing Center. This research is part of the Frontera computing project at the Texas Advanced Computing Center. Frontera is made possible by the National Science Foundation award OAC-1818253. F.L. acknowledges the UNC Department of Chemistry’s NMR Core Laboratory that provided expertise and instrumentation that enabled this study with support from the National Science Foundation (CHE-1828183 and CHE-0922858).


Source: Ken Chiacchia, Pittsburgh Supercomputing Center

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

IBM Quantum Summit: Two New QPUs, Upgraded Qiskit, 10-year Roadmap and More

December 4, 2023

IBM kicks off its annual Quantum Summit today and will announce a broad range of advances including its much-anticipated 1121-qubit Condor QPU, a smaller 133-qubit Heron QPU, that’s optimized for combining with multipl Read more…

The Annual SCinet Mandala

November 30, 2023

Perhaps you have seen images of Tibetan Buddhists creating beautiful and intricate images with colored sand. These sand mandalas can take weeks to create, only to be ritualistically dismantled when the image is finished. Read more…

Alibaba Shuts Down its Quantum Computing Effort

November 30, 2023

In case you missed it, China’s e-commerce giant Alibaba has shut down its quantum computing research effort. It’s not entirely clear what drove the change. Reuters’ reported earlier this week that Alibaba “cut a Read more…

SC23: The Ethics of Supercomputing

November 29, 2023

Why should HPC practitioners care about ethics? And, what are our ethics in HPC? These questions were central to a lively discussion at the SC23 Birds-of-a-Feather (BoF) session: With Great Power Comes Great Responsib Read more…

Grace Hopper’s Big Debut in AWS Cloud While Graviton4 Launches

November 29, 2023

Editors Note: Additional Coverage of the AWS-Nvidia 65 Exaflop ‘Ultra-Cluster’ and Graviton4 can be found on our sister site Datanami. Amazon Web Services will soon be home to a new Nvidia-built supercomputer that Read more…

AWS Solution Channel

Deploying AI/ML at the Edge with Omniflow’s Sustainable Smart Lamppost, NVIDIA, and AWS

Imagine a world where a lamppost does more than just illuminate streets; it actively contributes to a smarter, safer, and more sustainable community. Using Amazon Web Services (AWS) and NVIDIA technologies, Omniflow is turning this vision into a reality. Read more…

QCT Solution Channel

QCT and Intel Codeveloped QCT DevCloud Program to Jumpstart HPC and AI Development

Organizations and developers face a variety of issues in developing and testing HPC and AI applications. Challenges they face can range from simply having access to a wide variety of hardware, frameworks, and toolkits to time spent on installation, development, testing, and troubleshooting which can lead to increases in cost. Read more…

Give a Little (on Tuesday), Get a Lot

November 28, 2023

HPC is built on open source. While building HPC systems with "open plumbing" has enormous advantages, there can also be some challenges. As illustrated in the classic XKCD comic, the entire dependency tree of many usefu Read more…

IBM Quantum Summit: Two New QPUs, Upgraded Qiskit, 10-year Roadmap and More

December 4, 2023

IBM kicks off its annual Quantum Summit today and will announce a broad range of advances including its much-anticipated 1121-qubit Condor QPU, a smaller 133-qu Read more…

The Annual SCinet Mandala

November 30, 2023

Perhaps you have seen images of Tibetan Buddhists creating beautiful and intricate images with colored sand. These sand mandalas can take weeks to create, only Read more…

SC23: The Ethics of Supercomputing

November 29, 2023

Why should HPC practitioners care about ethics? And, what are our ethics in HPC? These questions were central to a lively discussion at the SC23 Birds-of-a-Fe Read more…

Grace Hopper’s Big Debut in AWS Cloud While Graviton4 Launches

November 29, 2023

Editors Note: Additional Coverage of the AWS-Nvidia 65 Exaflop ‘Ultra-Cluster’ and Graviton4 can be found on our sister site Datanami. Amazon Web Service Read more…

Analyst Panel Says Take the Quantum Computing Plunge Now…

November 27, 2023

Should you start exploring quantum computing? Yes, said a panel of analysts convened at Tabor Communications HPC and AI on Wall Street conference earlier this y Read more…

SCREAM wins Gordon Bell Climate Prize at SC23

November 21, 2023

The first Gordon Bell Prize for Climate Modeling was presented at SC23 in Denver. The award went to a team led by Sandia National Laboratories that had develope Read more…

SC23 BOF: Inclusivity Progress and Challenges

November 21, 2023

New to SC23 was a series of talks on Inclusivity topics. Sponsored by the Inclusivity Committee and open to all conference attendees, these 90-minute birds-of-a Read more…

Supercomputing 2023: Odds and Ends from the Show

November 20, 2023

This year's fantastic Supercomputing 2023 was back in full form. Attendees seemed to be glad that the show was back in Denver, which was a preferred destination Read more…

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

Leading Solution Providers

Contributors

SC23 Booth Videos

AMD @ SC23
AWS @ SC23
Altair @ SC23
CoolIT @ SC23
Cornelis Networks @ SC23
CoreHive @ SC23
DDC @ SC23
HPE @ SC23 with Justin Hotard
HPE @ SC23 with Trish Damkroger
Intel @ SC23
Intelligent Light @ SC23
Lenovo @ SC23
Penguin Solutions @ SC23
QCT Intel @ SC23
Tyan AMD @ SC23
Tyan Intel @ SC23
HPCwire LIVE from SC23 Playlist

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire