Why AI Alone Won’t Solve Drug Discovery

By Alex Woodie

August 31, 2022

Verseon is an AI-powered drug company – in theory, anyway. The Bay Area startup does use machine learning to help predict which combination of molecules will yield novel compound to test in the lab. With over a dozen drug candidates at different stages of development, Verseon’s approach seems to be working. So when the company’s CEO had some choice words about AI, it was a little bit surprising.

“This is one of the problems,” Adityo Prakash, the CEO and co-founder of Verseon, told HPCwire sister publication Datanami. “People are saying, ‘Oh, AI will solve everything.’ They give you fancy words. ‘We’ll ingest all of this longitudinal data and we’ll do latitudinal analysis.’

“It’s all garbage,” he says. “It’s just hype.”

To be sure, there are misconceptions around AI, what it is capable of, and what it is not capable of. Some people seem to imagine AI as a magical box that generates accurate answers to complex questions based on input data. Have a tough problem? Put AI in the pipeline, and your problem is solved.

In reality, getting AI to work on real-world problems is a lot more difficult than that. While AI can do some pretty cool stuff, it’s applicability is quite a bit narrower than your average Joe on the street is led to believe.

AI can only solve problems for which there is adequate training data. And considering the enormous expanses of emptiness when it comes to data in pharmacological field, AI really hasn’t even begun to scratch the surface of what’s possible, according to Prakash.

‘Nowhere Close’

We have barely scratched the surface when it comes to exploring the universe of chemical compounds (Image courtesy Verseon)

Before the interview earlier this month, Prakash read a Datanami story about the recent Accenture study that found only 12% of companies are actually leveraging AI. That strikes a note with Prakash, who seems to suspect that number might be high.

“I can assure you, those companies aren’t in the AI medicine space,” the CEO says. “There’s a lot of hype in using quote-unquote ‘AI’ for solving all that ails medicine. But it’s really nowhere close to being used properly.”

The big reason? Biology, it turns out, isn’t a domain for big data, Prakash says. “It‘s a domain of small data,” he says.

While the universe of potential molecular compounds and drugs is essentially infinite, our knowledge of that space is severely limited, Prakash says. The process of testing new compounds is painstakingly slow. Once a potential candidate is established by chemists, a sample is put into a test chamber, and other chemicals are added to gauge the effect.

Robotics are employed here to speed up the process. “It’s called high-throughput screening,” Prakash says. “But it’s basically a fancy name for trial and error.”

The drug giants of the world – the Pfizers, Novartises, and Roches–have the means to ramp up this testing regime. But even with all the billions of dollars these companies put into R&D, their collections of compounds – or drug “backbones” as Prakash calls them – will probably number only 6 or 7 million, collectively, he says.

Compare that to his estimate of the total number of possible chemical compounds in the universe, which is on the order of 10 to the 33rd power.

“You realize, you’re not even fishing in a tidepool,” he says. “You’re fishing in a tiny little shot glass.”

What’s more, that shot glass isn’t even as diverse as it could be, Prakash says, as it’s filled with “me-too” compounds that have the same backbone, the same scaffold, but small little chemical tweaks.

What’s a drug discovery company to do?

Back to Physics

Verseon built a proprietary physics-based modeling library that can piece known 100,000 molecules together in novel ways (Image courtesy Verseon)

Once you have a sample compound and you know something about it, AI has the power to make predictions for you. It can tell you, based on molecular modeling, what impacts it likely will have on the body, including effectiveness, side effects, etc.

But the big problem is coming up with the promising novel compounds to test to begin with, Prakash says. This is where Verseon has taken a novel approach.

“The solution to this is actually doing something a little different,” Prakash says. “The solution to this is to see if we can generate synthetic data, in modern AI parlance.”

Verseon has devised a system that includes a database of known molecules. The physics model has a collection of more than 100,000 molecular building blocks that it can piece together, like atomic Lego blocks, according to Prakash.

“You have to first create that virtual ocean of possibility on the computer,” he says. “This is the first time we have an unlimited supply of novel drug-like, but most importantly, synthesizable compounds. It’s a dynamic molecule creation engine. I can point it to any part of that chemical universe and make as many as I want.”

The next step involves a proprietary physics model that that determine how novel binders will actually work with real-life proteins. This is where many drug-discovery systems have failed, because they don’t adequately take into account all of the various factors influencing the marriage of chemical to protein, including things like where the checmical bonds take place, on which spike proteins, and how water will act as a “chaperone” between the two, Prakash says.

“You have to be able to do this by modeling the molecular physics at a level of accuracy that presently was impossible before,” Prakash says. “But if you can do that, then you can run against any proteins of interest [and explore] this vast area of possibilities from that chemical ocean to come up with perfect custom designed binders.”

Any given run on Verseon’s HPC cluster can generate upwards of a couple of hundred families of chemical backbones, Prakash says. The company sends the most promising of the theoretical compounds to the chemistry lab, where they are synthetized into an actual compound and tested using the standard methods. If all goes well–and the failure rate of course is high–it then moves onto the clinic for testing.

Proof and Pudding

Prakash says the company originally intended to build this software and then sell it. But after considering how difficult it was to come up with potential drug candidates, and how profitable getting just one successful drug through all the testing phases, it decided to keep its invention and become a drug company instead.

“When we originally started the company, our plan was to build a toolset that other people would use,” he says. “[But] on a per-answer basis, the answers are so valuable, you build your own pharmaceutical engine, which is what we have evolved into.”

Once physics-based modeling generates possible synthetic compounds, AI can help narrow down the best candidates based on various criteria (Image courtesy Verseon)

Verseon has invested about $150 million into building the full platform and the current drug pipeline, the CEO says. The company’s cluster, which is used for HPC modeling and AI, offers about 2 petaflops of computational power, which Prakash expects will grow.

Over the past few years, Verseon has commenced 45 drug discovery programs, and currently has 16 drug candidates across three major disease categories, including cardio-pulmonary diseases, cancer, and infectious diseases. Several of these drugs have completed preclinical trials and are set to begin phase one clinical trials; one has already begun phase one.

The company is bullish on its approach, and claims that the drug candidates it has discovered could not have been found any other way. “Nobody is even remotely close,” Prakash says. “It’s built on fundamental advancements in science that don’t exist anywhere else.”

“These are completely novel molecules that the computer has predicted,” he continues. “I’m actually filling in useful islands of data in that vast chemical ocean.”

Prakash says Verseon’s approach has three main benefits over the traditional approach in the areas of novelty, numbers, and speed. These benefits come down to the interplay between the physics model generating synthetic data, but AI also plays a role, especially in helping to recommend nearby compounds when the library of compounds and its impacts begins to get filled out. The company also uses AI to assist with patient selection.

Both AI and HPC are needed to make this approach work.

“The AI and HPC physics modeling, they work kind of hand in hand to really do this in a way that’s feasible and workable,” he says. “For some things, deep learning is great, when we have dense data somewhere.”

“[But] you cannot just use existing AI tools and pull down some existing libraries and say, ‘Here is some deep learning model,’ and it’s done,” Prakash says. “You need to have the technical chops to be able to modify and build even your own AI tools as necessary to be able to do this.”

Considering the amount of hype heaped on AI for drug discovery, Verseon’s “back to physics” approach is certainly novel and refreshing. It will be interesting to see what new drug candidates the company can generate using this method, and whether it will inspire other companies to take a “me-too” approach.

This article first appeared on Datanami.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Empowering High-Performance Computing for Artificial Intelligence

April 19, 2024

Artificial intelligence (AI) presents some of the most challenging demands in information technology, especially concerning computing power and data movement. As a result of these challenges, high-performance computing Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that have occurred about once a decade. With this in mind, the ISC Read more…

2024 Winter Classic: Texas Two Step

April 18, 2024

Texas Tech University. Their middle name is ‘tech’, so it’s no surprise that they’ve been fielding not one, but two teams in the last three Winter Classic cluster competitions. Their teams, dubbed Matador and Red Read more…

2024 Winter Classic: The Return of Team Fayetteville

April 18, 2024

Hailing from Fayetteville, NC, Fayetteville State University stayed under the radar in their first Winter Classic competition in 2022. Solid students for sure, but not a lot of HPC experience. All good. They didn’t Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use of Rigetti’s Novera 9-qubit QPU. The approach by a quantum Read more…

2024 Winter Classic: Meet Team Morehouse

April 17, 2024

Morehouse College? The university is well-known for their long list of illustrious graduates, the rigor of their academics, and the quality of the instruction. They were one of the first schools to sign up for the Winter Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that ha Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use o Read more…

MLCommons Launches New AI Safety Benchmark Initiative

April 16, 2024

MLCommons, organizer of the popular MLPerf benchmarking exercises (training and inference), is starting a new effort to benchmark AI Safety, one of the most pre Read more…

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

April 15, 2024

As the AI revolution marches on, it is vital to continually reassess how this technology is reshaping our world. To that end, researchers at Stanford’s Instit Read more…

Intel’s Vision Advantage: Chips Are Available Off-the-Shelf

April 11, 2024

The chip market is facing a crisis: chip development is now concentrated in the hands of the few. A confluence of events this week reminded us how few chips Read more…

The VC View: Quantonation’s Deep Dive into Funding Quantum Start-ups

April 11, 2024

Yesterday Quantonation — which promotes itself as a one-of-a-kind venture capital (VC) company specializing in quantum science and deep physics  — announce Read more…

Nvidia’s GTC Is the New Intel IDF

April 9, 2024

After many years, Nvidia's GPU Technology Conference (GTC) was back in person and has become the conference for those who care about semiconductors and AI. I Read more…

Google Announces Homegrown ARM-based CPUs 

April 9, 2024

Google sprang a surprise at the ongoing Google Next Cloud conference by introducing its own ARM-based CPU called Axion, which will be offered to customers in it Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Baidu Exits Quantum, Closely Following Alibaba’s Earlier Move

January 5, 2024

Reuters reported this week that Baidu, China’s giant e-commerce and services provider, is exiting the quantum computing development arena. Reuters reported � Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Shutterstock 1179408610

Google Addresses the Mysteries of Its Hypercomputer 

December 28, 2023

When Google launched its Hypercomputer earlier this month (December 2023), the first reaction was, "Say what?" It turns out that the Hypercomputer is Google's t Read more…

AMD MI3000A

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Leading Solution Providers

Contributors

Shutterstock 1606064203

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

China Is All In on a RISC-V Future

January 8, 2024

The state of RISC-V in China was discussed in a recent report released by the Jamestown Foundation, a Washington, D.C.-based think tank. The report, entitled "E Read more…

Shutterstock 1285747942

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

DoD Takes a Long View of Quantum Computing

December 19, 2023

Given the large sums tied to expensive weapon systems – think $100-million-plus per F-35 fighter – it’s easy to forget the U.S. Department of Defense is a Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

Eyes on the Quantum Prize – D-Wave Says its Time is Now

January 30, 2024

Early quantum computing pioneer D-Wave again asserted – that at least for D-Wave – the commercial quantum era has begun. Speaking at its first in-person Ana Read more…

GenAI Having Major Impact on Data Culture, Survey Says

February 21, 2024

While 2023 was the year of GenAI, the adoption rates for GenAI did not match expectations. Most organizations are continuing to invest in GenAI but are yet to Read more…

The GenAI Datacenter Squeeze Is Here

February 1, 2024

The immediate effect of the GenAI GPU Squeeze was to reduce availability, either direct purchase or cloud access, increase cost, and push demand through the roof. A secondary issue has been developing over the last several years. Even though your organization secured several racks... Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire