Aleksei Aksimentiev, University of Illinois Professor of Physics, and graduate student Jingqian Liu, co-authors on the recent protein identification study. (Credit: TACC)

Lead author Henry Brinkerhoff, who pioneered this work as a postdoc in physicist Cees Dekker’s lab, likens the protein to a necklace with different-sized beads. “Imagine you turn on the tap as you slowly move that necklace down the drain, which in this case is the nanopore,” he said. “If a big bead is blocking the drain, the water flowing through will only be a trickle; if you have smaller beads in the necklace right at the drain, more water can flow through.”

With their technique, the researchers can measure the amount of ion current very precisely — but not exactly, because the step-wise passage through the pore is irregular. However, by loading the liquid medium with helicases, the researchers can get many separate, overlapping reads of the same molecule, or in their terms, they can “rewind” the protein and read its amino acid sequence again. Doing so, reduced the errors from 13% to practically zero.

Their approach allowed the researchers to discriminate peptide variants that differed by only a single-amino acid — something they proved by creating synthetic peptides with only one amino acid changed and showing the system could discriminate among them.

But to read out the individual amino acids, they first had to know what sort of signal each one produces at it travels through the pore. Some of these signals may be counterintuitive, the researchers found.

For instance, when the bulky tryptophan amino acid moved through the constriction, the ion current first decreased and then, counterintuitively, increased relative to the small and medium-sized variants.

To understand the origin of these patterns, the team relied on supercomputer simulations by computational biologist Aleksei Aksimentiev (UIUC), performed on several of the fastest supercomputers available to academic researchers in the world: Frontera, at the Texas Advanced Computing Center; Blue Waters, at the National Center for Supercomputing Applications; and Expanse, at the San Diego Supercomputer Center.

Aksimentiev’s team used a method called molecular dynamics simulation to recreate the behavior of the nanopore, proteins, and the surrounding medium, with atomic resolution. Such simulations cannot fully capture the true timescale of the nanopore activity, which extends to seconds. But by generating 40 to 50 initial states at different positions, and then running 70 simulations in parallel, the team was able to derive statistics for different confirmations of peptides. From those, they computed the current and compared it to experiments. This computational work was led by Jingqian Liu, a biophysics graduate student in Aksimentiev’s lab.

The simulations included 30,000 atoms interacting over 200 to 500 nanosecond and were able to match experimental results. More importantly, they showed why certain amino acids produce counterintuitive signals as they pass through the nanopore. In the case of the tryptophan variant, the signal could be traced back to a binding of the peptide side chain to the nanopore surface above the constriction.

“For each specific conformation, we could see what happened to the sidechain, whether it interacts with the surface or remains inside of the pore,” said Aksimentiev, professor of Physics at UIUC. “Then we could establish directly that the binding of the sidechain enhanced the current.”

The simulations took weeks to generate on Frontera, currently the 10th fastest supercomputer in the world and the most powerful at any university. But they would have taken years with the type of computing cluster available on most campuses. The single protein identification research — for which there is a global race for success — was published online by Science as a “First Release” on November 4, 2021. The research was supported by the Dutch Research Council, U.S. National Institutes of Health, and U.S. National Science Foundation, among others.

“There are tremendous opportunities to develop diagnostics by reading individual protein using this nanopore approach,” Aksimentiev said. “The computation will play a big role in developing these technologies. It’s amazing that with computer models we can reproduce experiments and tell what sort of interactions are going on on the nano-scale.”

Not only that, computer models provide a different modality for design, allowing researchers to test nanopores of different size or with strategically placed residues that can produce enhanced signals.

More work is required to perform reads longer than 20 amino acids and to identify amino acids that are heterogeneously charged, but Aksimentiev ventures that in three to five years it may be possible to develop a working model.

“We think that our new approach will allow us to detect post-translational changes,” said Dekker, “and thus shine some light on the proteins that we carry with us.”


Source: Aaron Dubrow, Texas Advanced Computing Center