TACC’s Frontera Enables AI Analysis of X-ray and Genetic Data for Skeletal Research

September 14, 2023

Sept. 14, 2023 — Using artificial intelligence (AI) to analyze tens of thousands of X-ray images and genetic sequences, researchers from The University of Texas at Austin and New York Genome Center have been able to pinpoint the genes that shape our skeletons, from the width of our shoulders to the length of our legs.

The genes that shape the human skeleton have been identified, in research that used TACC’s Frontera supercomputer to run analysis of deep learning models. Image Credit: TACC.

The research, published July 2023 as the cover article in Science, pulls back a curtain on our evolutionary past and opens a window into a future where doctors can better predict patients’ risks of developing conditions such as back pain or arthritis in later life.

“Our research is a powerful demonstration of the impact of AI in medicine, particularly when it comes to analyzing and quantifying imaging data, as well as integrating this information with health records and genetics rapidly and at large scale,” said Vagheesh Narasimhan, an assistant professor of integrative biology as well as statistics and data science, who led the multidisciplinary team of researchers to provide the genetic map of skeletal proportions.

Humans are the only large primates to have longer legs than arms, a change in the skeletal form that is critical in enabling the ability to walk on two legs. The scientists sought to determine which genetic changes underlie anatomical differences that are clearly visible in the fossil record leading to modern humans, from Australopithecus to Neanderthals. They also wanted to find out how these skeletal proportions allowing bipedalism affect the risk of many musculoskeletal diseases such as arthritis of the knee and hip — conditions that affect billions of people in the world and are the leading causes of adult disability in the United States.

The researchers used deep learning models to perform automatic quantification on 39,000 medical images to measure distances between shoulders, knees, ankles and other points in the body. By comparing these measurements to each person’s genetic sequence, they found 145 points in the genome that control skeletal proportions.

The computer vision model called HRNet performed the image classification. The researchers used two rounds of transfer learning, where knowledge learned from a completing a task was reincorporated to boost model performance. The scientists validated their approach on images taken of the same person more than two years apart and found that the correlation in their measurements was greater than 99 percent accurate.

All analysis was performed on the UKBiobank dataset, which has paired imaging, genetic and lifetime electronic healthcare records on 30,000 anonymized individuals. Researchers used deep learning models for performing quality control and for measurements on the imaging data. To connect these measurements, they carried out genome-wide association studies that employ a linear mixed model to control for population structure amongst individuals in the dataset.

Scientists Rely on Supercomputing 

The main computational challenges were on the imaging side of the study, where the science team needed access to high performance GPUs for model training. Another major challenge was the genetic data, which is extremely large and accounts for millions of positions and tens of thousands of individuals.

Narasimhan was awarded an allocation on TACC’s Frontera, the fastest academic supercomputer in the U.S., which he used to meet these computational challenges.

“TACC has both offerings for large memory GPUs as well as a large number of high-performance CPUs,” Narasimhan said. “Genetic data is naively parallelized, meaning that TACC is ideally suited to allow us to run our association analysis independently on different parts of the data and stitch them back together at the end. We used tens of thousands of hours of compute to perform these calculations, and it would have been impossible without access to this resource.”

“Our work provides a road map connecting specific genes with skeletal lengths of different parts of the body, allowing developmental biologists to investigate these in a systematic way,” said Tarjinder (T.J.) Singh, the study’s co-author, and associate member at NYGC and assistant professor in the Columbia University Department of Psychiatry.

The team also examined how skeletal proportions associate with major musculoskeletal diseases and showed that individuals with a higher ratio of hip width to height were found to be more likely to develop osteoarthritis and pain in their hips. Similarly, people with higher ratios of femur (thigh bone) length to height were more likely to develop arthritis in their knees, knee pain and other knee problems. People with a higher ratio of torso length to height were more likely to develop back pain.

“These disorders develop from biomechanical stresses on the joints over a lifetime,” said Eucharist Kun, a UT Austin biochemistry graduate student and lead author on the paper. “Skeletal proportions affect everything from our gait to how we sit, and it makes sense that they are risk factors in these disorders.”

The results of their work also have implications for our understanding of evolution. The researchers noted that several genetic segments that controlled skeletal proportions overlapped more than expected with areas of the genome called human accelerated regions. These are sections of the genome shared by great apes and many vertebrates but are significantly diverged in humans. This provides genomic rationale for the divergence in our skeletal anatomy.

Said Narasimhan said: “In the past, research conducted in human evolution used extremely low sample sizes and involved manual measurement. Thus, the field was largely restricted to qualitative analysis. The integration of genetic information has created a revolution in the field allowing for precise statistical estimates to be produced from the ever-expanding amount of data available. Genetic data is inherently large in size and methods to analyze them have gone hand in hand with technological development to obtain such information. These methods have always been employed on supercomputing environments which allow for distributed computing and efficient I/O.”

In addition to Kun and Narasimhan, the co-authors are Tarjinder Singh of the New York Genome Center and Columbia University; Emily M. Javan, Olivia Smith, Javier de la Fuente, Brianna I. Flynn, Kushal Vajrala, Zoe Trutner, Prakash Jayakumar and Elliot M. Tucker-Drob of UT Austin; Faris Gulamali of Icahn School of Medicine at Mount Sinai; and Mashaal Sohail of Universidad Nacional Autonoma de Mexico. The research was funded by the Allen Institute, Good Systems at UT Austin and the National Institutes of Health, with graduate student fellowship support provided by the National Science Foundation and UT Austin’s provost’s office.

This article was adapted from a news release by Esther Robards-Forbes, College of Natural Sciences, UT Austin.


Source: Jorge Salazar, TACC

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Watsonx Brings AI Visibility to Banking Systems

September 21, 2023

A new set of AI-based code conversion tools is available with IBM watsonx. Before introducing the new "watsonx," let's talk about the previous generation Watson, perhaps better known as "Jeopardy!-Watson." The origi Read more…

Researchers Advance Topological Superconductors for Quantum Computing

September 21, 2023

Quantum computers process information using quantum bits, or qubits, based on fragile, short-lived quantum mechanical states. To make qubits robust and tailor them for applications, researchers from the Department of Ene Read more…

Fortran: Still Compiling After All These Years

September 20, 2023

A recent article appearing in EDN (Electrical Design News) points out that on this day, September 20, 1954, the first Fortran program ran on a mainframe computer. Originally developed by IBM, Fortran (or FORmula TRANslat Read more…

Intel’s Gelsinger Lays Out Vision and Map at Innovation 2023 Conference

September 20, 2023

Intel’s sprawling, optimistic vision for the future was on full display yesterday in CEO Pat Gelsinger’s opening keynote at the Intel Innovation 2023 conference being held in San Jose. While technical details were sc Read more…

Intel Showcases “AI Everywhere” Strategy in MLPerf Inferencing v3.1

September 18, 2023

Intel used the latest MLPerf Inference (version 3.1) results as a platform to reinforce its developing “AI Everywhere” vision, which rests upon 4th gen Xeon CPUs and Gaudi2 (Habana) accelerators. Both fared well on t Read more…

AWS Solution Channel

Shutterstock 1679562793

How Maxar Builds Short Duration ‘Bursty’ HPC Workloads on AWS at Scale

Introduction

High performance computing (HPC) has been key to solving the most complex problems in every industry and has been steadily changing the way we work and live. Read more…

QCT Solution Channel

QCT and Intel Codeveloped QCT DevCloud Program to Jumpstart HPC and AI Development

Organizations and developers face a variety of issues in developing and testing HPC and AI applications. Challenges they face can range from simply having access to a wide variety of hardware, frameworks, and toolkits to time spent on installation, development, testing, and troubleshooting which can lead to increases in cost. Read more…

Survey: Majority of US Workers Are Already Using Generative AI Tools, But Company Policies Trail Behind

September 18, 2023

A new survey from the Conference Board indicates that More than half of US employees are already using generative AI tools, at least occasionally, to accomplish work-related tasks. Yet some three-quarters of companies st Read more…

Watsonx Brings AI Visibility to Banking Systems

September 21, 2023

A new set of AI-based code conversion tools is available with IBM watsonx. Before introducing the new "watsonx," let's talk about the previous generation Watson Read more…

Intel’s Gelsinger Lays Out Vision and Map at Innovation 2023 Conference

September 20, 2023

Intel’s sprawling, optimistic vision for the future was on full display yesterday in CEO Pat Gelsinger’s opening keynote at the Intel Innovation 2023 confer Read more…

Intel Showcases “AI Everywhere” Strategy in MLPerf Inferencing v3.1

September 18, 2023

Intel used the latest MLPerf Inference (version 3.1) results as a platform to reinforce its developing “AI Everywhere” vision, which rests upon 4th gen Xeon Read more…

China’s Quiet Journey into Exascale Computing

September 17, 2023

As reported in the South China Morning Post HPC pioneer Jack Dongarra mentioned the lack of benchmarks from recent HPC systems built by China. “It’s a we Read more…

Nvidia Releasing Open-Source Optimized Tensor RT-LLM Runtime with Commercial Foundational AI Models to Follow Later This Year

September 14, 2023

Nvidia's large-language models will become generally available later this year, the company confirmed. Organizations widely rely on Nvidia's graphics process Read more…

MLPerf Releases Latest Inference Results and New Storage Benchmark

September 13, 2023

MLCommons this week issued the results of its latest MLPerf Inference (v3.1) benchmark exercise. Nvidia was again the top performing accelerator, but Intel (Xeo Read more…

Need Some H100 GPUs? Nvidia is Listening

September 12, 2023

During a recent earnings call, Tesla CEO Elon Musk, the world's richest man, summed up the shortage of Nvidia enterprise GPUs in a few sentences.  "We're us Read more…

Intel Getting Squeezed and Benefiting from Nvidia GPU Shortages

September 10, 2023

The shortage of Nvidia's GPUs has customers searching for scrap heap to kickstart makeshift AI projects, and Intel is benefitting from it. Customers seeking qui Read more…

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

Leading Solution Providers

Contributors

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

ISC 2023 Booth Videos

Cornelis Networks @ ISC23
Dell Technologies @ ISC23
Intel @ ISC23
Lenovo @ ISC23
Microsoft @ ISC23
ISC23 Playlist
  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire