Comet Supercomputer Assists With Genomic Research

November 3, 2016

Nov. 3 — One of the most detailed genomic studies of any ecosystem to date has revealed an underground world of stunning microbial diversity, and added dozens of new branches to the tree of life.

The bacterial bonanza comes from scientists who reconstructed the genomes of more than 2,500 microbes from sediment and groundwater samples collected at an aquifer in Colorado. The effort was led by researchers from the Department of Energy’s Lawrence Berkeley National Laboratory (Berkeley Lab) and UC Berkeley. DNA sequencing was performed at the Joint Genome Institute, a DOE Office of Science User Facility, and analyses were conducted with the aid of the CIPRES gateway and the Comet supercomputer, based at the San Diego Supercomputer Center (SDSC) at the University of California San Diego.

As reported online October 24 in the journal Nature Communications, the scientists netted genomes from 80 percent of all known bacterial phyla, a remarkable degree of biological diversity at one location. They also discovered 47 new phylum-level bacterial groups, naming many of them after influential microbiologists and other scientists. And they learned new insights about how microbial communities work together to drive processes that are critical to the planet’s climate and life everywhere, such as the carbon and nitrogen cycles.

These findings shed light on one of Earth’s most important and least understood realms of life. The subterranean world hosts up to one-fifth of all biomass, but it remains a mystery.

“We didn’t expect to find this incredible microbial diversity. But then again, we know little about the roles of subsurface microbes in biogeochemical processes, and more broadly, we don’t really know what’s down there,” says Jill Banfield, a Senior Faculty Scientist in Berkeley Lab’s Climate & Ecosystem Sciences Division and a UC Berkeley professor in the departments of Earth and Planetary Science, and Environmental Science, Policy, and Management.

Added UC Berkeley’s Karthik Anantharaman, the first author of the paper: “To better understand what subsurface microbes are up to, our approach is to access their entire genomes. This enabled us to discover a greater interdependency among microbes than we’ve seen before.”

The research is part of a Berkeley Lab-led project called Watershed Function Scientific Focus Area (formerly Sustainable Systems Scientific Focus Area 2.0). The project is developing a predictive understanding of terrestrial environments from the genome to the watershed scale. The field research takes place at a research site near the town of Rifle, Colorado, where for the past several years scientists have conducted experiments designed to stimulate populations of subterranean microbes that are naturally present in very low numbers.

The scientists sent soil and water samples from these experiments to the Joint Genome Institute for terabase-scale metagenomic sequencing. This high-throughput method isolates and purifies DNA from environmental samples, and then sequences one trillion base pairs of DNA at a time. Next, the scientists used bioinformatics tools developed in Banfield’s lab along with those from the CIPRES Science Gateway to analyze the data.

“The CIPRES Science Gateway and the Comet supercomputer were instrumental to our work,” Banfield said. “Considering the unprecedented size of our sequence datasets, we were unable to complete any runs for inferring trees on other servers.” The CIPRES Science Gateway and Comet are available through the Extreme Science and Engineering Discovery Environment (XSEDE). Supported by the National Science Foundation, XSEDE is the most advanced, powerful, and robust collection of integrated advanced digital resources and services in the world.

The scientists’ approach has redrawn the tree of life. Between the 47 new bacterial groups reported in this work, and 35 new groups published last year (also found at the Rifle site), Banfield’s team has doubled the number of known bacterial groups.

With discovery comes naming rights. The scientists named many of the new bacteria groups after Berkeley Lab and UC Berkeley researchers. For example, there’s Candidatus Andersenbacteria, after phylochip inventor Gary Andersen, and there’s Candidatus Doudnabacteria, after CRISPR genome-editing pioneer Jennifer Doudna.

“Berkeley now dominates the tree of life as it does the periodic table,” Banfield says, in a nod to the sixteen elements discovered at Berkeley Lab and UC Berkeley.

Another big outcome is a deeper understanding of the roles subsurface microbes play in globally important carbon, hydrogen, nitrogen, and sulfur cycles. This information will help to better represent these cycles in predictive models such as climate simulations.

The scientists conducted metabolic analyses of 36 percent of the organisms detected in the aquifer system. They focused on a phenomenon called metabolic handoff, which essentially means one microbe’s waste is another microbe’s food. It’s known from lab studies that handoffs are needed in certain reactions, but these interconnected networks are widespread and vastly more complex in the real world.

To understand why it’s important to represent metabolic handoffs as accurately as possible in models, consider nitrate, a groundwater contaminant from fertilizers. Subsurface microbes are the primary driver in reducing nitrate to harmless nitrogen gas. There are four steps in this denitrification process, and the third step creates nitrous oxide—one of the most potent greenhouse gases. The process breaks down if microbes that carry out the fourth step are inactive when a pulse of nitrate enters the system.

“If microbes aren’t there to accept the nitrous oxide handoff, then the greenhouse gas escapes into the atmosphere,” says Anantharaman.

The scientists found the carbon, hydrogen, nitrogen, and sulfur cycles are all driven by metabolic handoffs that require an unexpectedly high degree of interdependence among microbes. The vast majority of microorganisms can’t fully reduce a compound on their own. It takes a team. There are also backup microbes ready to perform a handoff if first-string microbes are unavailable.

“The combination of high microbial diversity and interconnections through metabolic handoffs likely results in high ecosystem resilience,” says Banfield.

Other co-authors of the paper include Berkeley Lab’s Eoin Brodie, Susan Hubbard, Ulas Karaoz, and Kenneth Williams; and UC Berkeley’s Chris Brown, Cindy Castelle, Laura Hug, Alexander Probst, Itai Sharon, Andrea Singh, and Brian Thomas. The research is supported by the Department of Energy’s Office of Science.

About SDSC

As an Organized Research Unit of UC San Diego, SDSC is considered a leader in data-intensive computing and cyberinfrastructure, providing resources, services, and expertise to the national research community, including industry and academia. Cyberinfrastructure refers to an accessible, integrated network of computer-based resources and expertise, focused on accelerating scientific inquiry and discovery. SDSC supports hundreds of multidisciplinary programs spanning a wide variety of domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. SDSC’s Comet joins the Center’s data-intensive Gordon cluster, and are both part of the National Science Foundation’s XSEDE (Extreme Science and Engineering Discovery Environment) program.

About LBNL

Lawrence Berkeley National Laboratory addresses the world’s most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab’s scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the U.S. Department of Energy’s Office of Science. For more, visit www.lbl.gov.


Source: SDSC

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

NREL ‘Eagle’ Supercomputer to Advance Energy Tech R&D

August 14, 2018

The U.S. Department of Energy (DOE) National Renewable Energy Laboratory (NREL) has contracted with Hewlett Packard Enterprise (HPE) for a new 8-petaflops (peak) supercomputer that will be used to advance early-stage R&a Read more…

By Tiffany Trader

Training Time Slashed for Deep Learning

August 14, 2018

Fast.ai, an organization offering free courses on deep learning, claimed a new speed record for training a popular image database using Nvidia GPUs running on public cloud infrastructure. A pair of researchers trained Read more…

By George Leopold

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learning. The CERN team demonstrated that AI-based models have the Read more…

By Rob Farber

HPE Extreme Performance Solutions

Introducing the First Integrated System Management Software for HPC Clusters from HPE

How do you manage your complex, growing cluster environments? Answer that big challenge with the new HPC cluster management solution: HPE Performance Cluster Manager. Read more…

IBM Accelerated Insights

Super Problem Solving

You might think that tackling the world’s toughest problems is a job only for superheroes, but at special places such as the Oak Ridge National Laboratory, supercomputers are the real heroes. Read more…

Rigetti Eyes Scaling with 128-Qubit Architecture

August 10, 2018

Rigetti Computing plans to build a 128-qubit quantum computer based on an equivalent quantum processor that leverages emerging hybrid computing algorithms used to test programs and potential applications. Founded in 2 Read more…

By George Leopold

NREL ‘Eagle’ Supercomputer to Advance Energy Tech R&D

August 14, 2018

The U.S. Department of Energy (DOE) National Renewable Energy Laboratory (NREL) has contracted with Hewlett Packard Enterprise (HPE) for a new 8-petaflops (peak Read more…

By Tiffany Trader

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learni Read more…

By Rob Farber

Intel Announces Cooper Lake, Advances AI Strategy

August 9, 2018

Intel's chief datacenter exec Navin Shenoy kicked off the company's Data-Centric Innovation Summit Wednesday, the day-long program devoted to Intel's datacenter Read more…

By Tiffany Trader

SLATE Update: Making Math Libraries Exascale-ready

August 9, 2018

Practically-speaking, achieving exascale computing requires enabling HPC software to effectively use accelerators – mostly GPUs at present – and that remain Read more…

By John Russell

Summertime in Washington: Some Unexpected Advanced Computing News

August 8, 2018

Summertime in Washington DC is known for its heat and humidity. That is why most people get away to either the mountains or the seashore and things slow down. H Read more…

By Alex R. Larzelere

NSF Invests $15 Million in Quantum STAQ

August 7, 2018

Quantum computing development is in full ascent as global backers aim to transcend the limitations of classical computing by leveraging the magical-seeming prop Read more…

By Tiffany Trader

By the Numbers: Cray Would Like Exascale to Be the Icing on the Cake

August 1, 2018

On its earnings call held for investors yesterday, Cray gave an accounting for its latest quarterly financials, offered future guidance and provided an update o Read more…

By Tiffany Trader

Google is First Partner in NIH’s STRIDES Effort to Speed Discovery in the Cloud

July 31, 2018

The National Institutes of Health, with the help of Google, last week launched STRIDES - Science and Technology Research Infrastructure for Discovery, Experimen Read more…

By John Russell

Leading Solution Providers

SC17 Booth Video Tours Playlist

Altair @ SC17

Altair

AMD @ SC17

AMD

ASRock Rack @ SC17

ASRock Rack

CEJN @ SC17

CEJN

DDN Storage @ SC17

DDN Storage

Huawei @ SC17

Huawei

IBM @ SC17

IBM

IBM Power Systems @ SC17

IBM Power Systems

Intel @ SC17

Intel

Lenovo @ SC17

Lenovo

Mellanox Technologies @ SC17

Mellanox Technologies

Microsoft @ SC17

Microsoft

Penguin Computing @ SC17

Penguin Computing

Pure Storage @ SC17

Pure Storage

Supericro @ SC17

Supericro

Tyan @ SC17

Tyan

Univa @ SC17

Univa

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This