July 04, 2012
XSEDE Resource Provides Open-Access Phylogenetic Supercomputing
SAN DIEGO, July 3 -- A new Web resource developed at the San Diego Supercomputer Center (SDSC) at the University of California, San Diego is helping thousands of researchers worldwide unravel the enigmas of phylogenetics, the study of evolutionary relationships among virtually every species on the planet.
The CIPRES Science Gateway (CIPRES stands for Cyber Infrastructure for Phylogenetic RESearch), created by SDSC researchers, allows researchers to complete these studies in significantly less time, without having to understand how to operate complex computers. Scientists anywhere in the world can upload their data via a Web browser, free of charge, under a grant provided by the National Science Foundation (NSF).
CIPRES is part of the NSF’s Extreme Science and Engineering Discovery Environment (XSEDE), specifically the XSEDE Science Gateway initiative, which is designed to provide scientists with broad and easy access to supercomputers.
Researchers say the gateway, and access to powerful supercomputers, are helping to answer increasingly sophisticated phylogenetic questions.
“The CIPRES Science Gateway makes it possible for researchers to make use of all this new information more quickly and effectively,” said Mark Miller, principal investigator of the CIPRES Gateway. “Our team is excited to have supported more than 300 publications of phylogenetic studies involving species in every branch of the Tree of Life.”
“It’s an important additional step in the conduct of science,” said Peter Nelson, a graduate student in the Department of Botany & Plant Pathology at Oregon State University in Corvallis. “This is a new opportunity for people who don’t yet have grant money, but who want to do meaningful research – and you don’t have to leave your computer.”
Nelson, a theorist in botany, is trying to understand the evolutionary processes that may operate one way in genetically homogeneous communities, but in a different way in more genetically diverse communities. He studies the divergence of tree species in North America. “We use GenBank and other sequence databases to gather the data, and free software is available to edit the sequences,” he said. “But the process is so computationally intensive I could never have accomplished it on a personal computer.”
Shedding new light on origins
All life forms, from simple bacteria to primates and plants, descended from a single common ancestor. A diagram of all the evolutionary relationships looks like a highly branched tree with the common ancestor at the base of the trunk, and extinct and living groups forming the branches. All living species are represented by leaves at the tips of the outermost limbs. This Tree of Life, like evolution itself, is not static; rather, the branching process continues today as groups of individuals in single species, such as the Eastern Meadowlark, appear to be splitting into two because of long-term geographical or environmental factors.
The phylogenetic history of each living species is contained in its DNA, and SDSC’s CIPRES Gateway is helping scientists analyze all the evolutionary relationships by making it possible for them to compare similarities and differences in the DNA among large numbers of species.
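As a rough illustration of what comparing DNA among species can mean at its simplest, the sketch below computes the fraction of aligned sites at which two sequences differ (often called the p-distance). It is a toy example only: the sequences and species names are invented, the function names are hypothetical, and it does not represent the analyses or software run through the CIPRES Gateway, which work on far larger data sets.

```python
from itertools import combinations


def p_distance(seq_a: str, seq_b: str) -> float:
    """Return the fraction of comparable aligned sites at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore sites where either sequence has an alignment gap ("-").
    sites = [
        (a, b)
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != "-" and b != "-"
    ]
    if not sites:
        raise ValueError("no comparable sites")
    return sum(a != b for a, b in sites) / len(sites)


def distance_matrix(alignment: dict) -> dict:
    """Compute the p-distance for every unordered pair of named sequences."""
    return {
        (x, y): p_distance(alignment[x], alignment[y])
        for x, y in combinations(sorted(alignment), 2)
    }


# Invented, pre-aligned sequences used purely for illustration.
alignment = {
    "species_a": "ACGTACGTAC",
    "species_b": "ACGTACGTAA",
    "species_c": "ACGAACTTAA",
}

for pair, dist in distance_matrix(alignment).items():
    print(pair, round(dist, 2))
```

Even this simple pairwise comparison grows quadratically with the number of species, which hints at why analyses spanning thousands of sequences benefit from supercomputing resources.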
Phylogenetics is essential to understanding not only the history of life on earth, but also how populations of flowering plants, insects, crustaceans, fish, fungi and microorganisms slowly change in response to their surroundings.
Such studies can also shed new light on how and where lineages began, sometimes challenging long-accepted theories. Researchers, for example, are using the CIPRES Gateway to clarify the evolution of wild grapes, with analyses that, University of Florida Botany Professor J. Richard Abbott wrote, “indicate that American lineages could be older than Asian.” Abbott and his co-authors published the controversial finding in the February 2012 issue of Molecular Phylogenetics and Evolution.
In another project, Andrew F. Hugall and Devi Stuart-Fox, researchers in the Department of Zoology at the University of Melbourne in Australia, used the CIPRES Gateway to provide the first phylogenetic analysis supporting an evolutionary theory that new species of birds are generated faster when the ancestral species exhibits color variations in its feathers.
Hugall and Stuart-Fox reported in the May 9, 2012, issue of Nature that speciation rates were almost three times higher for so-called color polymorphic species of birds of prey than similar monomorphic bird species. As the prevalence of feather-color polymorphism falls, so too does the rate of speciation.
The discipline of phylogenetic systematics combines taxonomy, or the description and naming of living species as well as fossilized life forms found in natural history museums, with modern phylogenetic studies. Systematic biologists combine a variety of sources of information, analyses and hypotheses to organize related groups of species, such as vertebrates, into clades and clades within clades. For example, the vertebrate clade is further subdivided into clades of amphibians, primates, rodents, and other groups of related species.
“Studies by systematic and evolutionary biologists have historically been limited by the number of available DNA sequences in public databases like GenBank,” said Miller. However, he added that modern DNA sequencing technologies generate data so quickly that analyzing all relevant data on conventional laptops can take weeks.
“There is a huge need in the community for easy access to computing resources,” said Miller. To meet that enthusiastic demand, Miller’s team at SDSC and their collaborators around the country continue to combine emerging techniques in computational biology with computer science.
As an Organized Research Unit of UC San Diego, SDSC is considered a leader in data-intensive computing and all aspects of ‘big data’, which includes data integration, performance modeling, data mining, software development, workflow automation, and more. SDSC supports hundreds of multidisciplinary programs spanning a wide variety of domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. With its two newest supercomputer systems, Trestles and Gordon, SDSC is a partner in XSEDE (Extreme Science and Engineering Discovery Environment), the most advanced collection of integrated digital resources and services in the world.
Source: San Diego Supercomputer Center