The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
September 29, 2006
Since human immunodeficiency virus (HIV) -- the virus that causes AIDS -- was first isolated in 1981, 25 million people have died of the disease. Worldwide it is estimated that there are currently more than 65 million adults and children infected with the virus. Nearly a quarter of a century on from identifying the virus we now have effective treatments. But they're expensive and, if doses are missed, the effects can be catastrophic. HIV, in short, continues to be one of the major threats to global health.
High performance computing (HPC) is playing a key role in coming up with an HIV vaccine. Even within a single infected patient the rapidly evolving HIV virus creates an enormous amount of data for potential analysis. This kind of statistical analysis requires randomization testing on an enormous scale, which in turn needs a computationally intensive approach to shorten the timescale between potentially ground-breaking medical observations and the arrival of effective treatments.
For the past three years, David Heckerman, senior researcher, and Carl Kadie, research software development engineer, both at Microsoft Research, have been applying their backgrounds in machine learning to the challenge of using high performance computing to pursue the ultimate AIDS vaccine design.
HPCwire: What are the primary goals of your work on HIV vaccine design?
Heckerman: Vaccine design has two key aims. We're working on the payload or what is called the "immunogen" of the vaccine -- the substance that provokes an immune response. The other important goal of vaccine design is the means of delivering the vaccine, or what is called the "vector."
The tricky part in designing the immunogen is the fact that the HIV virus mutates so rapidly. So pernicious is the virus that it contains an in-built protein that deliberately makes mistakes when it is replicated. By mutating rapidly, the virus is able to constantly escape attacks by our immune system.
Our plan is to identify the potential escape routes of the virus and to prime our immune systems to target the virus, regardless of which exit route it chooses.
HPCwire: Why is HPC particularly important when it comes to finding a cure for HIV?
Kadie: When HIV infects a new person it actually changes to throw off the mutations it developed in the previous person. The way it changes depends on the type of immune system you have. There are hundreds of different types of immune systems, leading to many different types of HIV. There are certain patterns of mutation that are repeating themselves across the human population around the world that are not obvious to the human eye -- thousands of human sequences to look at. Much of our work involves building simulations of how the virus mutates in response to attacks by the immune system. Our most accurate simulations are extremely computationally intensive, requiring HPC.
HPCwire: During his Supercomputing 2005 keynote, Bill Gates addressed the transformation resulting from the availability of massive amounts of real-world data from low-cost sensors. What opportunities and challenges does this scenario present in your work?
Heckerman: In our case, the real world sensors are the countless health workers scattered throughout the world -- especially in third-world areas where HIV is endemic -- who are monitoring the health of the local inhabitants, taking blood samples for experimentation purposes, and offering treatment whenever possible.
HPCwire: What kind of HPC cluster do you have in place to do your work?
Kadie: The simulations require massive amounts of computation -- sometimes as much as a CPU year of computation for a single run. Using Windows Compute Cluster Server 2003, we have racked up dozens of CPU years of results that are helping us further our understanding of the way HIV interacts with our immune system.
The HPC cluster is based on 25 IBM eServer 326 boxes, with two AMD Opteron processors per machine running at 2.6GHz. Of the eight new research programs we've got under way, six use .NET (C# and C++/CLI), one is in "R" and one in native C++.
HPCwire: In order to achieve the next big breakthrough, what advanced computing capabilities are on your wish list?
Kadie: We need to be able to grab more cycles from more computers in as painless a way possible. One of the key roadblocks here is trust. When moving beyond clusters to grids, we can no longer trust the machines on which we run our simulations, and the owners of those machines can't trust our software. A solution that makes computation seamless in this environment is critical.
HPCwire: How do you think your HIV vaccine design work will have progressed in five years time?
Heckerman: We're currently moving from using HPC to simulate the reactions to exposure to HIV in test tubes, to animal studies, particularly mice. Five years down the line we hope to be in the human phase, innoculating humans with an HIV vaccine simulating the -- hopefully effective -- response of their immune systems.
Our statistical analyses could also prove of potential benefit in looking at other viral infections where you see a lot of mutations. Hepatitis C, is an example where we might also be able to come up with an effective vaccine with the help of our HPC-based approach.
HPCwire: Taking a mile high view, what kind of the impact do you think your current work with HPC and computational statistics will have on the medical profession in general?
Heckerman: Experts are pointing to "personalized medicine" as the next breatkthrough in medical care. In the current medical paradigm, the patient becomes aware of certain symptoms and goes to the doctor. He or she is then prescribed treatments taught in medical schools -- treatments almost invariably based on effectiveness in the general population rather than effectiveness in the individual patient. While such treatments often work, sometimes they don't and can have side effects that cause as many problems as they solve.
Personalized medicine, in contrast, involves assessing an individual's genetic predisposition to a particular disease or treatments. With such information, diseases could be treated before exacting enormous personal and social costs, and treatments that cause harmful side effects in a particular individual could be avoided. The HPC methods that we have been applying to the study of HIV can be used to tease out the relationship between genes, disease, and treatment and thus provide a basis for personalized medicine.
-----
Carl Kadie David Heckerman
(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Mar 19 | OfficialWire | New super to support intelligence work Down Under. Read more...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html