July 05, 2012
PITTSBURGH, PA, July 3 -- A training program of the National Resource for Biomedical Supercomputing (NRBSC) at the Pittsburgh Supercomputing Center (PSC) has taken a unique pro-active role toward filling the gap in scientific training at minority-serving institutions (MSIs). Since 2001, with funding from NIH’s National Institute of General Medical Sciences, PSC’s MARC (Minority Access to Research Careers) program has evolved from providing individual training in what was at first a newly emerging discipline, bioinformatics, to a focus on the development of curricula and research programs at partner universities.
“We’ve implemented a multi-disciplinary course in sequence-based bioinformatics at more than 10 universities,” says PSC scientist Hugh Nicholas, who directs the MARC program. As of 2011, with renewal of the program from NIH for five years, the effort of Nicholas and his PSC colleagues, Troy Wymore and Alex Ropelewski, in working with five partner MSIs — North Carolina AT & T; University of Puerto Rico, Mayaguez; Johnson C. Smith University; Tennessee State University; and Jackson State University — is to build a concentration or minor in bioinformatics. This expanded outreach reflects dramatic changes in bioinformatics, a field that over the last few years has exploded, because of powerful new sequencing technologies, with data to be analyzed and opportunities to do important research.
“The program has shifted direction,” says Ricardo González, a professor at the University of Puerto Rico, School of Medicine, and co-principal investigator with Nicholas in PSC’s MARC program. “We’ve become good enough to establish bioinformatics programs or tracks at these universities and to provide a solid foundation for their faculty and students to carry out research in this field.”
Along with workshops at PSC, the MARC staff travels to partner MSIs to offer intensive on-site workshops. The program also provides a model curriculum for bioinformatics, with course materials in related aspects of biology, computational science and mathematics, and offers teaching assistance for newly established courses. “At each campus with which we’ve partnered,” says Nicholas, “we’ve trained people who are now capable of teaching a basic bioinformatics course.”
Many peer-reviewed papers have already resulted from PSC’s program, notes González, work that is especially important, he observes, in light of a 2011 NIH study (Science, August 19, 2011) finding that black scientists were a third less likely than white counterparts to get a research project funded. “The PSC program has been addressing the uneven playing field that affects black researchers for 10 years.”
González and faculty liaisons from the partner MSIs gathered at PSC on June 28 to assess progress as PSC’s program concluded its 2012 Summer Institute. This two-week workshop, which PSC has hosted since 2003, trains graduate students, who can use bioinformatics tools to complete dissertation research, and faculty who plan to establish an introductory bioinformatics course at their home institution. This year 17 faculty and graduate students from eight universities attended.
In addition to the Summer Institute, the program also offers a seven-week research internship on-site at PSC, with 10 participants this year. These internships build connections for young scientists with resources at two major research universities, Carnegie Mellon and the University of Pittsburgh, and have often led to published research — including a project in sequencing of bacteriophage genes by one of the program’s first interns, from Howard University: http://www.psc.edu/science/2004/dobbins/
Through the MARC program, partner MSIs have access to PSC resources, including BioU, a computing cluster used for bioinformatics teaching, and PSC’s Blacklight an SGI® Altix® UV1000 system that is the largest shared-memory systems in the world, and is a resource of the NSF’s XSEDE (Extreme Science and Engineering Discovery Environment) cyberinfrastructure program.
XSEDE access is especially useful in bioinformatics as “next generation” sequencing instruments produce skyrocketing quantities of genomics data and make possible many new avenues of research. Tools such as Penn State’s Galaxy — an open, web-based platform used for data intensive bioinformatics workflows — help to reduce the steep learning curve. “Galaxy eliminates a lot of the complexity of getting started in this field,” says Nicholas. “It has become the preferred way to perform bioinformatics analyses for both research and teaching.”
PSC has recently added a high-speed network link to the main Galaxy portal at Penn State. See:http://psc.edu/index.php/newscenter/71-2012press/670-pittsburgh-supercomputing-center-provides-direct-link-from-galaxy-to-the-xsede-backbone
PSC’s workshops in the techniques of sequence analysis began in 1989 in response to the development of computing technologies as powerful tools for biological research. “The traditional manner of biological training has been laboratory focused,” notes Nicholas, an expert in genetic sequence analysis, “and it’s a distinct shift in training to incorporate computational techniques into the biologist’s repertoire.”
Having emerged over the last decade as a major thrust of the life sciences, bioinformatics marries information science and statistics with biology. “Computational tools,” says Nicholas, “allow researchers to understand biological systems like never before. But researchers need to learn about these new and rapidly improving tools.”
The MARC program sprang from a collaboration with the University of Puerto Rico School of Medicine, after González took a series of PSC workshops in the late 1990s. He invited Nicholas and other PSC scientists to provide their training on-site to his colleagues in San Juan and encouraged the PSC scientists to seek funding under NIH’s MARC initiative.
“This is one of the longest funded training grants in NIH,” says González, “and it is strong because of PSC’s ability to build bridges with scientists at historically black universities and MSIs.” Among other MARC programs, PSC’s is unique at being situated external to an MSI and in its focus on building and sustaining curricula and research programs at partner MSIs.
The current advisory committee for PSC’s MARC program: Charles L. Brooks III, University of Michigan; Dannie Durand, Carnegie Mellon University; Daniel M. Gusfield, University of California; Michael Gribskov, Purdue University, and Stephen B. Aley, University of Texas at El Paso.
About PSC’s MARC program:
http://www.nrbsc.org/education/workshops/marc_summer_institute_2012/
About NRBSC: http://www.nrbsc.org
About PSC
The Pittsburgh Supercomputing Center is a joint effort of Carnegie Mellon University and the University of Pittsburgh together with Westinghouse Electric Company. Established in 1986, PSC is supported by several federal agencies, the Commonwealth of Pennsylvania and private industry, and is a service provider in the National Science Foundation XSEDE program.
-----
Source: Pittsburgh Supercomputing Center
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
Read more...
Although Horst Simon was named Deputy Director of Lawrence Berkeley National Laboratory, he maintains his strong ties to the scientific computing community as an editor of the TOP500 list and as an invited speaker at conferences.
Read more...
Supercomputing veteran, Bo Ewald, has been neck-deep in bleeding edge system development since his twelve-year stint at Cray Research back in the mid-1980s, which was followed by his tenure at large organizations like SGI and startups, including Scale Eight Corporation and Linux Networx. He has put his weight behind quantum company....
Read more...
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...
May 09, 2013 |
The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...
May 08, 2013 |
For engineers looking to leverage high-performance computing, the accessibility of a cloud-based approach is a powerful draw, but there are costs that may not be readily apparent.
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.