June 22, 2011
HAMBURG, Germany – June 22, 2011 [International Supercomputing Conference Stand 850] – Fusion-io (NYSE: FIO), provider of a next-generation shared data decentralization platform, today announced that its technology has been used to achieve significant performance improvements in MySQL database queries for bioinformatics research. The Protein Data Bank (PDB) research is being conducted at the University of California, San Diego, in collaboration with the San Diego Supercomputer Center (SDSC). Researchers at SDSC noted that replacing hard disk drives (HDDs) with Fusion-io technology in their database infrastructure reduced query times from 30 minutes to three minutes.
Fusion’s ioMemory technology is being used by the SDSC, an organized research unit of UC San Diego, in its data-intensive computing initiatives. As part of these efforts, researchers are applying the technology to advance research on the Protein Data Bank, the world’s most comprehensive repository of three-dimensional structures of large biological molecules, including proteins and nucleic acids. The research carried out for the PDB serves as a foundation for accelerating the development of life-saving drugs, synthetic proteins used in medicine, and other treatments for illnesses.
“As SDSC strives to serve as a center for research excellence, we focus heavily on evaluating how flash-based technologies can improve scientific discovery,” said Allan Snavely, Associate Director of SDSC and co-Principal Investigator of the Center’s upcoming data-intensive high-performance computing (HPC) system, Gordon, which is currently in development. “Through our research, we are finding that storage memory products like those offered by Fusion-io are considerably faster than hard disk drives for many large-memory and data-intensive problems, and that flash-based storage memory has the potential to make supercomputing-level results accessible outside the traditional user base.”
Using Fusion’s server-deployed technologies, researchers were able to reduce the time required to run complex MySQL database queries that determine relationships between proteins and the effects proteins can have on one another. Using traditional hard disk drives, a query analyzing more than 200 million protein structures took 30 minutes.
“With Fusion’s shared data decentralization technologies – which place the storage memory medium right next to the processor to significantly reduce latency and increase processing speed – the same query took just three minutes,” said Robert Sinkovits, SDSC computational scientist and Gordon Applications Lead.
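The comparison described above amounts to running the same query against the same data on each storage back end and timing it. The Python sketch below illustrates one way to build such a timing harness; the connection parameters, table names, and query are illustrative placeholders, not the actual PDB schema used at SDSC.

```python
import time
import mysql.connector  # pip install mysql-connector-python

# Placeholder connection parameters -- not SDSC's actual database setup.
conn = mysql.connector.connect(
    host="localhost", user="pdb_user", password="secret", database="pdb"
)
cursor = conn.cursor()

# A hypothetical pairwise-comparison query standing in for the PDB workload.
query = """
    SELECT a.structure_id, b.structure_id, c.score
    FROM comparisons c
    JOIN structures a ON c.structure_a = a.structure_id
    JOIN structures b ON c.structure_b = b.structure_id
    WHERE c.score > 0.9
"""

# Time the query end to end, including fetching the result set.
start = time.perf_counter()
cursor.execute(query)
rows = cursor.fetchall()
elapsed = time.perf_counter() - start

print(f"Fetched {len(rows)} rows in {elapsed:.1f} s")

cursor.close()
conn.close()
```

Running the same harness once against an HDD-backed database and once against a flash-backed one isolates the contribution of the storage medium to overall query latency.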
Spencer Bliven, a graduate student in the Bioinformatics and Systems Biology Department at UC San Diego working with Allan Snavely on the research, said, “The PDB is helping us gain a deeper theoretical and practical understanding of the foundations of biological science. With the reduced query times I’ve seen in conducting tests with Fusion’s ioMemory technology, I’m able to stay more focused on my research. This allows the scientific discovery process to be more spontaneous and greatly helps me in eliminating potential errors.”
The system used to achieve the performance gains contains two quad-core Intel Xeon E5530 2.40 GHz processors and 48 GB of DDR3-1066 memory, along with four 320 GB ioDrives configured as a single 1.2 TB RAID 0 device running an XFS file system.
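A striped volume of that kind can be assembled with standard Linux software RAID tools. The sketch below, written as a Python wrapper around those commands, shows one possible way to combine four cards into a single XFS-formatted device; the device names, mount point, and use of mdadm are assumptions rather than SDSC's documented procedure, and the commands require root privileges.

```python
import subprocess

# Assumed device names for four ioDrives; adjust to the devices actually present.
devices = ["/dev/fioa", "/dev/fiob", "/dev/fioc", "/dev/fiod"]

# Stripe the four cards into a single RAID 0 array (~1.2 TB with 320 GB devices).
subprocess.run(
    ["mdadm", "--create", "/dev/md0", "--level=0",
     f"--raid-devices={len(devices)}", *devices],
    check=True,
)

# Create an XFS file system on the array and mount it.
subprocess.run(["mkfs.xfs", "/dev/md0"], check=True)
subprocess.run(["mkdir", "-p", "/mnt/iodrive"], check=True)
subprocess.run(["mount", "/dev/md0", "/mnt/iodrive"], check=True)
```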
“Storing crucial data in the server where it is processed can greatly accelerate the analysis of large data sets to enable new scientific discoveries,” said Neil Carson, Chief Technology Officer of Fusion-io. “It’s inspiring to see how SDSC is leveraging the performance gains and reduced latency made possible by Fusion-io technology to advance complex biological research that serves as the foundation for many improvements in science and medicine.”
Fusion-io has pioneered a next-generation storage memory platform for shared data decentralization that significantly improves processing capabilities within a datacenter by relocating process-critical, or "active", data from centralized storage to the server where it is being processed, a methodology referred to as data decentralization. Fusion’s integrated hardware and software solutions leverage non-volatile memory to significantly increase datacenter efficiency and offer enterprise-grade performance, reliability, availability and manageability. Fusion’s data decentralization platform can transform legacy architectures into next-generation datacenters and allows enterprises to consolidate or significantly reduce complex and expensive high-performance storage, high-performance networking and memory-rich servers. Fusion’s platform enables enterprises to increase the utilization, performance and efficiency of their datacenter resources and extract greater value from their information assets.