November 10, 2006
If the Industrial Age relied on ore, the Digital Age relies on storage.
None of our now-necessary devices, from the most fearsome research-computing arrays to run-of-the-mill office computers to cell-phones to iPods, can work without storage. That's why Richard Moore, director of Production Systems at the San Diego Supercomputer Center (SDSC) at UC San Diego, smiles as he ponders the new IBM tape drives being added to the storage "silos" in the center's already crowded computer room.
SDSC already has six storage silos, each of which holds about 6,000 tapes. With the new tape drives and media (IBM System Storage TS1120 tape drives with the new industry-leading 700-gigabyte tape media), Moore and his colleagues can now store 25 petabytes -- that's 25 million billion bytes -- an upgrade from SDSC's previously phenomenal storage capacity of six petabytes.
That will give SDSC and its host institution, the University of California San Diego, more storage capacity than any other educational institution in the world.
"As an institution which supports not only UC San Diego but also a national community of academic researchers, SDSC has an obligation to provide the increasing amounts of storage capacity these collaborations will require," said Moore. "This latest upgrade with added IBM technology helps us stay ahead of the projections -- all of which show an exponentially increasing demand for storage in the future."
For example, Moore said, SDSC is currently serving more than 10,000 researchers at 300 academic, government and industrial institutions in the United States and around the world. "Today, these scientists and engineers increasingly rely on the availability of globally accessible data and the associated cyberinfrastructure tools to drive research and education," he says.
And, he adds, SDSC intends to stay ahead of that curve.
The supercomputer center -- now in the midst of an ambitious expansion project on the sunny UC San Diego campus -- had an eye-glazing amount of capacity before the upgrade. Now, its storage capacity runs into numbers that are mind-boggling.
Moore uses several analogies to communicate the vast amounts of information SDSC's computers can store and make available. "For reference, the digital equivalent of all the printed materials in the Library of Congress is about 20,000 gigabytes; this represents less than 0.1 percent of our capacity," he says.
For students, he explains it this way: "If every high school student in the U.S. had a gigabyte of music on his or her iPod, all their music -- billions of songs -- could fit in our archive, although," he says with a smile, "I expect that there's a lot of redundancy in that data."
SDSC operates powerful high-end computing resources led by DataStar, a 15.6 teraflop IBM Power4+ supercomputer with total aggregate memory of 7.3 terabytes. DataStar is ranked among the top supercomputers in the world and is used for large-scale, data-intensive scientific research applications. In addition, SDSC was the first U.S. academic institution to install an IBM Blue Gene system, and will soon triple its size to 17.1 teraflops.
"SDSC is a prime example of the leadership that IBM continues to demonstrate in tape storage," said Kristie Bell, vice president, System Storage Marketing, IBM. "No other vendor can offer the end-to-end solutions that IBM is able to offer, and no one comes close to matching IBM in innovations such as tape virtualization and tape encryption."
SDSC also serves as the data-intensive site lead in the NSF-funded TeraGrid, a multiyear effort to build and deploy the world's first large-scale and production grid infrastructure for open scientific research. SDSC hosts a 4.4-teraflop IA-64 Linux cluster, as well as 220 terabytes of a global file system that is mounted across the TeraGrid computing systems. SDSC is connected to the other national TeraGrid partners by a 10-gigabit-per-second cross-country high-speed network.
This focus on data cyberinfrastructure, Moore said, "provides a broad and flexible array of integrated technologies to support increasingly challenging, large-scale and collaborative scientific endeavors."
Two examples he cites involve SDSC support for the Southern California Earthquake Center (SCEC) and the National Virtual Observatory (NVO) collection. For the former, supercomputing resources allow researchers to use massive amounts of geological and historical data to simulate, and better understand, major earthquakes -- and learn how to mitigate structural damage. For the latter, SDSC resources help the NVO combine over 100 terabytes of data from 50 ground- and space-based telescopes and instruments to create a comprehensive picture of the heavens.
"Our work with IBM to significantly upgrade our storage capacity is vital to a myriad such collaborations," said Moore. "We're confident that this project, and many others, will keep our institutions at the very forefront of data storage and cyberinfrastructure for years to come."
-----
Source: SDSC
Jun 18, 2013 |
The world's largest supercomputers, like Tianhe-2, are great at traditional, compute-intensive HPC workloads, such as simulating atomic decay or modeling tornados. But data-intensive applications--such as mining big data sets for connections--is a different sort of workload, and runs best on a different sort of computer.
Read more...
Jun 18, 2013 |
Researchers are finding innovative uses for Gordon, the 285 teraflop supercomputer housed at the San Diego Supercomputer Center (SDSC) that has a unique Flash-based storage system. Since going online, researchers have put the incredibly fast I/O to use on a wide variety of workloads, ranging from chemistry to political science.
Read more...
Jun 17, 2013 |
The advent of low-power mobile processors and cloud delivery models is changing the economics of computing. But just as an economy car is good at different things than a full size truck, an HPC workload still has certain computing demands that neither the fastest smartphone nor the most elastic cloud cluster can fulfill.
Read more...
Jun 14, 2013 |
For all the progress we've made in IT over the last 50 years, there's one area of life that has steadfastly eluded the grasp of computers: understanding human language. Now, researchers at the Texas Advanced Computing Center (TACC) are utilizing a Hadoop cluster on its Longhorn supercomputer to move the state of the art of language processing a little bit further.
Read more...
Jun 13, 2013 |
Titan, the Cray XK7 at the Oak Ridge National Lab that debuted last fall as the fastest supercomputer in the world with 17.59 petaflops of sustained computing power, will rely on its previous LINPACK test for the upcoming edition of the Top 500 list.
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
Join HPCwire Editor Nicole Hemsoth and Dr. David Bader from Georgia Tech as they take center stage on opening night at Atlanta's first Big Data Kick Off Week, filmed in front of a live audience. Nicole and David look at the evolution of HPC, today's big data challenges, discuss real world solutions, and reveal their predictions. Exactly what does the future holds for HPC?
Join our webinar to learn how IT managers can migrate to a more resilient, flexible and scalable solution that grows with the data center. Mellanox VMS is future-proof, efficient and brings significant CAPEX and OPEX savings. The VMS is available today.