The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
April 28, 2006
Purdue Libraries and Information Technology at Purdue (ITaP) are collaborating on an initiative that includes innovative software developed on campus to help researchers store, sort, archive, retrieve and manage large-scale data and information.
The Distributed Institutional Repository is a Web-based data portal that provides tools and systems to manipulate large data sets and to help users understand the origins of data and learn about additional research applications using the same data.
"The DIR is an architecture Purdue Libraries developed which utilizes a unique approach to pulling access together from a number of distributed repositories," said Scott Brandt, professor of library science and associate dean for research for Purdue Libraries.
Most institutional repositories use a single system within a single software environment. In addition to data repositories built by ITaP and the Libraries, the DIR will access a wide variety of information systems, including electronic dissertations, e-prints and archival special collections.
Purdue Libraries' role in the repository is to define the organizational structure of, access to and retrieval of data deposited and archived from three broad categories: traditional sources, such as journals and books; current digital sources, such as Web pages, digital video and electronic documents; and future sources, such as remote sensing data and large-scale visualization and perceptualization.
"The Libraries have always supported research through building and organizing collections," Brandt said. "With a distributed repository, we hope to enhance discovery and use of data across campus."
An institutional repository is necessary to meet the growing need to house massive amounts of data and to make that data usable across the academic environment, Brandt said.
"Projects like this are another example of how we are bridging discovery and learning in non-traditional ways," said Krishna Madhaven, research scientist and project leader with ITaP's Rosen Center for Advanced Computing. "We're starting to see the big picture of how large data computation, learning, research and network are coming together."
The ITaP data repository is a multi-disciplinary data management system that was built on top of Storage Resource Broker (SRB) and deployed on the TeraGrid network, said Lan Zhao, senior analyst and programmer in the Rosen Center. She also developed the SRB-based data management architecture that is connected to the Distributed Institutional Repository.
SRB is open-source data-management middleware developed by the San Diego Supercomputer Center.