The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
January 18, 2008
The advent of more powerful compute systems has increased the capacity to generate data at a fantastic rate. To solve associated issues of data management, a combination of grid technology and other storage components are currently being deployed. Many solutions have been designed to address these petabyte-scale data management problems, including new software, NAS/NFS products and parallel storage solutions from IBM, Panasas and others. This involves handling and storing very large data sets accessed simultaneously by thousands of compute clients.
Extreme single projects such as the Linear Hadron Collider (LHC) at CERN are producing 15 petabytes of data each year. Raw data are distributed to Tier-0 data centres and then, after an order of magnitude reduction, are passed on to Tier-1 data centres and so on. This activity involves 130 computer centres of which 12 are very large.
In HPC there is a strong demand for parallel storage from users in the fields of computational physics, CFD, crash analysis, climate modelling, oceanography, seismic processing and interpretation, bioinformatics, cosmology, computational chemistry and materials sciences. The parallel storage requirement is being driven by the growing size of data sets, more complex analysis, the requirement to run more jobs, simulations with more iterations and the fact that the HPC solutions (Linux clusters) are using multicore processors and more nodes. Inherently the systems and applications are becoming more parallel, hence the requirement for parallel I/O increases.
Before we concentrate on HPC storage needs, let's briefly review the disk storage market trends.
The disk storage market is expanding rapidly. By 2008 the HPC storage market will be well over $4 billion, according to IDC. The spread of broadband creates huge volumes of data, increasing data exchange in commercial transactions, email, images, video and music. Since interactions are global, this is happening 24 hours a day, 7 days a week. This data growth and non-stop operations put storage and data protection at the heart of this business, requiring high-speed processing for large data sets and high-speed backup for protection. Tape is insufficient for this purpose.
High-end NAS storage systems are likely to be using 10 Gbps TCP/IP (soon to be using 20 Gbps) and could have over 150 TBs in a single rack. It is often connected to SAN with a fibre cable and expandable into a cluster.
To deliver a "best-in-class" solution, the compute server and data handling are decoupled. They are highly complementary, but need to be scaled together for balance to handle several petabytes of active data. Although data patterns vary, the system needs to be designed from the ground up for multiple petabyte capability and several millions, or even billions, of files. It is therefore imperative that the data handling systems scale and the network bandwidth does not become a bottleneck.
In the rich digital content environment of today, the limitations of traditional NAS/SAN storage -- scalability, performance bottlenecks and cost -- are driving the industry to find new solutions. The response from the industry was the clustered storage evolution. Vendors claim that clustered NFS storage provides customers with enormous benefits in this digital content environment. The benefits include massive scalability, 100X larger file system, unmatched performance, 20X higher total throughput and industry-leading reliability. They also claim it is as easy to manage a 10-petabyte file system as a 1-terabyte file system. Clustered NFS solutions are fine for most large Web sites, but they simply don't handle the kind of large files typical of most HPC applications very well.
A typical cluster computing architecture consists of a software stack of applications and middleware, tens or thousands of processors/clients, a high speed interconnect using, say, 10GigE, InfiniBand, Myrinet or Quadrics, thousands of direct network connections and hundreds of connections to physical storage.
Storage clusters, similar to compute clusters, transparently aggregate a large amount of independent storage nodes in order to appear as a single-entity. They typically use the same network technology as the compute cluster (InfiniBand or 10GigE), processing power (CPU, multicore, SMP), large amounts of globally coherent cache, and disk drives (up to 1 TB each).
A cluster file system is likely to be using industry standard protocols, NFS, CIFS, HTTP, FTP, NDMP, SNMP, ADS, LDAP and NIS for security, or some other product of similar standing. A cluster file system creates one giant drive or NFS mounted fully symmetric cluster. Such a system is massively scalable to multiple petabytes, easy to manage and has plenty of growth potential. The management of LUNs, volumes or RAID is taken care of by the storage cluster management system and is normally hidden from the user.
Page: 1 of 3(Digg, Technorati, more)
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Mar 11 | Linux Magazine | CUDA may be the rage, but OpenCL is a standard that has some features you may need. Read more...
Mar 09 | Free Software Magazine | Data-driven computing will need open software. Read more...
Mar 09 | Bio-IT World | Tahoe Informatics founder eyes GPUs, CUDA software. Read more...
Mar 08 | Sporting Life | Formula One engineers differ on benefits of CFD. Read more...
Mar 08 | InfoWorld | AMD offers up 48-core server prize. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html