The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
March 17, 2006
If you're reading this article, the idea that cluster adoption is increasing will be of no surprise to you. The idea of linking low cost, commodity computers together, and leveraging a low cost operating system, has been around for a number of years now. The ability to cost-effectively leverage high performance computing changes the way people do their jobs and changes the way leaders plan their projects. The use of these types of systems has grown from small projects inside of labs, universities, and basements into full-fledged validated enterprise tools. Scali, a company making cluster management and MPI library software tools, has been involved in Linux-based cluster computing since 1997. Over the past nine years, we've noticed a pattern in how clusters are brought into organizations, particularly commercial organizations, and how their usage and impact changes over time. This article will discuss those various phases and how laying the proper groundwork can significantly increase the likelihood of success when transitioning to the next phase.
One of the interesting aspects of seeing how cluster adoption has spread, particularly within a single organization over time, is how many users claim that if they knew then what they know now, they would have done things differently. In general, many users comment that they tend to focus very heavily on the processor count and performance aspects of their initial systems, but very quickly found that things like management, expansion, networks, operating systems, and applications ended up being the key elements that affected actual user success.
As clusters become more mainstream, we see that the challenges of deploying, managing and maintaining these environments have become increasingly complex and require comprehensive tools, such as Scali Manage, to provide assistance in making these clusters as efficient as possible.
Phase One: Project/Evaluation
Typically, we see that most commercial organizations bring their initial clusters in as single purpose clusters. These clusters are usually focused on specific tasks or specific projects. They consist of homogenous hardware and operating environments that are not expected to change much from initial specifications.
These clusters are often brought into the organization by the project teams themselves and are not directly tied to the general IT environment. This implies that the support and ongoing maintenance is also a responsibility of the project team, with all the benefits and risks that control offers them. They are typically deployed to provide performance and capabilities to teams that have specific project goals that need to be met. They usually have a single or limited number of applications and a limited user base. The primary criteria of success for these types of clusters are that they get up and running quickly, providing benefits to the project completion.
From a management perspective, these users need solutions that can deploy their clusters quickly, leveraging best practices to ensure success. From an ongoing perspective, these environments need an easy-to-use solution for keeping their systems running and for handling any updates to the application environment, while providing basic feedback on the efficiency and effectiveness of the cluster.
Phase Two: Distribution
Once projects experience some success with clusters, that success breeds additional interest in the use of the cluster -- and of clusters in general -- by other projects within the organization.
At this point, the initial cluster environment starts to grow in all dimensions from its original design and purpose. As this occurs, the number of nodes increases, the mix of hardware changes from a purely homogenous to a heterogeneous environment, the number of applications increases and the number of users expands.
Page: 1 of 3(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
C-DAC announces plans for a petaflop system; IBM researchers are working on vertical integration techniques to extend Moore's Law another 15 years. We recap those stories and more in our weekly wrapup.
Read More...
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
Mar 19 | OfficialWire | New super to support intelligence work Down Under. Read more...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html