August 12, 2010
HPC on cloud platforms can be undermined by performance-numbing virtualization layers and slow networks. But a group of European researchers have found that there could be a more fundamental problem: multitenancy.
An article that appeared this week in HPC in the Cloud, written by software consultant Jeff Napper along with Paolo Bientinesi and Roman Lakymchuk of RWTH Aachen University, suggests that competition for resources by multiple applications running on the same nodes can slow performance significantly for HPC workloads. In their testing of a DGEMM (double-precision general matrix multiply) code on a single cloud node, they found that the typical run times were much slower than on a dedicated machine:
The fastest execution time of the DGEMM over the 6 hours... is similar to that on a typical HPC cluster node. However, the average execution time on our cloud node is more than 8 times worse with a standard deviation of 33%. The hardware is good, as shown by the best execution time, but the competition among tenants results in diminished average performance with a wide range of possible outcomes. Thus, the expected performance of a simple in-memory matrix-matrix multiply on a multitenant cloud node is not good and fluctuates significantly. Without even using the network, the cloud nodes still cannot be expected to perform as a typical HPC cluster due to the competition from other tenants.
What they discovered was that if they used less of the cores on the node, performance could be optimized. For an 8-core node, they found that using just 2 cores in this particular cloud yielded the lowest (fastest) average execution time. Even when they ran the code across multiple nodes (thus adding the network variable back in), using less than the full complement of cores produced faster results.
A good read for both prospective HPC cloud users and cloud providers.
Full story at HPC in the Cloud
Contributing commentator, Andrew Jones, offers a break in the news cycle with an assessment of what the national "size matters" contest means for the U.S. and other nations...
Read more...
Today at the International Supercomputing Conference in Leipzing, Germany, Jack Dongarra presented on a proposed benchmark that could carry a bit more weight than its older Linpack companion. The high performance conjugate gradient (HPCG) concept takes into account new architectures for new applications, while shedding the floating point....
Read more...
Not content to let the Tianhe-2 announcement ride alone, Intel rolled out a series of announcements around its Knights Corner and Xeon Phi products--all of which are aimed at adding some options and variety for a wider base of potential users across the HPC spectrum. Today at the International Supercomputing Conference, the company's Raj....
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
Join HPCwire Editor Nicole Hemsoth and Dr. David Bader from Georgia Tech as they take center stage on opening night at Atlanta's first Big Data Kick Off Week, filmed in front of a live audience. Nicole and David look at the evolution of HPC, today's big data challenges, discuss real world solutions, and reveal their predictions. Exactly what does the future holds for HPC?
Join our webinar to learn how IT managers can migrate to a more resilient, flexible and scalable solution that grows with the data center. Mellanox VMS is future-proof, efficient and brings significant CAPEX and OPEX savings. The VMS is available today.