From the Editor | Main Blog Index
June 18, 2009
Wondering how the new quad-core Intel Nehalem (Xeon 5500 series) and six-Core AMD Istanbul (Opteron 2400 series) stack up against each other on HPC-style codes? The folks at Advanced Clustering Technologies, a company that builds customized HPC clusters from standard components, have been putting the latest high-end x86 silicon through its paces, and have generated some interesting results. Company engineers there ran the High Performance Linpack (HPL) benchmark on comparable Nehalem- and Istanbul-based machines, and reported their findings on the firm's Web site.
Linpack, of course, is an artificial benchmark, but it is a decent measure of peak HPC performance on a given architecture and is the basis of the popular TOP500 list of supercomputers. Benchmarks, in general, are easy to misuse though, so the HPC system buyer has to be aware of their application. (Our friend, Andy Jones, vice-president of HPC at the Numerical Algorithms Group, spells out how to use benchmarking to good effect in Thursday's ZDNet article.) Serious HPC buyers tend to use a variety a benchmarks to make procurement decisions, but Linpack is often the starting point.
For their HPL tests, the engineers at Advanced Clustering Technologies took some pains to match up the systems so as to provide an apples-to-apples comparison of CPUs. According to the post, written by cluster engineer Shane Corder:
All of the testing showed we could achieve the highest performance when using both the Intel Compilers and Intel Math Library -- even on the AMD system -- so these were used ... as the base of our benchmarks. The benchmarks were run on an Opteron 2435 Istanbul system (6 core 2.6GHz processor with 16GB of 800MHz DDR2) and a X5550 Nehalem system (quad core 2.66GHz processor with 12GB of 1333MHz DDR3). An attempt was made to keep the systems identical in every other way.
They did adjust the HPL problem size to compensate for the larger memory capacity on the Nehalem platform, such that the code would approach 100 percent of memory usage on each system.
In a nutshell, Istanbul beat out Nehalem, 99.38 gigaflops to 74.03 gigaflops, respectively. It might not be too surprising that the six-core beat out the quad-core, but since Intel supports two threads per core with its so-called "hyperthreading" technology, one might surmise that Intel has the overall advantage in parallel computation. In practice though, a speed boost from hyperthreading is highly application dependent. According to the engineers at Advanced Clustering Technologies, they actually noticed a decrease in performance when using hyperthreading while running HPL. They told me that Linpack is one of the few codes that does not benefit from this kind of technology.
Nehalem did turn out to be more computationally efficient (HPL peak/theoretical peak), which they attributed to the higher memory bandwidth of DDR3 -- Istanbul uses DDR2 -- and less cache snooping. Users are not usually concerned with such metrics, but it does point to a better system balance in the Intel design.
The more telling metric is price-performance, which the AMD platform won hands down: $35.21/gigaflop for the Istanbul-based system versus $52.33/gigaflop for the Nehalem system. When you're talking teraflops, that difference adds up quickly.
As I mentioned before, the results here are all based on Linpack, so the results won't necessarily reflect real-world HPC codes. It's quite likely that a quad-core Nehalem will outperform the six-core Istanbul on many applications, especially the ones that are memory-constrained or can benefit from Intel's hyperthreading architecture. Advanced Clustering Technologies says it hopes to run more HPC benchmarks in the future and intends to publish the results.
Posted by Michael Feldman - June 18, 2009 @ 3:09 PM, Pacific Daylight Time
There is 1 discussion item posted.
The benchmark is completely wrong.
Submitted by
Patrick LEE
on Jun 18, 2009 @ 10:14 PM EDT
The benchmark results shown on http://www.advancedclustering.com/company-blog/high-performance-linpack-on-xeon-5500-v-opteron-2400.html is clearly incorrect.
The Linpack benchmark result will get better as problem size became larger. As the Istabul 2435 have 4GB more, it runs a larger problem size N. One should take away the 4GB extra memory and both the Linpack benchmark with the problem size on both system.
Conclusion: The benchmark is completely wrong.
Post #1
|
Join the Discussion |
![]()
Michael Feldman is the editor of HPCwire.
No Recent Blog Comments
NVIDIA has introduced its first Kepler-generation GPU product for high performance computing, and revealed some of the inner working of the new architecture. The announcement took place at the kickoff of the company's GPU Technology Conference taking place this week in San Jose, California.
Read more...
Intel Corp. has launched three new families of Xeon processors, joining the Xeon E5-2600 series the chipmaker introduced in March. These latest chips span the entire market for the Xeon line, from four- and two-socket servers, down to entry-level workstations and microservers. A number of HPC server makers, including SGI, Dell, and Appro announced updated hardware based on the new silicon.
Read more...
With the fastest supercomputers on the planet sporting multi-megawatt appetites, green HPC has become all the rage. The IBM Blue Gene/Q machine is currently number one in energy-efficient flops, but a new FPGA-like technology brought to market by semiconductor startup eASIC is providing an even greener computing solution. And one HPC project in Japan, known as GRAPE, is using the chips to power its newest supercomputer.
Read more...
May 16, 2012 |
Chief scientist discusses memory stacks, interconnects, and US technology leadership.
Read more...
May 15, 2012 |
GPU maker conjures up visualization technology for virtual desktops.
Read more...
May 14, 2012 |
Pessimistic predictions about technology have a poor track record, according to 451's John Barr.
Read more...
May 10, 2012 |
DRAM manufacturers gear up for DDR4.
Read more...
May 09, 2012 |
Steven Chu discusses the role of supercomputing in energy research.
Read more...