Since 1986 - Covering the Fastest Computers in the World and the People Who Run Them
From the Editor | Main Blog Index
June 30, 2009
The engineering team at Advanced Clustering Technologies is at it again. A couple of weeks ago, they published the results of the High Performance Linpack (HPL) benchmark for comparable Intel Nehalem- and AMD Istanbul-based systems, which I discussed in a previous article. Those results had Istanbul edging out Nehalem for Linpack bragging rights.
Now the engineers at Advanced Clustering Technologies have pitted those same microprocessors against each other using the STREAM benchmark and have posted the results on their Web site. STREAM is part of the HPC Challenge suite and measures sustainable memory bandwidth -- one of the most important attributes of high performance computing systems today.
Memory bandwidth, or lack thereof, has become increasingly significant for many applications, since as core counts increase, computational power is racing ahead of memory performance. Like HPL, STREAM is a synthetic benchmark, but, in general, if an application is memory constrained, the STREAM benchmark is a good indicator of relative performance.
The STREAM results for the Nehalem and Istanbul offered no surprises. If you've been following the x86 rivalry, you've probably guessed that Intel's Nehalem (Xeon 5500) processor, with its more advanced memory subsystem, bests AMD's Istanbul Opteron, which relies on the older DDR2 technology. According to Advanced Clustering Technologies engineer Shane Corder:
Even the slowest memory speed on a Xeon 5500 processor bests the fastest produced by the Opteron by as much as 20%; comparing the Opteron to the fastest Xeon, the Xeon outperforms by over 75%. The Xeon 5500 gets these much higher memory bandwidth results because of tri-channel instead of dual-channel memory, the increased clock speed of DDR3 (up to 1333MHz), and the fast point-to-point CPU interconnect provided by its Quick Path Interconnect.
One other noteworthy data point is that STREAM performance on the six-core Istanbul turned out to be slightly worse than on the quad-core Shanghai. The Advanced Clustering Technologies folks attribute this to the two extra Istanbul cores having to contend for bandwidth on the same number of memory controllers (two) that are present in the Shanghai chip. As the company did with the Linpack results, the results were also described in terms of price-performance:
When you add cost per machine into the mix, the results still show the Xeon 5500 series with a clear lead. The Xeon machine as configured has a price of approximately $3,800 while the Opteron is priced at $3,500. This gives the Xeon a rate of 9.8 megabytes per second per dollar vs. 5.9 megabytes per second per dollar for the Opteron: a 66% advantage for the Intel Xeon 5500 series.
As before, the caveat is that the synthetic benchmark results may not correspond to real-world apps. The recommendation from Advanced Clustering Technologies is that you use your own codes to figure out which processor and system configuration is going to give you the most bang for the buck.
Posted by Michael Feldman - June 30 @ 10:55AM, Pacific Daylight Time
(Digg, Technorati, more)
Michael Feldman is the editor of HPCwire.
More Michael Feldman
good by Melany
good by Melany
16 core by gretta
Exciting Ride by melonakos
Oracle exiting? by JF@OCF
SGE ... by cdespoix
Oracle by PhilT
quality by db05
New IC Design Modeling Software Needed for Next Generation by symmecon
I thought this was HPC by Don Lee
rethuglican apologist above; by chammitt
a zinger by melonakos
RoCE does not require DCB by Paul Grun
iWARP provides RDMA over Ethernet – Part 2 by David Fair
iWARP provides RDMA over Ethernet - Part 1 by David Fair
Why can't people get this right?? by BradBooth
Re: Podcast: HPC in the Cloud; Cray Cozies Up To ISVs by Kate
Good resource by Kate
2008R2 parity with linux by tprince
Google needs 10 terabit ethernet by zipdisk2003
Re: IBM and HPC by Proteus
Pop filter and deesser by ChristophWeber
MPI collective operations by jsquyres
Compairson to Core i7-980X by rsingle
HPC? not so much by ewahl
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
The National Science Foundation has awarded funding to four projects as part of the Future Internet Architecture program; and the 3PAR bidding war is won by HP. We recap those stories and more in our weekly wrapup.
Read More...
Intel Corp has released Parallel Studio 2011, a set of four tools designed to mainstream software development on multicore x86 architectures. The update folds in a number of parallel programming technologies that the company has acquired or developed independently over the past few years, including the Cilk Arts and RapidMind technologies, and Intel's own Ct data parallel language framework.
Read More...
There's nothing like a blazing hot summer to focus one's attention on the best ways to keep cool. That goes for datacenter operators as well, who are equally worried about keeping their servers properly chilled. While there is no shortage of innovative cooling solutions being proffered by various vendors, a new liquid immersion cooling solution from startup Green Revolution Cooling could end up being the best of them all.
Read More...
Sep 03 | Should engineers take advantage of GPU computing? Read more...
Sep 02 | Could see first products in three years. Read more...
Sep 01 | A hand-picked selection of video presentations from the TED conference -- because the next big thing has to start somewhere. Read more...
Aug 30 | CERN project adapts its computation and storage strategy as hardware gets cheaper and better. Read more...
Aug 26 | Chinese-made chip adds vector SIMD unit; delivers 128 gigaflops in 40 watts. Read more...
Jul 29 | | Panasas storage solutions deliver high throughput with many concurrent backup IO streams to standard backup applications such as Veritas NetBackup™ or EMC® NetWorker™. Download this whitepaper to understand the essential elements for effective backup and restore: the tape subsystem, networking, file system workload and administrative policy.
Jul 28 | | As compelling economics and performance drive GPUs into HPC clusters, developers are scrambling to catch up. Download this whitepaper from Platform Computing to understand how to capture the benefits of exciting new GPU capabilities.
In this webinar you will hear about the current storage challenges facing the HPC community, how Panasas storage solutions provide exceptional performance, scalability, and manageability, and how you can achieve the lowest total Cost of Ownership with a system that installs and configures in 15 minutes.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.