The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
From the Editor | Main Blog Index
July 24, 2008
Planting CUDA Seeds
NVIDIA is continuing to push hard on CUDA, the company's C-based software environment for GPU computing. With last month's announcement of the first CUDA Center of Excellence at the University of Illinois at Urbana-Champaign, NVIDIA said it donated half a million dollars to the school.
The announcement also noted that "[u]niversities wishing to become CUDA Centers of Excellence must teach a CUDA class and use CUDA technology in their research, usually across several labs. In return, NVIDIA supports the school through funding and equipment donations, including help to set up a GPU computing cluster." This is the same general idea behind the Sony, Toshiba and IBM (STI) Center of Competence program for Cell technology. The STI Center was established at Georgia Tech last year. Can a Larrabee Center of Distinction be far behind?
Big Time Protein Folding
On the hardware side, even NVIDIA's non-Tesla gear is starting to show up in HPC applications. On Thursday, the company announced the big impact that the GeForce GPUs are having on Stanford's Folding@Home project. The project aggregates donated computer cycles on desktop systems to calculate how different types of proteins fold. The idea is to help understand protein behavior as it relates to cancer, cystic fibrosis, Parkinson's and other diseases.
Powered by CUDA-enabled software, the GeForce GPUs are doing yeoman's work for the folders. According to a recent ExtremeTech report, there are currently over 7,000 GPUs running the Folding@Home program, yielding around 840 teraflops of application performance. The article goes on to say:
That's somewhere around 110 gigaflops per GPU, on average. To put that in perspective, the regular windows CPU client is about one gigaflop per client (it's a mix of the single-threaded client and the multi-core SMP version). The PS3 looks like it leads the pack with a total of 1,358 teraflops, but that's from over 48,000 active PS3s. Each PS3 is actually delivering about 28 gigaflops apiece.
Those are all 32-bit floating point performance numbers, but if you can cure cancer with single precision, that's fine with me.
Argonne's Blue Gene/P Gets a Visual Buddy
On Wednesday, Argonne National Laboratory announced its plans to add a NVIDIA Quadro-based data analytics/visualization system to pair up with its Blue Gene/P supercomputer. The visualization system, named Eureka, will turn the torrents of data produced by applications running on Blue Gene into pretty pictures that make sense to mere mortals.
Apparently, 208 NVIDIA Quadro GPUs will be used to construct the system, which is being built by GraphStream, Inc. The hardware will consist of four racks of 1U boxes, with each box containing four Quadro graphics cards. According to a Dr. Dobbs article, the Eureka server building block is the SuperMicro 6015-UR. Each GPU box is hooked to two SuperMicro servers so that each compute server drives two GPUs. Visualization is driven by data that comes from a very large storage array, which is also hooked up to the Blue Gene machine.
The Manycore War of Words
Meanwhile, NVIDIA and Intel continued to spar on the GPU-Larrabee match-up. An article in Custom PC recorded Andy Keane's reaction to Pat Gelsinger's recent comments disparaging the CUDA technology. Keane is general manager of NVIDIA's GPU computing group and took offense at some off-handed remarks made by the Intel exec earlier this month. Gelsinger, a senior vice president and co-general manager of Intel's Digital Enterprise, started the brouhaha by claiming that GPGPU languages like NVIDIA's CUDA will one day be nothing more than "interesting footnotes in the history of computing annals," adding that Larrabee, Intel's upcoming manycore processor, will be the solution that succeeds in the long term.
If you're a self-respecting Nvidian, those are fighting words. From the article, here is the gist of Keane's reaction:
[T]he high level of interest in CUDA "is causing Larrabee. Larrabee's the reaction." He then added that "these comments from Gelsinger; if we were not making a lot of headway do you think he'd even give us a moment's notice? No. It's because he sees a lot of this activity. The strategy is to try to position it [CUDA] as something scary and unique, and it's really not; it's something that's very accessible."
Next month, Intel may release a lot more details on Larrabee at SIGGRAPH and the Intel Developer Forum. An article published Tuesday in The Inquirer says the Larrabee developer boards are being shipped in November. If true, NVIDIA is going to have a much better idea what it's up against real soon.
NVIDIA for Sale?
Finally, Simon Brew in UK's IT PRO speculates whether NVIDIA could be bought out. The reasoning behind the speculation is the AMD-ATI merger, which was designed to synergize two successful technologies into a greater whole -- or into a greater hole, as the case may be. Even Brew admits, "things haven’t quite gone to plan."
His real argument is that in the long term, NVIDIA's toughest competition will be Intel, not AMD. The implication is that the GPU vendor will have to be a lot bigger and brawnier to go up against the chip giant, especially in the expanding mobile graphics market, where Intel dominates. That's a valid point. But so far, NVIDIA has been more nimble than Intel in the graphics arena, and it's got a big head start at the high end of the market. I'm guessing NVIDIA figures it can still outrun the competition. We'll see...
Posted by Michael Feldman - July 24 @ 8:36PM
(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Michael Feldman is the editor of HPCwire.
More Michael Feldman
Compairson to Core i7-980X by rsingle
HPC? not so much by ewahl
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Re: Podcast: Cray Awarded Defense Deal; SGI Makes Storage Buy; IBM Invents New Algorithm by Nastyanna
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
a third vendor for lossless low latency 10GbE fabric by lee.fisher@hp.com
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
563 out of 1206 by vvolkov
Little Iron by gadunk
At least it's not "cloud" by KevinButerbaugh
Native QPI Interface? by commike
Mmmmmm by hellcats
New transistorized IC chip scales. by symmecon
Itanium at IDF by Alan M. Feldstein
Communication time by jnapper
"The financial meltdown and computing" by donpellegrino
Human Models by mdgabriel
High-End SPARC Chip for Scientific Applications by Alan M. Feldstein
RapidMind by Mr LolO
Rapidmind by dminor
Longer run times by JohnWest
re: Algo trading Angst by jshore
Results of Testing by in_the_crease
C-DAC announces plans for a petaflop system; IBM researchers are working on vertical integration techniques to extend Moore's Law another 15 years. We recap those stories and more in our weekly wrapup.
Read More...
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
Mar 19 | OfficialWire | New super to support intelligence work Down Under. Read more...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html