HPCwire

The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing

Blog: From the Editor

NVIDIA Keeps It Interesting


Planting CUDA Seeds

NVIDIA is continuing to push hard on CUDA, the company's C-based software environment for GPU computing. With last month's announcement of the first CUDA Center of Excellence at the University of Illinois at Urbana-Champaign, NVIDIA said it donated half a million dollars to the school.

The announcement also noted that "[u]niversities wishing to become CUDA Centers of Excellence must teach a CUDA class and use CUDA technology in their research, usually across several labs. In return, NVIDIA supports the school through funding and equipment donations, including help to set up a GPU computing cluster." This is the same general idea behind the Sony, Toshiba and IBM (STI) Center of Competence program for Cell technology. The STI Center was established at Georgia Tech last year. Can a Larrabee Center of Distinction be far behind?

Big Time Protein Folding

On the hardware side, even NVIDIA's non-Tesla gear is starting to show up in HPC applications. On Thursday, the company announced the big impact that the GeForce GPUs are having on Stanford's Folding@Home project. The project aggregates donated computer cycles on desktop systems to calculate how different types of proteins fold. The idea is to help understand protein behavior as it relates to cancer, cystic fibrosis, Parkinson's and other diseases.

Powered by CUDA-enabled software, the GeForce GPUs are doing yeoman's work for the folders. According to a recent ExtremeTech report, there are currently over 7,000 GPUs running the Folding@Home program, yielding around 840 teraflops of application performance. The article goes on to say:

That's somewhere around 110 gigaflops per GPU, on average. To put that in perspective, the regular windows CPU client is about one gigaflop per client (it's a mix of the single-threaded client and the multi-core SMP version). The PS3 looks like it leads the pack with a total of 1,358 teraflops, but that's from over 48,000 active PS3s. Each PS3 is actually delivering about 28 gigaflops apiece.

Those are all 32-bit floating point performance numbers, but if you can cure cancer with single precision, that's fine with me.
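The per-device averages quoted above can be checked with a few lines of arithmetic. This is only a rough sketch: the totals and device counts are the rounded figures from the ExtremeTech quote, and the function and variable names are made up for illustration. Because the report says "over 7,000" GPUs, dividing by exactly 7,000 gives an upper bound rather than the article's "around 110" figure.

```python
# Figures as quoted from the ExtremeTech report; counts are lower bounds ("over").
GPU_TOTAL_TFLOPS = 840
GPU_COUNT = 7_000
PS3_TOTAL_TFLOPS = 1_358
PS3_COUNT = 48_000


def gflops_per_device(total_teraflops, devices):
    """Average gigaflops per device (1 teraflop = 1,000 gigaflops)."""
    return total_teraflops * 1_000 / devices


gpu_avg = gflops_per_device(GPU_TOTAL_TFLOPS, GPU_COUNT)
ps3_avg = gflops_per_device(PS3_TOTAL_TFLOPS, PS3_COUNT)

# Number of GPUs that would make the average come out at the quoted 110 gigaflops.
implied_gpu_count = GPU_TOTAL_TFLOPS * 1_000 / 110

print(f"GPU average: at most {gpu_avg:.0f} gigaflops")
print(f"PS3 average: about {ps3_avg:.1f} gigaflops")
print(f"GPU count implied by a 110 gigaflop average: about {implied_gpu_count:.0f}")
print(f"GPU-to-PS3 ratio: roughly {gpu_avg / ps3_avg:.1f}x")
```

With these inputs, the PS3 figure lands close to the quoted 28 gigaflops, and the 110 gigaflop GPU average is consistent with a client count somewhat above 7,000.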

Argonne's Blue Gene/P Gets a Visual Buddy

On Wednesday, Argonne National Laboratory announced its plans to add an NVIDIA Quadro-based data analytics/visualization system to pair up with its Blue Gene/P supercomputer. The visualization system, named Eureka, will turn the torrents of data produced by applications running on Blue Gene into pretty pictures that make sense to mere mortals.

Apparently, 208 NVIDIA Quadro GPUs will be used to construct the system, which is being built by GraphStream, Inc. The hardware will consist of four racks of 1U boxes, with each box containing four Quadro graphics cards. According to a Dr. Dobb's article, the Eureka server building block is the SuperMicro 6015-UR. Each GPU box is hooked to two SuperMicro servers so that each compute server drives two GPUs. Visualization is driven by data that comes from a very large storage array, which is also hooked up to the Blue Gene machine.
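The box and server counts implied by that description can be worked out as follows. This is a back-of-the-envelope sketch, not a published specification: it assumes every GPU box is fully populated with four cards and that each box has its own pair of servers, and the variable names are illustrative.

```python
# Figures from the description above.
TOTAL_GPUS = 208
GPUS_PER_BOX = 4       # four Quadro cards in each GPU box
SERVERS_PER_BOX = 2    # each GPU box is hooked to two servers

# Assumption: all boxes are full, so the GPU count divides evenly.
gpu_boxes, leftover = divmod(TOTAL_GPUS, GPUS_PER_BOX)

# Assumption: servers are not shared between GPU boxes.
servers = gpu_boxes * SERVERS_PER_BOX
gpus_per_server = TOTAL_GPUS // servers

print(f"GPU boxes: {gpu_boxes} (leftover GPUs: {leftover})")
print(f"Compute servers: {servers}")
print(f"GPUs driven by each server: {gpus_per_server}")
```

Under those assumptions the numbers are self-consistent: the server count works out so that each server drives two GPUs, matching the description.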

The Manycore War of Words

Meanwhile, NVIDIA and Intel continued to spar on the GPU-Larrabee match-up. An article in Custom PC recorded Andy Keane's reaction to Pat Gelsinger's recent comments disparaging the CUDA technology. Keane, general manager of NVIDIA's GPU computing group, took offense at some offhand remarks made by the Intel exec earlier this month. Gelsinger, a senior vice president and co-general manager of Intel's Digital Enterprise Group, started the brouhaha by claiming that GPGPU languages like NVIDIA's CUDA will one day be nothing more than "interesting footnotes in the history of computing annals," adding that Larrabee, Intel's upcoming manycore processor, will be the solution that succeeds in the long term.

If you're a self-respecting Nvidian, those are fighting words. From the article, here is the gist of Keane's reaction:

[T]he high level of interest in CUDA "is causing Larrabee. Larrabee's the reaction." He then added that "these comments from Gelsinger; if we were not making a lot of headway do you think he'd even give us a moment's notice? No. It's because he sees a lot of this activity. The strategy is to try to position it [CUDA] as something scary and unique, and it's really not; it's something that's very accessible."

Next month, Intel may release a lot more details on Larrabee at SIGGRAPH and the Intel Developer Forum. An article published Tuesday in The Inquirer says the Larrabee developer boards are being shipped in November. If true, NVIDIA is going to have a much better idea what it's up against real soon.

NVIDIA for Sale?

Finally, Simon Brew in the UK's IT PRO speculates about whether NVIDIA could be bought out. The reasoning behind the speculation is the AMD-ATI merger, which was designed to synergize two successful technologies into a greater whole -- or into a greater hole, as the case may be. Even Brew admits, "things haven’t quite gone to plan."

His real argument is that in the long term, NVIDIA's toughest competition will be Intel, not AMD. The implication is that the GPU vendor will have to be a lot bigger and brawnier to go up against the chip giant, especially in the expanding mobile graphics market, where Intel dominates. That's a valid point. But so far, NVIDIA has been more nimble than Intel in the graphics arena, and it's got a big head start at the high end of the market. I'm guessing NVIDIA figures it can still outrun the competition. We'll see...

Posted by Michael Feldman - July 24 @ 8:36PM

Michael Feldman

Michael Feldman is the editor of HPCwire.
