AMD Hatches FLOP-Monster GPU Card

By Tiffany Trader

November 12, 2012

SC12 is officially here and the chip vendors are eagerly touting their latest and greatest offerings. On the GPU accelerator front, Advanced Micro Devices (AMD) and NVIDIA both have products dropping today. AMD is launching the FirePro S10000, its most powerful GPU card yet. In fact, AMD is claiming this is the most powerful server graphics card on the market, but we’ll come back to that in a moment.

The FirePro S10000 (yes, four zeros!) is the successor to the S9000, which debuted in August. Both cards are based on 28 nm “Tahiti” silicon and support error-correcting code (ECC) memory, but compared to its predecessor, the S10000 has an extra GPU and packs more FLOPS punch (both single and double precision). The new card also offers greater memory bandwidth (480 GB/s versus 264 GB/s with ECC turned off) and a slightly reduced clock speed (825 MHz versus 900 MHz). Both cards have 6 GB of GDDR5 on-board memory and support PCI Express Gen3, but while the S9000 has one DisplayPort output, the S10000 offers four plus a DVI output. The total core count doubles, from 1,792 to 3,584. These enhancements raise the maximum thermal envelope from 225 watts to a daunting 375 watts.

As with the previously launched “Southern Islands”-based cards, the S10000 is built on top of AMD’s “Graphics Core Next” (GCN) architecture, which enables the two GPUs to carry out compute and graphics processing simultaneously. This makes them a good fit for a range of visualization and technical workloads, but the target audience is design professionals who use computer-aided design (CAD) and media and entertainment (M&E) applications.

At last Tuesday’s press briefing, Bahman Dara, senior manager of worldwide marketing for AMD, explained that as products and design get more complex and sophisticated, there’s a greater demand for computational analysis, which makes server-based computing and remote graphics increasingly important. “This product is capable of delivering both compute and graphics at the same time – why buy two cards when you can buy one and achieve same objective?” queried Dara.

The “two cards” Dara is referring to are the two NVIDIA lines: Tesla, optimized for compute, and Quadro, which is geared for high-end graphics work. The idea is that the S10000 can do the work of both of these chips. During the same briefing, Joyce Burke, product manager at AMD, continued the theme. The card’s flexibility will enhance IT integration, she said, and because users will not need to purchase a second card for specific tasks, it’s also cost-efficient. Speaking of cost, the S10000 retails for $3,599 US, $1,100 more than its predecessor, but, as usual, most sales will take place through OEMs.

The continuing push by AMD into higher-end GPU territory shows they are serious about competing in this market, and that means competing against NVIDIA. The chipmaker emphasized repeatedly during the press briefing that its latest graphics card outperforms NVIDIA’s best offerings on pure performance and performance per watt. However, and this is important, they were using stats from the older M2090 and Kepler K10 products. We’ve known the higher-end Kepler K20s were coming since NVIDIA broke the news at their GPU Technology Conference last May and we knew they would exceed a teraflop of peak double-precision performance. With the full K20 specs now available, these older comparisons are obsolete.

“This will be the first professional-grade card to exceed one teraflop of double-precision performance,” Dara told reporters last Tuesday. Alas, the NVIDIA K20 and the uber-premium K20X, which also dropped today, both exceed the teraflop mark as well. The Kepler K20X is capable of 3.95 teraflops single precision and 1.31 teraflops double precision, while the K20 offers 3.52 teraflops and 1.17 teraflops, respectively.

With 5.91 teraflops of peak single-precision and 1.48 teraflops of peak double-precision floating point performance, the dual-GPU FirePro S10000 maintains some bragging rights. But it does so at the expense of efficiency. When it comes to double-precision performance, the new FirePro runs at 3.95 gigaflops per watt, while the K20 outputs 5.2 gigaflops per watt and the K20X achieves 5.57 gigaflops per watt. On the single-precision side, the figures are: FirePro (15.76 gigaflops per watt), K20 (15.64 gigaflops per watt), and K20X (16.81 gigaflops per watt). A more apt comparison, however, is to the single-precision-optimized K10, which supplies 20.35 gigaflops per watt (making it roughly 29% more efficient than the S10000).
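The per-watt figures above follow from dividing each card's peak throughput by its maximum board power. A minimal Python sketch of that arithmetic, using only numbers quoted in this article, might look like the following. The names `CARDS`, `gflops_per_watt`, and `efficiency_table` are illustrative, and the K20 and K20X wattages are assumed to be the 225-watt and 235-watt envelopes cited elsewhere in this piece.

```python
# Peak throughput (teraflops) and maximum board power (watts) as quoted
# in the article. The dictionary layout and names are illustrative only.
CARDS = {
    "FirePro S10000": {"sp_tflops": 5.91, "dp_tflops": 1.48, "watts": 375},
    "K20": {"sp_tflops": 3.52, "dp_tflops": 1.17, "watts": 225},
    "K20X": {"sp_tflops": 3.95, "dp_tflops": 1.31, "watts": 235},
}


def gflops_per_watt(tflops, watts):
    """Convert peak teraflops to gigaflops, then divide by board power."""
    return tflops * 1000.0 / watts


def efficiency_table(cards):
    """Return single- and double-precision gigaflops per watt, rounded to 2 places."""
    return {
        name: {
            "sp": round(gflops_per_watt(spec["sp_tflops"], spec["watts"]), 2),
            "dp": round(gflops_per_watt(spec["dp_tflops"], spec["watts"]), 2),
        }
        for name, spec in cards.items()
    }


if __name__ == "__main__":
    for name, eff in efficiency_table(CARDS).items():
        print(f"{name}: {eff['sp']} GF/W single, {eff['dp']} GF/W double")
```

These are peak, vendor-rated numbers divided by rated power, so they say nothing about sustained application efficiency.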

Burke acknowledged that the S10000’s max thermal design power of 375 watts is at the high end, but emphasized that with two GPUs in a single dual-slot configuration, the new FirePro consumes 15 percent less power overall than two 225-watt cards. Going by AMD’s pre-release press material, Burke is most likely referencing the 225-watt Tesla M2090s, but note that this is a common power envelope for a GPU accelerator. It shows up in the FirePro S9000 and in the Kepler K10s and K20s; the elite K20X will boost the heat output to 235 watts. At any rate, FLOPS per watt is a more useful metric than overall power consumption, and as we’ve demonstrated, these figures are not in AMD’s favor.
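For readers who want to check the power comparison against the rated figures alone, here is a small Python sketch. The function name `percent_saving` is hypothetical, and the calculation assumes nameplate thermal design power rather than measured draw; the article does not say which basis AMD used for its 15 percent figure.

```python
def percent_saving(single_card_watts, per_card_watts, card_count=2):
    """Percent reduction in rated power when one card replaces several.

    Uses rated (nameplate) wattages only, which is an assumption here.
    """
    baseline = per_card_watts * card_count
    return (baseline - single_card_watts) / baseline * 100.0


if __name__ == "__main__":
    # One 375-watt S10000 versus two 225-watt cards (450 watts combined).
    print(f"{percent_saving(375, 225):.1f}% lower rated power")
```

On rated numbers the reduction comes out slightly above AMD's quoted 15 percent, which suggests the company's figure rests on a different basis or on conservative rounding.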

Where the new FirePro excels is in pure FLOPS and its flexible compute-plus-graphics design. The S10000 is the Swiss Army knife of processor cards – it supports HPC workloads as an accelerator, but is also positioned and targeted for Virtual Desktop Infrastructure (VDI).

“FirePro products support two types of VDI,” Burke explained. For higher-end applications, direct GPU pass-through mode from VMware and Citrix facilitates one user per GPU – and since these boards have two GPUs that means two users. Reflecting a more traditional VDI experience, Microsoft’s RemoteFX allows one GPU to be shared among multiple users of office productivity apps that are not particularly graphics-intensive.

AMD’s claim is that with two GPUs per board, the S10000 can handle a greater density of users, but this configuration also places a lot more demand on the PCI bus, creating a possible bottleneck. Also, at nearly 400 watts, excess heat could certainly be an issue, especially for densely configured servers.

Overall, AMD has added an interesting offering to its portfolio that will likely appeal to a certain class of users. However, in its zeal to go head-to-head with NVIDIA’s Kepler architecture, AMD may have rushed the launch. Questions about partner OEM and software support were met with a wait-and-see response. We know OpenCL 1.2 is supported, but the company was mum when it came to further parallel programming languages and tools.

As for application performance, neither real-world nor artificial benchmarks were available at press time. To be fair, AMD said that more would be revealed during SC, and they even hinted that we should pay close attention to the next Green500 list. We shall see!
