FPGA Acceleration Gets a Boost from HP, Intel

By Michael Feldman

September 21, 2007

It’s been said more than once that users of high performance computing have an almost insatiable demand for computational power. Applying Moore’s Law to produce additional cores on general-purpose processors helps, but ultimately fails, to keep up with this demand. Bigger systems can be built, but power and size considerations limit scalability.

These obstacles have increased interest in hardware accelerators. FPGAs, GPUs, Cell processors and ClearSpeed boards are all candidates for offloading the kind of fine-grained parallelism common to many HPC applications. Seismic modeling, financial analytics and bioinformatics applications have been sped up anywhere from 10 to 300 times using these newer technologies. And accelerators are able to do this at a fraction of the cost and power of general-purpose CPUs. But in many cases, hyperbole has been delivered faster than real products. For FPGAs, this might be starting to change.

When FPGAs hit the 90nm process node, manufacturers were able to build chips with enough gates and memory on them to host real HPC kernels. The Xilinx Virtex-4 and Altera Stratix II are two such chips. More recently, Xilinx and Altera have delivered even more powerful FPGAs, using 65nm technology — Virtex-5 and Stratix III, respectively. Reconfigurable computing vendors are just catching up to the newfound treasure. DRC Computer Corp., XtremeData Inc., Celoxica Holdings plc, Nallatech, Mitrionics AB, Impulse Accelerated Technologies and a handful of others are beginning to crack the HPC marketplace.

This week Celoxica announced that its RCHTX acceleration board has been qualified by HP for its ProLiant DL145 server. Customers can now buy an HP server equipped with Celoxica’s FPGA board, with HP standing behind it. An accelerated HP machine was demonstrated at the HPC on Wall Street conference on Monday.

The RCHTX board is based on the Xilinx Virtex-4 FPGAs. (Celoxica also has developed support for the next-generation Virtex-5.) According to Jeff Jussel, Celoxica’s VP of Marketing, it still can be a challenge to fit some of the algorithms on the 90nm Virtex-4 chips, but there is certainly enough real estate on the die to implement a lot of useful double-precision floating point codes.

“Frankly, we only need to get enough multipliers on the FPGAs to do the job,” explains Jussel. “We’re finding that they are now big enough so that we can do that.”

Celoxica has targeted the financial services space as one sector with an acute need for accelerated solutions and, frankly, one with the wherewithal to invest in emerging technologies like theirs. The company is in the process of carrying out a “paid engagement” for a Tier 1 investment bank to demonstrate a proof of concept for the RCHTX technology applied to the bank’s analytics applications. Results of the study have shown improvements in the power-speed ratio of up to 30 times compared to the bank’s current server technology.

The partnership with HP is a significant development for Celoxica. Big banks and other financial institutions need to feel comfortable with the level of support they’re going to get with their mission-critical technology. With a staff of 46, it’s hard for Celoxica to provide that by themselves. They realize hooking up with Tier 1 vendors gives them a big boost into these markets.

One advantage to using FPGAs as an accelerator technology are their reconfigurability. In the financial arena, the same system could be used to do both transaction-level applications, like market data pre-processing, and financial analytics, like options trading and derivative analysis. In Celoxica’s current implementation, though, both applications can’t be performed simultaneously. The RCHTX card has two FPGAs: user and RTOS. The RTOS FPGA manages the hardware interface and handles the communication between memory, the processors and the user FPGA. The user FPGA is the bigger processor and is completely available for application code. Although the user is limited to running one application at a time, switching codes takes only milliseconds.

Like DRC and XtremeData, Celoxica takes advantage of AMD’s Torrenza strategy which opens up the AMD64 platform to third-party coprocessors and allows them to be part of the computing fabric. Celoxica’s accelerator card uses a HyperTransport eXpansion (HTX) slot rather than plugging directly into an Opteron socket, but it still benefits from direct connection to the CPU and memory. AMD has positioned Torrenza as a big differentiator against Intel.

“Far from thinking of us as stealing sockets [from them], they’re seeing this as a way to steal sockets away from Intel,” explains Jussel. “AMD has been a big help as partner. They’ve tied us in with their server partners and helped support the qualification with people like HP.”

Although Celoxica is also part of Intel’s Geneseo initiative, which uses PCI Express (PCIe), Jussel says, at this point, HyperTransport is going to be the fastest way to connect to a processor. It gives you native bus speeds and avoids having to negotiate a bridge to an external bus, as would be necessary with a PCIe solution. Intel’s answer to HyperTransport, CSI (what Intel is now calling QuickPath), is in the works for 2008 for the company’s next-generation Nehalem architecture.

Intel has licensed its FSB technology to Xilinx, Celoxica’s partner, so an FPGA-FSB solution is a possibility. But, says Jussel, “It remains to be seen whether or not you can get FPGAs hooked up to it and the motherboards out before [Intel] comes out with CSI.”

As a matter of fact, a couple of vendors were able to do just that. At the Intel Developer Conference (IDF) on Tuesday, XtremeData and Nallatech unveiled FPGA acceleration modules that could be plugged into Intel Xeon-based platforms.

At IDF, XtremeData demonstrated their new XD2000i module connected to Intel’s FSB. The XD2000i is pin-compatible with the Xeon processor socket in dual- or quad-socket servers. The module will contain three Altera Stratix III FPGAs. Two are reserved for the application. The other one will be used as a bridge to manage FSB communication, allowing the user to reprogram the application FPGAs on the fly.

According to XtremeData, the module was designed to plug into any Xeon DP platform. The company prides itself on fitting into any supported system, by adhering to the CPU vendor’s recommended “keep-out space,” which allows the FPGA module to fit into anyone’s box. XtremeData’s module for AMD platforms works the same way, but in this case, via an AMD64 socket. “Anyplace you can fit an AMD Opteron, you can fit our module — without exception,” says Geno Valente, XtremeData’s VP of Marketing.

Like Celoxica, XtremeData is heavily focused in the financial service market. Many of their current customers are the big financial institutions, which tend to be shy about talking about their internal technology. But according to Valente, they have about 50 or 60 such customers. Most of them are using the XtremeData modules in Opteron sockets to accelerate Monte Carlo simulations for applications like options pricing.

“It’s in the grid, where there are 10,000 CPUs crunching away, playing war games with the bank across the street,” says Valente.

The XD2000i solution was made possible because Intel licensed its FSB intellectual property to XtremeData. When you buy the XD2000i module, it comes with the FSB IP in it, but encrypted so that no one else can tinker with it. Intel created a similar arrangement with Nallatech, a Scottish company that specializes in FPGA solutions, who also launched their FPGA-FSB solutions at IDF. Unlike XtremeData’s Altera hardware, the Nallatech module is based on Xilinx FPGAs.

The fact that Intel was willing to license the FSB technology suggests that the company realizes that there is value outside of the x86 universe that they can tap into. Intel’s Geneseo initiative for PCIe is another example of this line of thinking. The FPGA-FSB solutions give Intel some parity against FPGA-HyperTransport offerings for AMD-based platforms. But the introduction of CSI in 2008 should really level the playing field.

ISC 2024 Keynote: High-precision Computing Will Be a Foundation for AI Models

May 15, 2024

Some scientific computing applications cannot sacrifice accuracy and will always require high-precision computing. Therefore, conventional high-performance computing (HPC) will remain essential, even as many applicati Read more…

EuroHPC Expands: United Kingdom Joins as 35th Member

May 14, 2024

The United Kingdom has officially joined the EuroHPC Joint Undertaking, becoming the 35th member state. This was confirmed after the 38th Governing Board meeting, and it's set to enhance Europe's supercomputing capabilit Read more…

Linux Foundation Announces the Launch of the High-Performance Software Foundation

May 14, 2024

The Linux Foundation, the nonprofit organization enabling mass innovation through open source, is excited to announce the launch of the High-Performance Software Foundation (HPSF). The announcement was made at the ISC Read more…

Nvidia Showcases Work with Quantum Centers at ISC 2024

May 13, 2024

With quantum computing surging in Europe, Nvidia took advantage of ISC 2024 to showcase its efforts working with quantum development centers. Currently, Nvidia GPUs are dominant inside classical systems used for quantum Read more…

ISC 2024: Hyperion Research Predicts HPC Market Rebound after Flat 2023

May 13, 2024

First, the top line: the overall HPC market was flat in 2023 at roughly $37 billion, bogged down by supply chain issues and slowed acceptance of some larger systems (e.g. exascale), according to Hyperion Research’s ann Read more…

Top 500: Aurora Breaks into Exascale, but Can’t Get to the Frontier of HPC

May 13, 2024

The 63rd installment of the TOP500 list is available today in coordination with the kickoff of ISC 2024 in Hamburg, Germany. Once again, the Frontier system at Oak Ridge National Laboratory in Tennessee, USA, retains its Read more…

ISC 2024 Keynote: High-precision Computing Will Be a Foundation for AI Models

May 15, 2024

Some scientific computing applications cannot sacrifice accuracy and will always require high-precision computing. Therefore, conventional high-performance c Read more…

Linux Foundation Announces the Launch of the High-Performance Software Foundation

May 14, 2024

The Linux Foundation, the nonprofit organization enabling mass innovation through open source, is excited to announce the launch of the High-Performance Softw Read more…

ISC 2024: Hyperion Research Predicts HPC Market Rebound after Flat 2023

May 13, 2024

First, the top line: the overall HPC market was flat in 2023 at roughly $37 billion, bogged down by supply chain issues and slowed acceptance of some larger sys Read more…

Top 500: Aurora Breaks into Exascale, but Can’t Get to the Frontier of HPC

May 13, 2024

The 63rd installment of the TOP500 list is available today in coordination with the kickoff of ISC 2024 in Hamburg, Germany. Once again, the Frontier system at Read more…

ISC Preview: Focus Will Be on Top500 and HPC Diversity

May 9, 2024

Last year's Supercomputing 2023 in November had record attendance, but the direction of high-performance computing was a hot topic on the floor. Expect more of Read more…

Illinois Considers $20 Billion Quantum Manhattan Project Says Report

May 7, 2024

There are multiple reports that Illinois governor Jay Robert Pritzker is considering a $20 billion Quantum Manhattan-like project for the Chicago area. Accordin Read more…

The NASA Black Hole Plunge

May 7, 2024

We have all thought about it. No one has done it, but now, thanks to HPC, we see what it looks like. Hold on to your feet because NASA has released videos of wh Read more…

How Nvidia Could Use $700M Run.ai Acquisition for AI Consumption

May 6, 2024

Nvidia is touching $2 trillion in market cap purely on the brute force of its GPU sales, and there's room for the company to grow with software. The company hop Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

Eyes on the Quantum Prize – D-Wave Says its Time is Now

January 30, 2024

Early quantum computing pioneer D-Wave again asserted – that at least for D-Wave – the commercial quantum era has begun. Speaking at its first in-person Ana Read more…

The GenAI Datacenter Squeeze Is Here

February 1, 2024

The immediate effect of the GenAI GPU Squeeze was to reduce availability, either direct purchase or cloud access, increase cost, and push demand through the roof. A secondary issue has been developing over the last several years. Even though your organization secured several racks... Read more…

The NASA Black Hole Plunge

May 7, 2024

We have all thought about it. No one has done it, but now, thanks to HPC, we see what it looks like. Hold on to your feet because NASA has released videos of wh Read more…

Intel Plans Falcon Shores 2 GPU Supercomputing Chip for 2026 

August 8, 2023

Intel is planning to onboard a new version of the Falcon Shores chip in 2026, which is code-named Falcon Shores 2. The new product was announced by CEO Pat Gel Read more…

GenAI Having Major Impact on Data Culture, Survey Says

February 21, 2024

While 2023 was the year of GenAI, the adoption rates for GenAI did not match expectations. Most organizations are continuing to invest in GenAI but are yet to Read more…

Q&A with Nvidia’s Chief of DGX Systems on the DGX-GB200 Rack-scale System

March 27, 2024

Pictures of Nvidia's new flagship mega-server, the DGX GB200, on the GTC show floor got favorable reactions on social media for the sheer amount of computing po Read more…

How the Chip Industry is Helping a Battery Company

May 8, 2024

Chip companies, once seen as engineering pure plays, are now at the center of geopolitical intrigue. Chip manufacturing firms, especially TSMC and Intel, have b Read more…

Click Here for More Headlines

HPCwire is a registered trademark of Tabor Communications, Inc. Use of this site is governed by our Terms of Use and Privacy Policy.

Reproduction in whole or in part in any form or medium without express written permission of Tabor Communications, Inc. is prohibited.

Leading Solution Providers

Off The Wire

Industry Headlines

May 14, 2024

May 13, 2024

Subscribe to HPCwire's Weekly Update!

ISC 2024 Keynote: High-precision Computing Will Be a Foundation for AI Models

EuroHPC Expands: United Kingdom Joins as 35th Member

Linux Foundation Announces the Launch of the High-Performance Software Foundation

Nvidia Showcases Work with Quantum Centers at ISC 2024

ISC 2024: Hyperion Research Predicts HPC Market Rebound after Flat 2023

Top 500: Aurora Breaks into Exascale, but Can’t Get to the Frontier of HPC

ISC 2024 Keynote: High-precision Computing Will Be a Foundation for AI Models

Linux Foundation Announces the Launch of the High-Performance Software Foundation

ISC 2024: Hyperion Research Predicts HPC Market Rebound after Flat 2023

Top 500: Aurora Breaks into Exascale, but Can’t Get to the Frontier of HPC

ISC Preview: Focus Will Be on Top500 and HPC Diversity

Illinois Considers $20 Billion Quantum Manhattan Project Says Report

The NASA Black Hole Plunge

How Nvidia Could Use $700M Run.ai Acquisition for AI Consumption

Nvidia H100: Are 550,000 GPUs Enough for This Year?

Synopsys Eats Ansys: Does HPC Get Indigestion?

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

Choosing the Right GPU for LLM Inference and Training

Intel’s Server and PC Chip Development Will Blur After 2025

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

How AMD May Get Across the CUDA Moat

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

Leading Solution Providers

Contributors

Tiffany Trader

Editorial Director

Douglas Eadline

Managing Editor

John Russell

Senior Editor

Kevin Jackson

Contributing Editor

Ali Azhar

Contributing Editor

Alex Woodie

Contributing Editor

Addison Snell

Contributing Editor

Drew Jolly

Assistant Editor

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

Eyes on the Quantum Prize – D-Wave Says its Time is Now

The GenAI Datacenter Squeeze Is Here

The NASA Black Hole Plunge

Intel Plans Falcon Shores 2 GPU Supercomputing Chip for 2026

GenAI Having Major Impact on Data Culture, Survey Says

Q&A with Nvidia’s Chief of DGX Systems on the DGX-GB200 Rack-scale System

How the Chip Industry is Helping a Battery Company

The Information Nexus of Advanced Computing and Data systems for a High Performance World

Share

Copy short link

Intel Plans Falcon Shores 2 GPU Supercomputing Chip for 2026