IBM Debuts IC922 Power Server for AI Inferencing and Data Management

By John Russell

January 28, 2020

IBM today launched a Power9-based inference server – the IC922 – that features up to six Nvidia T4 GPUs, PCIe Gen 4 and OpenCAPI connectivity, and can accommodate up to 24 SFF drives in a 2U form factor. Paired with IBM’s AC922, which uses Nvidia V100 GPUs, IBM says it now offers a complete solution for AI workloads encompassing data management, training, and inferencing. IBM also says the new IC922 is priced at parity or better than comparable x86-based offerings.

The use of T4 GPUs leverages Turing Tensor cores for their varied mixed precision capabilities (FP32, FP16, INT8, INT4) best suited for inferencing and T4’s lower cost. Taken together, this helps IBM attack what analysts say is the fastest growing AI market segment and likely to become the largest by volume. By contrast, the AC922 leverages Nvidia V100 GPUs, which are better suited for traditional HPC and AI training workloads and more costly. The AC922 is famously built using the same architecture as Summit supercomputer, currently the fastest supercomputer in the world as ranked by the Top500 List (November 2019).

“The IC922 is focused on data, inferencing and cloud,” said Dylan Boday, IBM director of offering management, cognitive and scale-out systems, in a pre-briefing with HPCwire. “We’ll be able to drive up to 24 small form factor drives, and including in the not too distant future, 24 NVMe drives. When you combine 24 NVMe drives plus PCIe Gen 4 out to your network you have a very powerful story from a balanced perspective.

“At the rack level you get very high throughputs. This is interesting for AI because many people are starting to look at storage deployments and their tier hierarchy. You need ‘warm’ or low latency access to some storage capabilities. Secondly, launching it with up to six T4 Nvidia GPUs gives the clients flexibility [and] in the very near future we’re going to be going to eight [T4s], which will give you 33 percent better GPU density, than HP or Dell servers will be able to do in a 2U server.”

IBM also argues it’s able to leverage its threads-per-core advantage both generally and for container performance.

The new system will be available on February 7. IBM reports it is still “investigating expanding IC922 into the IBM public cloud in the future.” Official announcement of the IC922 came in a blog (Complete your AI puzzle with inference) today by Grace Liu, principal offering manager, Linux Infrastructure.

IBM has been promising a renewed product push in AI and the IC922 is likely just the first. “Our Linux focus market is one that is [delivering] a portfolio for the AI era,” Boday said. Many AI projects are failing, he contends, and one reason is the difficulty making the transition from a controlled training environment to the more chaotic data ingest and inference environment where compute requirements and skills are different. The IC922, he said, is optimized for inferencing and data management and will make the transition easier. Its modular design allows organizations to scale infrastructure to meet needs whether on- premises or in a private cloud environment.

Software, of course, is another key. At SC19, IBM promoted its Bayesian software expertise as an AI enabler. In conjunction with the IC922 announcement, Boday said, “We’re going to introduce an inferencing software [it] basically allows you to operationalize your inferencing.” Few details were discussed at the briefing and in response to an emailed question about those plans, IBM responded, “IBM believes that just as training required specialized software, so does AI inference. Our Watson Machine Learning Accelerator product family continues to evolve to leverage the latest capabilities of IBM Power Systems for AI, and we expect that to continue for inference.”

That sounds like a stay tuned message. Shown below are top line bullets from the official announcement:

While the immediate IC922 focus is on using T4s, IBM noted plans to support other accelerator types.

“I’m not going to discuss all the details,” said Boday. “There are some statements of direction around FPGAs from Xilinx and other ASIC capabilities, as those devices are moving to PCIe Gen 4. This is kind of that future-proof box, if they want to start to leverage an FPGA as an inferencing, or even a training device. There are hundreds of different acceleration capabilities coming into the market quite rapidly. This system should be able to capture them. As clients demands increase we’re able to respond in an agile method to add those to our server, and provide the best-of-breed solution for those types of acceleration capabilities.”

Unlike the AC922, which offers NVLink for CPU-GPU communication, the LC922 uses PCIe 4. “In AC922, we have NVLink – that’s because of the form factor and the capabilities built into the Nvidia Volta. There’s less demand on overall throughput to these types of [training systems],” said Boday. IBM chose to leverage PCIe density advantages for the IC922 and to provide OpenCAPI capability for future devices. In recent months there has been a fair amount of discussion around OpenCAPI and the newer CXL standard led by Intel with speculation around bringing compatibility between the two.

Boday said, “CXL is not a commercially viable technology at this point. What I would say is CXL is definitely on our radar. We have a board seat within the CXL Foundation. So as that gets more and more traction, we’re going to have a significant voice of influence there. I would argue that IBM [started activities] for coherence in acceleration several years ago with CAPI and OpenCAPI. Speaking to this box, specifically, it will have OpenCAPI capabilities. This is actually the first box that has OpenCAPI capabilities commercially available, and what we’ll see is the ability for developers to start to leverage a coherent, high throughput, low latency interface for all kinds of new devices.”

IBM reports it will soon have a developer board. “One of the first things we’re going to do is enable the marketplace with a Bittware FPGA-based a card. It’ll be available in the near future as well. That allows developers to take advantage of the low latency/high throughput, and then we’ll even have a card for them to start exploring on that as well in the very near future.”

How the new offering fits into the broader AI go-to-market strategy articulated by IBM exec Dave Turek at SC19 isn’t entirely clear. He suggested a strategy in which IBM would provide smaller AI systems able to leverage a client’s existing infrastructure to improve system and application performance. (For more see HPCwire article, SC19: IBM Changes Its HPC-AI Game Plan).

Liu wrote in her blog, “To showcase how the IC922 fits into the AI puzzle, the Department of Defense High Performance Computing Modernization Program (HPCMP) recently demonstrated how the IC922 and AC922 could be combined into a modular computing platform, creating an IBM POWER9-based supercomputer in a shipping container. This modular computing capability, initially installed at the U.S. Army Combat Capabilities Development Command’s Army Research Laboratory DoD Supercomputing Resource Center, will enable the DoD to redefine the term ‘edge’ to include deployment of an AI supercomputing capability anywhere in the world, including the battlefield.”

In a sense, this use of edge could encompass deployments similar to what Turek suggested in which IBM brings an AI cluster – as small as a single node, said Turek – to enhance performance of infrastructure already in place. He also implied IBM would offer AI systems specialized around specific functions such as security and systems management. Perhaps that’s a next step, with AC922-IC922 combinations offered to “supercharge” existing infrastructure.

Link to IBM blog: https://www.ibm.com/blogs/systems/complete-your-ai-puzzle-with-inference/

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Nvidia-Arm Deal a Boon for RISC-V?

October 26, 2020

The $40 billion blockbuster acquisition deal that will bring chip maker Arm into the Nvidia corporate family could provide a boost for the competing RISC-V architecture. As regulators in the U.S., China and the Europe Read more…

By George Leopold

OpenHPC Progress Report – v2.0, More Recipes, Cloud and Arm Support, Says Schulz

October 26, 2020

Launched in late 2015 and transitioned to a Linux Foundation Project in 2016, OpenHPC has marched quietly but steadily forward. Its goal “to provide a reference collection of open-source HPC software components and bes Read more…

By John Russell

NASA Uses Supercomputing to Measure Carbon in the World’s Trees

October 22, 2020

Trees constitute one of the world’s most important carbon sinks, pulling enormous amounts of carbon dioxide from the atmosphere and storing the carbon in their trunks and the surrounding soil. Measuring this carbon sto Read more…

By Oliver Peckham

Nvidia Dominates (Again) Latest MLPerf Inference Results

October 22, 2020

The two-year-old AI benchmarking group MLPerf.org released its second set of inferencing results yesterday and again, as in the most recent MLPerf training results (July 2020), it was almost entirely The Nvidia Show, a p Read more…

By John Russell

With Optane Gaining, Intel Exits NAND Flash

October 21, 2020

In a sign that its 3D XPoint memory technology is gaining traction, Intel Corp. is departing the NAND flash memory and storage market with the sale of its manufacturing base in China to SK Hynix of South Korea. The $9 Read more…

By George Leopold

AWS Solution Channel

Live Webinar: AWS & Intel Research Webinar Series – Fast scaling research workloads on the cloud

Date: 27 Oct – 5 Nov

Join us for the AWS and Intel Research Webinar series.

You will learn how we help researchers process complex workloads, quickly analyze massive data pipelines, store petabytes of data, and advance research using transformative technologies. Read more…

Intel® HPC + AI Pavilion

Berlin Institute of Health: Putting HPC to Work for the World

Researchers from the Center for Digital Health at the Berlin Institute of Health (BIH) are using science to understand the pathophysiology of COVID-19, which can help to inform the development of targeted treatments. Read more…

HPE, AMD and EuroHPC Partner for Pre-Exascale LUMI Supercomputer

October 21, 2020

Not even a week after Nvidia announced that it would be providing hardware for the first four of the eight planned EuroHPC systems, HPE and AMD are announcing another major EuroHPC design win. Finnish supercomputing cent Read more…

By Oliver Peckham

OpenHPC Progress Report – v2.0, More Recipes, Cloud and Arm Support, Says Schulz

October 26, 2020

Launched in late 2015 and transitioned to a Linux Foundation Project in 2016, OpenHPC has marched quietly but steadily forward. Its goal “to provide a referen Read more…

By John Russell

Nvidia Dominates (Again) Latest MLPerf Inference Results

October 22, 2020

The two-year-old AI benchmarking group MLPerf.org released its second set of inferencing results yesterday and again, as in the most recent MLPerf training resu Read more…

By John Russell

HPE, AMD and EuroHPC Partner for Pre-Exascale LUMI Supercomputer

October 21, 2020

Not even a week after Nvidia announced that it would be providing hardware for the first four of the eight planned EuroHPC systems, HPE and AMD are announcing a Read more…

By Oliver Peckham

HPE to Build Australia’s Most Powerful Supercomputer for Pawsey

October 20, 2020

The Pawsey Supercomputing Centre in Perth, Western Australia, has had a busy year. Pawsey typically spends much of its time looking to the stars, working with a Read more…

By Oliver Peckham

DDN-Tintri Showcases Technology Integration with Two New Products

October 20, 2020

DDN, a long-time leader in HPC storage, announced two new products today and provided more detail around its strategy for integrating DDN HPC technologies with Read more…

By John Russell

Is the Nvidia A100 GPU Performance Worth a Hardware Upgrade?

October 16, 2020

Over the last decade, accelerators have seen an increasing rate of adoption in high-performance computing (HPC) platforms, and in the June 2020 Top500 list, eig Read more…

By Hartwig Anzt, Ahmad Abdelfattah and Jack Dongarra

Nvidia and EuroHPC Team for Four Supercomputers, Including Massive ‘Leonardo’ System

October 15, 2020

The EuroHPC Joint Undertaking (JU) serves as Europe’s concerted supercomputing play, currently comprising 32 member states and billions of euros in funding. I Read more…

By Oliver Peckham

ROI: Is HPC Worth It? What Can We Actually Measure?

October 15, 2020

HPC enables innovation and discovery. We all seem to agree on that. Is there a good way to quantify how much that’s worth? Thanks to a sponsored white pape Read more…

By Addison Snell, Intersect360 Research

Supercomputer-Powered Research Uncovers Signs of ‘Bradykinin Storm’ That May Explain COVID-19 Symptoms

July 28, 2020

Doctors and medical researchers have struggled to pinpoint – let alone explain – the deluge of symptoms induced by COVID-19 infections in patients, and what Read more…

By Oliver Peckham

Nvidia Said to Be Close on Arm Deal

August 3, 2020

GPU leader Nvidia Corp. is in talks to buy U.K. chip designer Arm from parent company Softbank, according to several reports over the weekend. If consummated Read more…

By George Leopold

Intel’s 7nm Slip Raises Questions About Ponte Vecchio GPU, Aurora Supercomputer

July 30, 2020

During its second-quarter earnings call, Intel announced a one-year delay of its 7nm process technology, which it says it will create an approximate six-month shift for its CPU product timing relative to prior expectations. The primary issue is a defect mode in the 7nm process that resulted in yield degradation... Read more…

By Tiffany Trader

Google Hires Longtime Intel Exec Bill Magro to Lead HPC Strategy

September 18, 2020

In a sign of the times, another prominent HPCer has made a move to a hyperscaler. Longtime Intel executive Bill Magro joined Google as chief technologist for hi Read more…

By Tiffany Trader

HPE Keeps Cray Brand Promise, Reveals HPE Cray Supercomputing Line

August 4, 2020

The HPC community, ever-affectionate toward Cray and its eponymous founder, can breathe a (virtual) sigh of relief. The Cray brand will live on, encompassing th Read more…

By Tiffany Trader

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

By Doug Black

Aurora’s Troubles Move Frontier into Pole Exascale Position

October 1, 2020

Intel’s 7nm node delay has raised questions about the status of the Aurora supercomputer that was scheduled to be stood up at Argonne National Laboratory next year. Aurora was in the running to be the United States’ first exascale supercomputer although it was on a contemporaneous timeline with... Read more…

By Tiffany Trader

Is the Nvidia A100 GPU Performance Worth a Hardware Upgrade?

October 16, 2020

Over the last decade, accelerators have seen an increasing rate of adoption in high-performance computing (HPC) platforms, and in the June 2020 Top500 list, eig Read more…

By Hartwig Anzt, Ahmad Abdelfattah and Jack Dongarra

Leading Solution Providers

Contributors

European Commission Declares €8 Billion Investment in Supercomputing

September 18, 2020

Just under two years ago, the European Commission formalized the EuroHPC Joint Undertaking (JU): a concerted HPC effort (comprising 32 participating states at c Read more…

By Oliver Peckham

Nvidia and EuroHPC Team for Four Supercomputers, Including Massive ‘Leonardo’ System

October 15, 2020

The EuroHPC Joint Undertaking (JU) serves as Europe’s concerted supercomputing play, currently comprising 32 member states and billions of euros in funding. I Read more…

By Oliver Peckham

Google Cloud Debuts 16-GPU Ampere A100 Instances

July 7, 2020

On the heels of the Nvidia’s Ampere A100 GPU launch in May, Google Cloud is announcing alpha availability of the A100 “Accelerator Optimized” VM A2 instance family on Google Compute Engine. The instances are powered by the HGX A100 16-GPU platform, which combines two HGX A100 8-GPU baseboards using... Read more…

By Tiffany Trader

Microsoft Azure Adds A100 GPU Instances for ‘Supercomputer-Class AI’ in the Cloud

August 19, 2020

Microsoft Azure continues to infuse its cloud platform with HPC- and AI-directed technologies. Today the cloud services purveyor announced a new virtual machine Read more…

By Tiffany Trader

Oracle Cloud Infrastructure Powers Fugaku’s Storage, Scores IO500 Win

August 28, 2020

In June, RIKEN shook the supercomputing world with its Arm-based, Fujitsu-built juggernaut: Fugaku. The system, which weighs in at 415.5 Linpack petaflops, topp Read more…

By Oliver Peckham

DOD Orders Two AI-Focused Supercomputers from Liqid

August 24, 2020

The U.S. Department of Defense is making a big investment in data analytics and AI computing with the procurement of two HPC systems that will provide the High Read more…

By Tiffany Trader

HPE, AMD and EuroHPC Partner for Pre-Exascale LUMI Supercomputer

October 21, 2020

Not even a week after Nvidia announced that it would be providing hardware for the first four of the eight planned EuroHPC systems, HPE and AMD are announcing a Read more…

By Oliver Peckham

Oracle Cloud Deepens HPC Embrace with Launch of A100 Instances, Plans for Arm, More 

September 22, 2020

Oracle Cloud Infrastructure (OCI) continued its steady ramp-up of HPC capabilities today with a flurry of announcements. Topping the list is general availabilit Read more…

By John Russell

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This