IBM Debuts IC922 Power Server for AI Inferencing and Data Management

By John Russell

January 28, 2020

IBM today launched a Power9-based inference server – the IC922 – that features up to six Nvidia T4 GPUs, PCIe Gen 4 and OpenCAPI connectivity, and can accommodate up to 24 SFF drives in a 2U form factor. Paired with IBM’s AC922, which uses Nvidia V100 GPUs, IBM says it now offers a complete solution for AI workloads encompassing data management, training, and inferencing. IBM also says the new IC922 is priced at parity or better than comparable x86-based offerings.

The use of T4 GPUs leverages Turing Tensor cores for their varied mixed precision capabilities (FP32, FP16, INT8, INT4) best suited for inferencing and T4’s lower cost. Taken together, this helps IBM attack what analysts say is the fastest growing AI market segment and likely to become the largest by volume. By contrast, the AC922 leverages Nvidia V100 GPUs, which are better suited for traditional HPC and AI training workloads and more costly. The AC922 is famously built using the same architecture as Summit supercomputer, currently the fastest supercomputer in the world as ranked by the Top500 List (November 2019).

“The IC922 is focused on data, inferencing and cloud,” said Dylan Boday, IBM director of offering management, cognitive and scale-out systems, in a pre-briefing with HPCwire. “We’ll be able to drive up to 24 small form factor drives, and including in the not too distant future, 24 NVMe drives. When you combine 24 NVMe drives plus PCIe Gen 4 out to your network you have a very powerful story from a balanced perspective.

“At the rack level you get very high throughputs. This is interesting for AI because many people are starting to look at storage deployments and their tier hierarchy. You need ‘warm’ or low latency access to some storage capabilities. Secondly, launching it with up to six T4 Nvidia GPUs gives the clients flexibility [and] in the very near future we’re going to be going to eight [T4s], which will give you 33 percent better GPU density, than HP or Dell servers will be able to do in a 2U server.”

IBM also argues it’s able to leverage its threads-per-core advantage both generally and for container performance.

The new system will be available on February 7. IBM reports it is still “investigating expanding IC922 into the IBM public cloud in the future.” Official announcement of the IC922 came in a blog (Complete your AI puzzle with inference) today by Grace Liu, principal offering manager, Linux Infrastructure.

IBM has been promising a renewed product push in AI and the IC922 is likely just the first. “Our Linux focus market is one that is [delivering] a portfolio for the AI era,” Boday said. Many AI projects are failing, he contends, and one reason is the difficulty making the transition from a controlled training environment to the more chaotic data ingest and inference environment where compute requirements and skills are different. The IC922, he said, is optimized for inferencing and data management and will make the transition easier. Its modular design allows organizations to scale infrastructure to meet needs whether on- premises or in a private cloud environment.

Software, of course, is another key. At SC19, IBM promoted its Bayesian software expertise as an AI enabler. In conjunction with the IC922 announcement, Boday said, “We’re going to introduce an inferencing software [it] basically allows you to operationalize your inferencing.” Few details were discussed at the briefing and in response to an emailed question about those plans, IBM responded, “IBM believes that just as training required specialized software, so does AI inference. Our Watson Machine Learning Accelerator product family continues to evolve to leverage the latest capabilities of IBM Power Systems for AI, and we expect that to continue for inference.”

That sounds like a stay tuned message. Shown below are top line bullets from the official announcement:

While the immediate IC922 focus is on using T4s, IBM noted plans to support other accelerator types.

“I’m not going to discuss all the details,” said Boday. “There are some statements of direction around FPGAs from Xilinx and other ASIC capabilities, as those devices are moving to PCIe Gen 4. This is kind of that future-proof box, if they want to start to leverage an FPGA as an inferencing, or even a training device. There are hundreds of different acceleration capabilities coming into the market quite rapidly. This system should be able to capture them. As clients demands increase we’re able to respond in an agile method to add those to our server, and provide the best-of-breed solution for those types of acceleration capabilities.”

Unlike the AC922, which offers NVLink for CPU-GPU communication, the LC922 uses PCIe 4. “In AC922, we have NVLink – that’s because of the form factor and the capabilities built into the Nvidia Volta. There’s less demand on overall throughput to these types of [training systems],” said Boday. IBM chose to leverage PCIe density advantages for the IC922 and to provide OpenCAPI capability for future devices. In recent months there has been a fair amount of discussion around OpenCAPI and the newer CXL standard led by Intel with speculation around bringing compatibility between the two.

Boday said, “CXL is not a commercially viable technology at this point. What I would say is CXL is definitely on our radar. We have a board seat within the CXL Foundation. So as that gets more and more traction, we’re going to have a significant voice of influence there. I would argue that IBM [started activities] for coherence in acceleration several years ago with CAPI and OpenCAPI. Speaking to this box, specifically, it will have OpenCAPI capabilities. This is actually the first box that has OpenCAPI capabilities commercially available, and what we’ll see is the ability for developers to start to leverage a coherent, high throughput, low latency interface for all kinds of new devices.”

IBM reports it will soon have a developer board. “One of the first things we’re going to do is enable the marketplace with a Bittware FPGA-based a card. It’ll be available in the near future as well. That allows developers to take advantage of the low latency/high throughput, and then we’ll even have a card for them to start exploring on that as well in the very near future.”

How the new offering fits into the broader AI go-to-market strategy articulated by IBM exec Dave Turek at SC19 isn’t entirely clear. He suggested a strategy in which IBM would provide smaller AI systems able to leverage a client’s existing infrastructure to improve system and application performance. (For more see HPCwire article, SC19: IBM Changes Its HPC-AI Game Plan).

Liu wrote in her blog, “To showcase how the IC922 fits into the AI puzzle, the Department of Defense High Performance Computing Modernization Program (HPCMP) recently demonstrated how the IC922 and AC922 could be combined into a modular computing platform, creating an IBM POWER9-based supercomputer in a shipping container. This modular computing capability, initially installed at the U.S. Army Combat Capabilities Development Command’s Army Research Laboratory DoD Supercomputing Resource Center, will enable the DoD to redefine the term ‘edge’ to include deployment of an AI supercomputing capability anywhere in the world, including the battlefield.”

In a sense, this use of edge could encompass deployments similar to what Turek suggested in which IBM brings an AI cluster – as small as a single node, said Turek – to enhance performance of infrastructure already in place. He also implied IBM would offer AI systems specialized around specific functions such as security and systems management. Perhaps that’s a next step, with AC922-IC922 combinations offered to “supercharge” existing infrastructure.

Link to IBM blog: https://www.ibm.com/blogs/systems/complete-your-ai-puzzle-with-inference/

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

HPC Career Notes: August 2021 Edition

August 4, 2021

In this monthly feature, we’ll keep you up-to-date on the latest career developments for individuals in the high-performance computing community. Whether it’s a promotion, new company hire, or even an accolade, we’ Read more…

The Promise (and Necessity) of Runtime Systems like Charm++ in Exascale Power Management

August 4, 2021

Big heterogeneous computer systems, especially forthcoming exascale computers, are power hungry and difficult to program effectively. This is, of course, not an unrecognized problem. In a recent blog, Charmworks’ CEO S Read more…

Digging into the Atos-Nimbix Deal: Big US HPC and Global Cloud Aspirations. Look out HPE?

August 2, 2021

Behind Atos’s deal announced last week to acquire HPC-cloud specialist Nimbix are ramped-up plans to penetrate the U.S. HPC market and global expansion of its HPC cloud capabilities. Nimbix will become “an Atos HPC c Read more…

Berkeley Lab Makes Strides in Autonomous Discovery to Tackle the Data Deluge

August 2, 2021

Data production is outpacing the human capacity to process said data. Whether a giant radio telescope, a new particle accelerator or lidar data from autonomous cars, the sheer scale of the data generated is increasingly Read more…

Verifying the Universe with Exascale Computers

July 30, 2021

The ExaSky project, one of the critical Earth and Space Science applications being solved by the US Department of Energy’s (DOE’s) Exascale Computing Project (ECP), is preparing to use the nation’s forthcoming exas Read more…

AWS Solution Channel

Pushing pixels, not data with NICE DCV

NICE DCV, our high-performance, low-latency remote-display protocol, was originally created for scientists and engineers who ran large workloads on far-away supercomputers, but needed to visualize data without moving it. Read more…

What’s After Exascale? The Internet of Workflows Says HPE’s Nicolas Dubé

July 29, 2021

With the race to exascale computing in its final leg, it’s natural to wonder what the Post Exascale Era will look like. Nicolas Dubé, VP and chief technologist for HPE’s HPC business unit, agrees and shared his vision at Supercomputing Frontiers Europe 2021 held last week. The next big thing, he told the virtual audience at SFE21, is something that will connect HPC and (broadly) all of IT – into what Dubé calls The Internet of Workflows. Read more…

Digging into the Atos-Nimbix Deal: Big US HPC and Global Cloud Aspirations. Look out HPE?

August 2, 2021

Behind Atos’s deal announced last week to acquire HPC-cloud specialist Nimbix are ramped-up plans to penetrate the U.S. HPC market and global expansion of its Read more…

What’s After Exascale? The Internet of Workflows Says HPE’s Nicolas Dubé

July 29, 2021

With the race to exascale computing in its final leg, it’s natural to wonder what the Post Exascale Era will look like. Nicolas Dubé, VP and chief technologist for HPE’s HPC business unit, agrees and shared his vision at Supercomputing Frontiers Europe 2021 held last week. The next big thing, he told the virtual audience at SFE21, is something that will connect HPC and (broadly) all of IT – into what Dubé calls The Internet of Workflows. Read more…

How UK Scientists Developed Transformative, HPC-Powered Coronavirus Sequencing System

July 29, 2021

In November 2020, the COVID-19 Genomics UK Consortium (COG-UK) won the HPCwire Readers’ Choice Award for Best HPC Collaboration for its CLIMB-COVID sequencing project. Launched in March 2020, CLIMB-COVID has now resulted in the sequencing of over 675,000 coronavirus genomes – an increasingly critical task as variants like Delta threaten the tenuous prospect of a return to normalcy in much of the world. Read more…

IBM and University of Tokyo Roll Out Quantum System One in Japan

July 27, 2021

IBM and the University of Tokyo today unveiled an IBM Quantum System One as part of the IBM-Japan quantum program announced in 2019. The system is the second IB Read more…

Intel Unveils New Node Names; Sapphire Rapids Is Now an ‘Intel 7’ CPU

July 27, 2021

What's a preeminent chip company to do when its process node technology lags the competition by (roughly) one generation, but outmoded naming conventions make it seem like it's two nodes behind? For Intel, the response was to change how it refers to its nodes with the aim of better reflecting its positioning within the leadership semiconductor manufacturing space. Intel revealed its new node nomenclature, and... Read more…

Will Approximation Drive Post-Moore’s Law HPC Gains?

July 26, 2021

“Hardware-based improvements are going to get more and more difficult,” said Neil Thompson, an innovation scholar at MIT’s Computer Science and Artificial Intelligence Lab (CSAIL). “I think that’s something that this crowd will probably, actually, be already familiar with.” Thompson, speaking... Read more…

With New Owner and New Roadmap, an Independent Omni-Path Is Staging a Comeback

July 23, 2021

Put on a shelf by Intel in 2019, Omni-Path faced a uncertain future, but under new custodian Cornelis Networks, OmniPath is looking to make a comeback as an independent high-performance interconnect solution. A "significant refresh" – called Omni-Path Express – is coming later this year according to the company. Cornelis Networks formed last September as a spinout of Intel's Omni-Path division. Read more…

Chameleon’s HPC Testbed Sharpens Its Edge, Presses ‘Replay’

July 22, 2021

“One way of saying what I do for a living is to say that I develop scientific instruments,” said Kate Keahey, a senior fellow at the University of Chicago a Read more…

AMD Chipmaker TSMC to Use AMD Chips for Chipmaking

May 8, 2021

TSMC has tapped AMD to support its major manufacturing and R&D workloads. AMD will provide its Epyc Rome 7702P CPUs – with 64 cores operating at a base cl Read more…

Berkeley Lab Debuts Perlmutter, World’s Fastest AI Supercomputer

May 27, 2021

A ribbon-cutting ceremony held virtually at Berkeley Lab's National Energy Research Scientific Computing Center (NERSC) today marked the official launch of Perlmutter – aka NERSC-9 – the GPU-accelerated supercomputer built by HPE in partnership with Nvidia and AMD. Read more…

Intel Launches 10nm ‘Ice Lake’ Datacenter CPU with Up to 40 Cores

April 6, 2021

The wait is over. Today Intel officially launched its 10nm datacenter CPU, the third-generation Intel Xeon Scalable processor, codenamed Ice Lake. With up to 40 Read more…

Ahead of ‘Dojo,’ Tesla Reveals Its Massive Precursor Supercomputer

June 22, 2021

In spring 2019, Tesla made cryptic reference to a project called Dojo, a “super-powerful training computer” for video data processing. Then, in summer 2020, Tesla CEO Elon Musk tweeted: “Tesla is developing a [neural network] training computer called Dojo to process truly vast amounts of video data. It’s a beast! … A truly useful exaflop at de facto FP32.” Read more…

Google Launches TPU v4 AI Chips

May 20, 2021

Google CEO Sundar Pichai spoke for only one minute and 42 seconds about the company’s latest TPU v4 Tensor Processing Units during his keynote at the Google I Read more…

CentOS Replacement Rocky Linux Is Now in GA and Under Independent Control

June 21, 2021

The Rocky Enterprise Software Foundation (RESF) is announcing the general availability of Rocky Linux, release 8.4, designed as a drop-in replacement for the soon-to-be discontinued CentOS. The GA release is launching six-and-a-half months after Red Hat deprecated its support for the widely popular, free CentOS server operating system. The Rocky Linux development effort... Read more…

Iran Gains HPC Capabilities with Launch of ‘Simorgh’ Supercomputer

May 18, 2021

Iran is said to be developing domestic supercomputing technology to advance the processing of scientific, economic, political and military data, and to strengthen the nation’s position in the age of AI and big data. On Sunday, Iran unveiled the Simorgh supercomputer, which will deliver.... Read more…

HPE Launches Storage Line Loaded with IBM’s Spectrum Scale File System

April 6, 2021

HPE today launched a new family of storage solutions bundled with IBM’s Spectrum Scale Erasure Code Edition parallel file system (description below) and featu Read more…

Leading Solution Providers

Contributors

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?

January 13, 2021

The rapid adoption of Julia, the open source, high level programing language with roots at MIT, shows no sign of slowing according to data from Julialang.org. I Read more…

GTC21: Nvidia Launches cuQuantum; Dips a Toe in Quantum Computing

April 13, 2021

Yesterday Nvidia officially dipped a toe into quantum computing with the launch of cuQuantum SDK, a development platform for simulating quantum circuits on GPU-accelerated systems. As Nvidia CEO Jensen Huang emphasized in his keynote, Nvidia doesn’t plan to build... Read more…

AMD-Xilinx Deal Gains UK, EU Approvals — China’s Decision Still Pending

July 1, 2021

AMD’s planned acquisition of FPGA maker Xilinx is now in the hands of Chinese regulators after needed antitrust approvals for the $35 billion deal were receiv Read more…

Microsoft to Provide World’s Most Powerful Weather & Climate Supercomputer for UK’s Met Office

April 22, 2021

More than 14 months ago, the UK government announced plans to invest £1.2 billion ($1.56 billion) into weather and climate supercomputing, including procuremen Read more…

Quantum Roundup: IBM, Rigetti, Phasecraft, Oxford QC, China, and More

July 13, 2021

IBM yesterday announced a proof for a quantum ML algorithm. A week ago, it unveiled a new topology for its quantum processors. Last Friday, the Technical Univer Read more…

Q&A with Jim Keller, CTO of Tenstorrent, and an HPCwire Person to Watch in 2021

April 22, 2021

As part of our HPCwire Person to Watch series, we are happy to present our interview with Jim Keller, president and chief technology officer of Tenstorrent. One of the top chip architects of our time, Keller has had an impactful career. Read more…

Senate Debate on Bill to Remake NSF – the Endless Frontier Act – Begins

May 18, 2021

The U.S. Senate today opened floor debate on the Endless Frontier Act which seeks to remake and expand the National Science Foundation by creating a technology Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire