Is IBM Getting Openness Right? Yes, Says GM Doug Balog

By John Russell

June 29, 2015

The annual Red Hat Summit, held in Boston last week, is something of a revival tent for open source where the pulpits are plentiful and so are smiling believers. Indeed it’s hard to dispute the powerful innovation springing from open source. The RH Summit, which started 11 years ago as a modest celebration of open source (mostly Red Hat Linux), has since mushroomed into a boisterous, expansive technology ecosystem showcase.

So it’s interesting when a senior IBM exec turns up in a keynote slot. Big Blue’s heritage, at least at the high end, had for years been dominated by proprietary architecture. No longer, said Doug Balog, general manager of IBM Power Systems. The founding of OpenPOWER roughly two years ago, sale of IBM’s x86 business, and the sprint away from the formidable but proprietary Blue Gene (and re-embrace of the battle-tested mainframe) are all part of IBM’s about-face.

Balog is smack in the middle of proving IBM’s commitment to open source and community development. “It’s not an accident I’m here or that IBM is a major sponsor,” he said.

Message to Red Hatters

“The message we’re bringing to this audience, is we are bringing higher value to customers where we can help clients do amazing things. We’re able to do that by supporting an open approach and that’s kind of unique versus some of my competitors in the systems business who are taking commodity infrastructure and swapping the software available and really not bringing in any differentiation. They are simply providing the recipe from Intel in a lot of ways.

“Our value is we have differentiated systems – we have POWER systems, we have storage systems, we have mainframe systems. Those deliver unique capabilities but still leverage open technology to do it. We think that’s the right recipe,” said Balog.

According to the official bio, Balog is “responsible for all facets of the Power Systems’ business including strategy, architecture, operations, technology development and overall financial performance. He is also a current member of IBM’s Performance Team and a recent member of the Strategy Team, which focus respectively on tactical execution and the strategic direction for the IBM Corporation.” Probably doesn’t get much sleep.

Balog’s IBM counterpart is Ken King, general manager, OpenPOWER at IBM. King’s organization is separate from Balog’s but closely related because both are based on the POWER8 chip. Both King and Balog report to Tom Rosamilia, Senior Vice President of IBM Systems.

While at RH Summit, Balog talked with HPCwire on a range of issues including how and why openness informs the new IBM; what the emerging HPC strategy is; how big data and mobility are transforming enterprise computing and why that’s good for IBM’s mainframes (z Systems); and Big Blue’s nascent cloud-based outreach to OpenPOWER developers. He also couldn’t resist taking a few light jabs at Intel and the ARM camp.

Bucking Technology Headwinds

“It really started two years ago with a conversation with Google, Mellanox, NVIDIA and Tyan (a motherboard company). Up until that point systems generally ran faster year after year because processor speeds kept advancing,” he said. The group fretted about the slowing rate of performance gains for applications as well as the prospect of brutal commoditization caused by stagnant innovation.

“We said, there’s this great model called open source software. Can’t we take that model and adapt it to open source hardware and incorporate that same community approach to innovation. Yes, at the end of the day IBM will pick pieces from that innovation [for their] products but others will too. It’s going to create choice in the market place and innovation, not commoditization,” said Balog.

Perhaps not a revolutionary idea – they had a model – but giving away proprietary advantage and its associated higher profit margins isn’t easy if you’re not forced to. That’s kind of the point of the free market. Views vary on how successful and how open the new initiative is, but the OpenPOWER Foundation is growing. The gang of five has grown to about 140 members at present according to Balog.

“Jim Whitehurst (Red Hat CEO) and I were discussing his new book (The Open Organization: Igniting Passion and Performance, published in June) and talking about what’s needed for openness. First, you have to form a [substantial] community because we have seen plenty of attempts at openness where it ended up being a set of family members getting together and nobody else joined. There’s always more to do but I feel good about the community we have built so far.”

“The next phase is getting a community to bring innovation to the space and we are starting to see that. At the [first] OpenPOWER summit in San Jose [in March] there were 15 companies who brought POWER-based motherboards, all very different, all very innovative, all really targeted at the cloud or HPC companies,” said Balog (see IBM’s First OpenPOWER Server Targets HPC Workloads, HPCwire).

Show Me The Money

“Now we have to start to move in the monetization direction. How do you take this community, take this innovation, and start to transform it into products and sales? How do [OpenPOWER] members who want to build on top find that balance between innovation and openness and turn it into monetization? We are all publicly traded companies so at the end of the day we want to see some sales.”

IBM helped its OpenPOWER cause recently by luring longtime industry executive Sumit Gupta away from GPU pioneer (and founding OpenPOWER member) NVIDIA. Gupta is VP, HPC & OpenPOWER and reports to King. He had been GM of GPU Accelerated Data Center Computing for NVIDIA.

“He was a perfect fit to come in and lead our HPC OpenPOWER business. He’s well connected in the industry and has a great business mind so that’s the role he has. He’s the guy Ken King and I look to and say OK you’ve got two big wins where’s the next five or ten or whatever the number is and where do we go from here.”

It doesn’t hurt that Gupta has extensive knowledge of accelerators which play a critical role in IBM Power Systems plans for HPC (see What IBM says about Gupta’s role, HPCwire).

“So [moving away from Blue Gene] was one of the strategy changes made two years ago. Historically we had built these wonderful engineering marvels called Blue Gene systems, beautiful, well-engineered machines but they are monstrous in size. Quite custom you would agree. In the best cases we shipped a few systems and didn’t lose money; in most cases we lost money. It’s a tough market.”

“We said why don’t we have an HPC strategy that takes our standard one- and two-socket systems and buck those up with accelerators through CAPI attached (POWER8 Coherent Accelerator Processor Interface). CAPI plays a big role here, or NVIDIA attached with NVLink. I think accelerators are a big opportunity here and that’s not just IBM hardware but Altera and Xilinx and NVIDIA, etc.

“That’s the kind of HPC system we’ve targeted so it’s much more about a rack full of scale out systems with accelerators. Obviously you need the code optimizations, so you’ve got to pick your target industries where they are willing to take their code and start heading down a path of leveraging accelerators. Some have seen that light already and some haven’t seen the light yet but I think more and more are,” Balog said.

Mainframe Revival

One beneficiary of all the openness changes is the mainframe. It’s always been around, but has also absorbed its share of condescension over the years. Porting Linux to IBM’s z Systems line is opening doors to new uses. Container technology, Hadoop, and a myriad of other ‘open’ technologies are becoming accessible on mainframes.

“Think about the evolution and transformation of IBM, not as a company though we could talk about that, but also from a systems business and how this aspect of openness is really transcending the way platforms like the mainframe continues to drive growth, [especially in large enterprise environments.]

“[The mainframe] has been around for 50 plus years. Obviously, it’s very different today than it was 50 years ago although you still run the same apps from 50 years ago and that’s one of the miraculous aspects of the mainframe platform is the commitment to architecture continuity while enabling new work. One of the biggest drivers in addition to Linux [on the mainframe] is mobile transactions for z Systems [such as in financial services.] I mean who actually goes to a bank these days,” said Balog.

IBM’s key HPC targets are unsurprising – government, oil & gas, financial services and increasingly life sciences. Balog emphasizes it’s a much more economical way to go, to take what’s already in the portfolio and bring in accelerators.

It will be interesting to watch how IBM fares in the TOP500 list in coming years. IBM had 153 systems (roughly 30 percent) in the TOP500 list last November including four in the top ten. IBM does have a couple of big HPC wins recently, one with the DOE and the other with the Science & Technology Facilities Council (STFC) in the U.K.

“We’re not really that focused on being in the TOP500. It’s not that we are shying away from those opportunities, but it’s not about the number of the score anymore. A lot of our focus is on the sweet spot for POWER and the marriage of HPC in those industries where it really is about the data analytics. That’s where the POWER architecture shines through even with this addition of the accelerator model. As you can see from the accelerator [possibilities] our approach is quite different than Intel’s in terms of it’s an open approach. All are welcome to bring their best acceleration technology. We didn’t see the need to go spend $16B,” said Balog.

OpenPOWER, of course, isn’t the only processor-based ecosystem out there. Intel remains the giant everyone aims for. To say it has been less than spectacularly successful is just plain wrong. On the other hand, the market does seem hungry for, or at least open to, more choice. Stir in the slowing of chip advances (i.e. the much discussed demise of Moore’s Law) and growing worry over power consumption, and suddenly the potential opportunity for non-Intel contenders seems more realistic.

The ARM camp is one contender. It’s been a huge winner in the mobile device space where ARM’s reduced power requirements are critical. Traction in the server market has been slow, but release of a 64-bit design (ARMv8) is making matters more interesting. Just a week ago, the Mont-Blanc Project at the Barcelona Supercomputing Center (BSC) fired up a prototype running on ARM, suggesting it is possible to get high performance from the architecture. Mont-Blanc is exploring a more energy-efficient approach to achieving exascale computing.

ARM Needs a Body of Support

“We keep asking ourselves about it. Our view of ARM is it had promise, we were watching it, but I think we’ve seen a lot of the ARM server companies start to fold up tent and move away from it. Part of it is a weak core design, if I could call it that, I don’t mean that disparagingly, it’s just that that’s what it is. That’s why it is in all of our mobile devices.

“It hasn’t built the server ecosystem that clients might want to look at. So we just haven’t seen it mature at the pace it might have been able to mature and it’s been around for a while. You know AMD recently sort of declared they are moving away from it,” said Balog.

HP’s Moonshot server line has a model with ARM which has a few wins, a notable recent one at the University of Utah. Part of the challenge is to interest the developer community. Balog asked wryly about Moonshot, “But has HP ever delivered much of the ARM stuff really. They quickly touted that and went right back to Moonshot is a bunch of Intel servers.”

“Could ARM and POWER partner? I don’t know. We continue to ask ourselves that question. We’ll see. It’s low power. Do we think about power issues in the OpenPOWER community and what’s down the line, sure. [But so far] we aren’t seeing power consumption as a major issue to deployment. It is the balance between do you take a slightly stronger core that can run oodles of performance benefits over an ARM or an Intel processor and therefore it’s got a little more energy consumption or do you go with a lot of systems with a really weak core.

“We continue to watch the space. We’re all for openness so I think they help chip away at Intel at the low end and we chip away at middle- and high-end. The market wants choice and that’s sort of the fundamental thing we hear from the cloud companies,” said Balog.

Developer Outreach

Clearly a big challenge, and directly related to IBM’s presence at the Red Hat Summit, is engaging the open source developer community. They need to be convinced of IBM’s commitment and to be able to play on the POWER architecture. To some extent, IBM’s efforts there remain nascent, but growing.

“[Developer outreach] is more and more cloud based, no surprise, by providing access to POWER infrastructure in the cloud. [The idea] is to leverage benefits of a cloud versus [forcing] everybody to have their server on their desktop. A couple of weeks ago in China we launched an open developer Linux platform in the cloud with accelerators called SuperVessel and it’s come one come all (see IBM Introduces SuperVessel, HPCwire).

“You can do development, try out accelerators, try out POWER, try Linux on POWER, and it’s available free. We will expand it to the rest of the world over time. We have some things in the POWER development platform today, we have some slight differences but as POWER goes into software which it will here very soon, there will be Linux on POWER that will be another opportunity for us to provide free connection to developers.”
