The @hpcnotes Predictions for HPC in 2018

By Andrew Jones

January 4, 2018

I’m not averse to making predictions about the world of High Performance Computing (and Supercomputing, Cloud, etc.) in person at conferences, meetings, casual conversations, etc.; however, it has been a while since I stuck my neck out and widely published my predictions for the year ahead in HPC. Of course, such predictions tend to be evenly split between inspired foresight and misguided idiocy. At least some of the predictions will have readers spluttering coffee in indignation at how wrong I am. But, where would the fun in HPC be if we all played safe? So, here goes for the @hpcnotes predictions for HPC in 2018 …

Intel

After spending much of 2017 being called out for ambitiously high pricing of Skylake for HPC customers, and following that with months of Xeon Phi confusion – eventually admitting publicly at SC17 that Knights Hill has been cancelled, while still being unclear about the future of Phi overall – Intel seems to have continued into 2018 in the worst way, with news of kernel memory hardware bugs flooding the IT news and social media space. [NB: these bugs have now been confirmed to affect CPUs from AMD, ARM and other vendors too.] 2018 will also see widespread availability of AMD EPYC, Cavium ThunderX2, and IBM Power9 processors, and so it seems Intel has a tough year ahead. The hardware bug is especially painful here as it negates the “Intel is the safe option” thinking. To be clear, HPC community consensus so far (including NAG’s impartial benchmarking work with customer codes) says Skylake is a very capable and performance-leading processor. However, Skylake has three possible letdowns: (1) a price substantially higher, relative to the benefits gained, than customers are comfortable with; (2) reduced cache per core compared with other CPUs; (3) dependence on a code’s saturation of the vector units to extract the maximum performance. In some early benchmarks, EPYC and TX2 are winning on both price and performance. My prediction is that Intel will meaningfully drop the Skylake price early in 2018 to pull back into a competitive position on price/performance.

AI and ML

Sorry, the media and marketing hype for AI/ML taking over HPC shows no sign of going away. Yes, there are many real use cases for AI and ML (e.g., follow Paige Bailey and colleagues for real examples); however, the aggressive insertion of AI and ML labels into every HPC-related conference agenda (taking over from the mandatory mentions of Big Data) doesn’t add a lot of value, I think. I’m not suggesting that the HPC community (users or providers) ignore AI/ML – indeed, I would firmly advocate that you add these to your portfolio. But, HPC is an exceptionally powerful and widely applicable tool in its own right – it doesn’t need AI/ML to justify itself. My prediction is that AI/ML will continue to hog a share of the HPC marketing noise unrelated to the scale of actual use in the HPC arena.

New processors

As noted above, 2018 sees credible HPC processors from AMD (EPYC), Cavium (ThunderX2) and other ARM chips, and IBM (Power9) surge into general availability. In my view, these are not (yet) competing with Intel Xeon; they are competing with each other to be the best of the rest. Depending on how Intel behaves (NB: this is not just about technology) and how well AMD/ARM/IBM and their system partners actually execute on promises, one of these might close out 2018 being a serious competitor to Intel’s dominance of the HPC processor space. Either way, I predict we will see at least one meaningful (i.e., competitively won, large scale, for production use) HPC deployment of each of these processors in 2018. I’m also going to add a second prediction to this section: a MIPS based processor option will start to gain headlines as a real HPC processor candidate in 2018 (not just in China).

Cloud

In most cases, HPC is still cheaper and more capable through traditional in-house systems than via cloud deployments. No amount of marketing changes that. Time might change it, but not by the end of 2018. However, cloud as an option for HPC is not going away. It does present a real option for many HPC workloads, and not just trivial workloads. I am hopeful we are at the end of the era where the cloud providers hoped to succeed by trying to convince everyone that “HPC in-house” advocates were just dinosaurs. The cloud companies all show signs of adjusting their offerings to the actual needs of HPC users (technical, commercial and political needs). This means that an impartial understanding of the pros and cons of cloud for your specific HPC situation is going to be even more critical in 2018. I am certainly being asked to help address the question of HPC in the cloud by my consulting customers with increasing frequency. Azure has been ramping up efforts in HPC (and AI) aggressively over the last few months through acquisitions (e.g., Cycle Computing) and recruitments (e.g., Developer Advocate teams), and I’d expect AWS and Google to do likewise. My prediction is that all three of the major cloud providers (AWS, Azure, Google) will deliver substantially more HPC-relevant solutions in 2018, and at least one will secure a major (and possibly surprising) real HPC customer win.

GPUs

Nvidia also got an unwelcome start to 2018 as they tried to ban (via retrospective changes to license conditions) the use of their cheaper GPUs in datacenter (e.g., HPC, AI, …) applications. Of course, it is no surprise that Nvidia would prefer customers to buy the much more expensive high-end GPUs for datacenter applications. However, it doesn’t say much for the supposedly compelling business case or sales success of the high-end GPUs if they have to force people off the cheaper products first. We (NAG) have done enough benchmarking across enough different customer codes to know that GPUs are flat-out the fastest widely available processor option for codes that can take effective advantage of highly parallel architectures. However, when the price of the high-end GPUs is taken into account, plus the performance left on the floor for the non-accelerated codes, then the CPUs often look a better overall choice. Ultimately, adapting many codes to use GPUs (not just a selected few codes to show easy wins) is a big effort. So is adapting workflows to the cloud. With limited resources available, I think users will decide that investing effort in cloud porting is a better long-term return than GPUs. Yes – oddly, I think cloud, not CPUs, will be the pressure that limits the success of GPUs! My prediction is that Nvidia’s unfortunate licensing assertions, coupled with marginal gains in performance relative to total cost of ownership (TCO) and a scarcity of software engineering resources, will mean that fewer newly deployed on-site HPC systems are based around GPUs. On the other hand, I think use of GPUs in the cloud, for HPC, will grow substantially in 2018.

Zettascale

Yes, really. After all, exascale is within grasping distance now. We will see multiple systems at >0.1 EF in 2018. Exascale is being talked about in terms of when and which site first, rather than how and which country first. As exascale now seems likely to happen without all those disruptive changes that voices across the community foretold would be critical, computer science researchers and supercomputer center managers will need to start using the zettascale label to drive the next round of funding bids for novel technologies. There have already been a few small gatherings on zettascale, at least as far back as 2004 (!), but I predict 2018 will see the first mainstream meeting with a session focused on zettascale – perhaps at SC18?

Cybersecurity

The consumer world was wracked in 2017 by a range of large scale cybersecurity breaches. The government community has been hit badly in previous years too. Sadly, I see cybersecurity moving up the agenda in the HPC world. Not sad that it is happening, but sad that I think it will be forced to happen by one or more incidents. In general, HPC systems are fairly well protected, largely because they are expensive, capable assets and, in some cases, have regulatory criteria to meet. However, performance and ease-of-use for a predominantly research-led userbase have been the traditional strong drivers of requirements, often meaning the risk management decisions have been tilted towards a minimally compliant security configuration. (Security is arguably one area where HPC-in-the-cloud wins.) My prediction for 2018 is twofold: (1) there will be a major security incident on a high profile HPC system; (2) cybersecurity for HPC will move from a niche topic to a mainstream agenda item for some of the larger HPC conferences.

Finally, Growth

I saw HPC and related things such as AI, cloud, etc., gain lots of momentum in 2017. This included several technologies heralded in confidence finally coming to fruition, new HPC deployments across public and private sector customers, a notable uptick in our HPC consulting work, interesting personnel moves, and an overall excitement and enthusiasm in the HPC community that had been dulled recently. My final prediction is that 2018 will see this growth and energy in the HPC community gather pace. I look forward to new HPC sites emerging, to significant new HPC systems being announced, and to the growing attention on the broader aspects of HPC beyond FLOPS – people, business aspects, impact stories, and more.

I hope you enjoyed my HPC predictions for 2018. Please do engage with me via Twitter (@hpcnotes) or LinkedIn (www.linkedin.com/in/andrewjones) if you want to comment on my inspired foresight or misguided idiocy. I’ll be back with a follow-up article in a week or two on how you can exploit these predictions to your advantage.
