With a Carbon Footprint Like HPC’s, It Matters When and Where You Step

By Oliver Peckham

December 9, 2021

From European HPC experts pondering “can fast be green?” to new milestones on the Green500 list, sustainability certainly had a moment at the hybrid SC21 conference. And it’s no wonder: the exascale era is here, and power consumption for HPC is skyrocketing even as efficiency is driven to its extremes. At SC21, another session – “HPC’s Growing Sustainability Challenges and Emerging Approaches” – tackled the topic, bringing together experts across the HPC sector to discuss how the rapidly expanding field could control its energy use and emissions.

“The context is – as I’m sure you’re all aware – that there’s aggressive government goals to decarbonize their economies and reduce negative environmental impacts broadly, and computing is sort of in a special place in this,” explained the session’s host, Andrew Chien, a senior scientist at Argonne National Laboratory. “Computing and HPC together have challenges around sustainability because of their rapidly proliferating use and very, very fast technology lifecycles.”

Chien was joined by Steve Hammond, a senior research advisor at the National Renewable Energy Laboratory; Bill Magro, chief technologist for HPC at Google; Michael McNamara, CEO and founder of Lancium; and Erik Riedel, senior vice president of engineering at ITRenew.

HPC has a big footprint.

“It’s actually particularly timely that we have this panel on sustainability,” Chien continued, “because [the international climate conference] COP26 has been going on, and maybe just completed … but entire economies – including of course HPC and computing – have made the commitment to get to net zero by various times, whether it’s 2050, 2060, or 2070.”

HPC, Chien continued, was not an insignificant slice of the pie: using the Top500 as a proxy, researchers had estimated the systems’ aggregate energy load at around 600 MW, or 5.2 terawatt-hours per year, with the top ten systems constituting around 100 MW of that load. This, he said, would account for around two million metric tons of CO2 per year – equivalent to around 285,000 average households – and that from only the 500 publicly ranked systems.
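The arithmetic behind those figures is straightforward, as the back-of-envelope check below shows (the grid carbon intensity and per-household emissions are illustrative assumptions, not values given in the session):

```python
# Back-of-envelope check of the Top500 figures cited above. The grid carbon
# intensity and per-household emissions are illustrative assumptions.
HOURS_PER_YEAR = 8760

top500_load_mw = 600                                    # aggregate Top500 load
energy_twh = top500_load_mw * HOURS_PER_YEAR / 1e6      # MW x hours -> MWh, then TWh
print(f"Annual energy: {energy_twh:.1f} TWh")           # ~5.3 TWh, close to the 5.2 cited

grid_kg_per_kwh = 0.38                                  # assumed average grid carbon intensity
co2_tonnes = energy_twh * 1e9 * grid_kg_per_kwh / 1000  # TWh -> kWh -> kg -> tonnes
print(f"Annual emissions: {co2_tonnes / 1e6:.1f} million tonnes CO2")

household_tonnes = 7.0                                  # assumed per-household emissions per year
print(f"Household equivalent: {co2_tonnes / household_tonnes:,.0f} households")
```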

And HPC’s footprint doesn’t stop at energy use (“scope two” emissions). “Scope three” emissions, which include embedded carbon from along the value chain, are also high for the sector. “We have the challenging practice of a fast-moving technology, which is that we dispose of our systems after a relatively short lifetime,” Chien said, noting that there was a “very, very significant amount of energy” corresponding to production of components for HPC systems (along with e-waste, of course).

Choosing when to step matters.

But, it turns out, with a big footprint, choosing when to step can be crucial. Computing loads are normally constant, Chien said, making it hard to go carbon-free as the power grids hosting them fluctuate in carbon intensity throughout the day. To that end, Chien detailed a project he had been working on called zero-carbon cloud (ZCCloud).

“Instead of having all of your datacenters as constant loads in the grid,” he said, “we might be able to have computing equipment modulate its consumption to the availability of excess renewable power.”

“If you take this study to its logical conclusion,” he continued, there are some “amazing opportunities.” The team had looked at workloads at Argonne for about a year, examining expected performance, throughput and energy use between a traditional, reliably-powered system and a system powered by intermittent renewable energy. The latter system, Chien said, “effectively eliminates 50 percent of the carbon emissions footprint for this aggregate computing system.”

“This approach – a holistic approach that looks at TCO from capital equipment on through operational costs and power and so on – could lead you to the conclusion that you could build a system with higher throughput per million dollars of TCO per year and … higher peak performance per million dollars of TCO per year. So this is in addition to the 50 percent emissions reduction.”

“If you look at this in the right way,” he concluded, “there may be opportunities that not only reduce the carbon emissions footprint, but also create new opportunities for new capability and even cost-effectiveness.”
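Chien’s comparison rests on running the same hardware only when clean power is available rather than around the clock. The toy comparison below illustrates the mechanism with made-up hourly carbon-intensity data and an assumed power draw; it is a sketch of the idea, not the ZCCloud study itself, and it ignores the throughput that would have to be recovered elsewhere.

```python
# Toy comparison of a constant-load system vs. one that runs only when the grid
# is clean. The hourly carbon-intensity curve and power draw are made up.
HOURS = 24 * 7
# Assumed hourly carbon intensity (kg CO2/kWh): dips midday when solar peaks.
intensity = [0.45 - 0.25 * max(0.0, 1 - abs(h % 24 - 13) / 6) for h in range(HOURS)]

load_kw = 20_000                                               # assumed system power draw
constant = [load_kw] * HOURS                                   # runs flat out every hour
flexible = [load_kw if i < 0.30 else 0.0 for i in intensity]   # runs only in clean hours

def tonnes(kwh_by_hour):
    return sum(kwh * i for kwh, i in zip(kwh_by_hour, intensity)) / 1000

print(f"Constant load: {tonnes(constant):7.1f} t CO2/week")
print(f"Flexible load: {tonnes(flexible):7.1f} t CO2/week")
print(f"Clean hours used: {sum(1 for i in intensity if i < 0.30)} of {HOURS}")
```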

Magro, in his talk, agreed with both the stated problem and the proposed solution. “Even if you want to commit to being carbon-free, the problem is there isn’t carbon-free energy available all day,” he said. “The energy supply is very spiky. … If you just present a stable load, then of course you can’t run carbon-free.”

Google, he explained, had moved away from constant loads in many cases as a way of combating this reality. “In our datacenters, we actually align with the availability of low-carbon energy and we compute more when the wind is blowing and when the sun is shining,” he said.

A simplified visualization of how Google adjusts its workloads based on renewable energy availability. Image courtesy of Bill Magro.
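As a rough sketch of what such carbon-aware placement could look like, the snippet below greedily assigns deferrable batch jobs to the cleanest hours in an assumed carbon-intensity forecast. It illustrates the concept only – it is not Google’s scheduler, and all names and numbers are hypothetical.

```python
# Sketch of carbon-aware placement of deferrable batch jobs: put each job's
# energy into the cleanest forecast hour that still has capacity.
def schedule(jobs_kwh, forecast_kg_per_kwh, capacity_kwh):
    remaining = [capacity_kwh] * len(forecast_kg_per_kwh)
    plan = []
    for kwh in sorted(jobs_kwh, reverse=True):           # place the biggest jobs first
        fits = [h for h, free in enumerate(remaining) if free >= kwh]
        if not fits:
            raise RuntimeError("not enough capacity in the forecast window")
        hour = min(fits, key=lambda h: forecast_kg_per_kwh[h])
        remaining[hour] -= kwh
        plan.append((hour, kwh, kwh * forecast_kg_per_kwh[hour]))
    return plan

forecast = [0.45, 0.42, 0.30, 0.18, 0.15, 0.20, 0.35, 0.44]   # assumed kg CO2/kWh
for hour, kwh, kg in schedule([500, 300, 200], forecast, capacity_kwh=600):
    print(f"hour {hour}: {kwh} kWh -> {kg:.0f} kg CO2")
```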

Choosing where to step matters.

But time isn’t the only variable in play – location matters, too. Hammond, who helps to manage the world’s most energy-efficient datacenter at NREL’s campus in Golden, Colorado, detailed the wide range of efficiency choices made to give the datacenter its stunning PUE of 1.036… with one major caveat. “No one approach fits every solution wherever you are,” he said. “What works for us here in Colorado may be different from what works in the Southeastern U.S.”
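For reference, PUE (power usage effectiveness) is total facility energy divided by the energy delivered to the IT equipment, so a PUE of 1.036 means only about 3.6 percent of the facility’s energy goes to cooling and power delivery. The snippet below works that arithmetic for an assumed 1 MW IT load against a more typical enterprise figure of roughly 1.6.

```python
# PUE is total facility energy divided by the energy delivered to IT equipment.
# Illustrative comparison of NREL's 1.036 against a more typical ~1.6, for an
# assumed 1 MW IT load.
it_load_kw = 1000
for pue in (1.036, 1.6):
    facility_kw = it_load_kw * pue
    overhead_kw = facility_kw - it_load_kw
    print(f"PUE {pue}: {facility_kw:.0f} kW total, {overhead_kw:.0f} kW on cooling and power delivery")
```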

This, as it turns out, was the entire pitch for another speaker – McNamara – who flipped that idea on its head: what if regionally-specific sustainability considerations played a major role in siting workloads?

McNamara first outlined the characteristics of the modern (and future) power grid. First, he said, solar and wind were being overbuilt to compensate for their intermittency; second, generation was no longer necessarily colocated with urban centers; and third, as a result of these trends, negative-priced energy – energy that renewable generators needed to offload, but for which there was no demand – was a growing problem.

“Negative-priced energy is now endemic,” he said. “And it’s endemic because the grid of the future has too much wind and solar. … The way to solve this problem is to build, at extremely large scale, many datacenters at critical points on the transmission system in every grid operator that is going decarbonized. Which is all of them.”

Negative pricing frequency, with Lancium’s target region circled. Image courtesy of Michael McNamara.

This, he said, was the pitch for his company, Lancium. “Our vision is: datacenters should act as enormous inverse power plants,” he said. “What power plants do is, when the grid is in times of need, power plants go up. The grid is indifferent if a power plant goes up or a datacenter goes down. … They consume negative-priced energy; they provide the grid inertia that the grid needs because of the retirement of fossil plants; and the net effect, if you run these in a highly flexible manner, is negative eight million tons [of carbon] a year if you build 5,000 MW of datacenters. And that’s in Texas alone. And that’s what we’re doing.”
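The control idea McNamara describes can be sketched as a simple mapping from the wholesale price signal to a power setpoint: run flat out when prices go negative, shed flexible load when prices spike. The snippet below is purely illustrative – the thresholds and prices are assumptions, not Lancium’s control logic.

```python
# Sketch of the "inverse power plant" behavior: run flat out when wholesale
# prices go negative (surplus wind/solar), shed flexible load when prices spike.
def target_load_mw(price_per_mwh, max_mw, min_mw=0.0, shed_price=100.0):
    if price_per_mwh <= 0:              # negative-priced energy: consume it all
        return max_mw
    if price_per_mwh >= shed_price:     # scarcity pricing: hand the capacity back to the grid
        return min_mw
    # in between, scale the flexible load down linearly as the price rises
    return max_mw - (max_mw - min_mw) * price_per_mwh / shed_price

hourly_prices = [-12, -5, 3, 25, 60, 140, 80, -2]    # assumed $/MWh
for hour, price in enumerate(hourly_prices):
    print(f"hour {hour}: {price:>4} $/MWh -> run at {target_load_mw(price, max_mw=300):.0f} MW")
```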

McNamara said that this solution – which could, ostensibly, run massive workloads at zero energy cost – was preferable to costly, time-consuming transmission line upgrades and more foolproof than current storage solutions, which he said were good for about four hours. Further, he explained that costly sustainability solutions were not necessary in Lancium’s datacenters, which ran hot and “moved a ton of air.”

Hand-me-downs

But all of that, of course, only addresses the scope two emissions. Scope three emissions are an entirely different beast: “It boggles my mind how much we just have to throw away when we get a new system,” Hammond said. Riedel explained how his company, ITRenew, worked to decommission datacenter equipment for companies like Facebook, Google and Twitter, shredding hard drives and repurposing other equipment to give it a second life.

Chien, earlier in the panel, had pitched this same notion, outlining how datacenter operators could deploy phased-out hardware at a secondary facility with low-cost, on-site renewable energy for another few years. This, he said, would be a win-win-win: low-cost equipment, low-cost energy and lower (annualized) embedded emissions thanks to longer lifetimes for the components.
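The annualized-emissions argument is simple amortization: the same embodied carbon spread over more years of useful service. The per-server figure below is an assumption for illustration, not a number from the panel.

```python
# Amortizing a server's embodied (scope three) carbon over a longer service life.
embodied_kg = 1500                   # assumed embodied CO2 per server (manufacture + shipping)
for lifetime_years in (3, 6):        # typical refresh cycle vs. a second-life deployment
    print(f"{lifetime_years}-year life: {embodied_kg / lifetime_years:.0f} kg CO2 per year")
```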

More sustainability coverage from SC21

At SC21, Experts Ask: Can Fast HPC Be Green?

The Green500’s Crystal Anniversary Sees MN-3 Crystallize Its Winning Streak
