AWS Reveals Gaudi-based EC2 Instances Coming in 2021

By Todd R. Weiss

December 2, 2020

Amazon Web Services has a broad swath of new and bolstered services coming for customers in 2021, from the implementation of powerful Habana Gaudi AI hardware in Amazon EC2 instances for machine learning workloads to custom-designed AWS Trainium ML training chips built to cut cloud training costs.

The new products were announced on Tuesday (Dec. 1) by AWS CEO Andy Jassy, who opened the company’s 9th annual re:Invent educational and learning conference with a virtual keynote from Seattle for the first time ever due to the COVID-19 pandemic.

Also slated for arrival in 2021 are new Graviton2-equipped AWS instances, new AWS GP3 general purpose data storage volumes, and on-premises ECS Anywhere (Elastic Container Service) and EKS Anywhere (Elastic Kubernetes Service) offerings that allow customers to run Amazon services inside their own datacenters for the first time. Other upcoming products and service updates include the introduction of the AWS Aurora Serverless v.2, and new Lambda Containers Support.

Gaudi processor high-level architecture

The new Habana Gaudi-based Amazon EC2 instances will be offered in the first half of 2021, said Jassy, through a partnership between AWS and Intel, which acquired Habana Labs for $2 billion in 2019. The Gaudi accelerators promise 40 percent better price-performance than the best performing GPU instances today, according to AWS.

“It will work with all the main machine learning frameworks, PyTorch as well as TensorFlow,” and will help the company keep pushing the price-performance envelope and machine learning training advancements, said Jassy. The Gaudi accelerators are designed for training deep learning models for workloads that include natural language processing, object detection and machine learning training, classification, recommendation and personalization.

Up to eight Habana Gaudi accelerators will power each EC2 ML instance, and a fully-equipped instance can process about 12,000 images-per-second training the ResNet-50 model on TensorFlow, according to Intel. Gaudi-based EC2 instances are designed to deliver increased performance and greater cost efficiencies for customers, while allowing developers to build new or port existing training models from graphics processing units to Gaudi accelerators.

Each Gaudi chip provides 32GB of HBM2 memory and implements 10 ports of standard 100 Gigabit Ethernet. Native RDMA over Converged Ethernet connects the chips within the server, and multiple Gaudi servers can be clustered using AWS Elastic Fabric Adapter (EFA) technology to enable scalable distributed training.

Current generation Gaudi chips are fabricated on TSMC’s 16nm process, and Habana plans for the follow-on Gaudi2 to use TSMC 7nm. Intel’s Habana Labs also makes the an inference-focused chip, called Goya.

AWS Trainium Chips

The company’s all-new AWS Trainium chips are machine learning chips that are custom-designed by AWS to deliver the most cost-effective training in the cloud, according to Jassy.

According to AWS, Trainium provides the highest performance with the most teraflops (TFLOPS) of compute power for ML in the cloud, while also enabling a broader set of ML applications. Trainium chips are optimized for deep learning training workloads for applications including image classification, semantic search, translation, voice recognition, natural language processing and recommendation engines.

“Trainium will be even more cost-effective than the Habana chip” and will support all the major frameworks, including TensorFlow, PyTorch and [Apache] MXnet, he said. “You’re going to use the same [AWS] Neuron SDK, that our Inferentia customers use. So, if you use Inferentia for inference it will be easy to also get going on our machine learning chip Trainium. It’ll be available both as an EC2 instance as well as in [the AWS] Sagemaker [ML service] in the second half of 2021.”

Karl Freund, senior analyst at Moor Insights and Strategy, called Trainium “a fitting bookend to Inferentia,” AWS’s inference chip that was revealed in 2018 and deployed last year.

“Supporting Trainium, Gaudi and Nvidia GPUs is a smart move,” Freund wrote for Forbes, “and it is consistent with AWS’s strategy of offering customers a variety of technologies to meet their specific needs.”

New Graviton2-Powered Instances

Designed for compute-heavy and network-heavy workloads, AWS will also debut new C6gn instances in the next couple weeks, powered by Amazon’s Arm-based Graviton2 chips. The new instances will include 100 gigabit-per-second performance capabilities that promise to save money for customers while increasing speeds, said Jassy.

Also coming soon are new general purpose AWS GP3 (General Purpose) volumes for AWS Elastic Block Store (EBS). GP3 volumes are evolving from the previous generation of GP2 volumes that were introduced in 2014.

“The feedback that we’ve gotten the last year or two from customers is that we love GP2, but if we had a wish list, there’s a couple things that we’d like from you,” including lower costs per gigabyte and the ability to scale throughput or IOPS without also having to scale its storage, said Jassy.

The AWS team worked on those requests, resulting in new GP3 volumes that have 20 percent lower costs per gigabyte with the ability to provision IOPS and throughput separately from storage, he said.

“The baseline performance if you do GP3 volumes is 3,000 IOPS and 125 megabytes per second, but you can burst that and scale that up to a peak of 1,000 megabytes per second, which is four times that of GP2,” said Jassy. “And you’ll see that customers will be able to run many more of their demanding workloads on GP3 that they even were running on GP2.”

ECS Anywhere and EKS Anywhere

AWS has offered its managed Elastic Container Service (ECS) and Elastic Kubernetes Service (EKS) services to customers for several years, with more than 100,000 active ECS customers using it with billions of compute hours on EKS every week on AWS, according to Jassy. Amazon ECS is a fully managed container orchestration service, while Amazon EKS gives users a managed environment where they can start, run, and scale Kubernetes applications in the AWS cloud. Some customers, however, prefer to run these workloads on-premises, which they couldn’t do with the existing ECS and EKS services, Jassy added.

With those requests repeated by many customers, new on-premises Amazon ECS Anywhere and EKS Anywhere offerings will now be available, giving customers the options they wanted, said Jassy.

“ECS Anywhere allows you to have all the same AWS style API’s and cluster configuration management pieces on-premises that you have in the cloud, so it makes it easy,” he said. “It works with all your on-premises infrastructure.”

That led to some EKS customers wanting the same capabilities, which led to the creation of EKS Anywhere for Kubernetes users, which lets EKS customers run the services in their own datacenters, according to Jassy.

Some EKS customers were so excited about the coming services in 2021 that AWS is now making the EKS Kubernetes distribution open source so that customers can start using it now, he added. “It will be exactly the same as what we do with EKS. We’ll make all the same patches and updates so you can actually be starting to transition as you get ready for EKS Anywhere.”

Gartner analyst Arun Chandrasekaran said that while the vast majority of clients will continue to use the services through hybrid cloud deployments, the new services offer flexibility.

“The ECS Anywhere and EKS Anywhere products provide customers with a hybrid cloud option of running application containers in a consistent manner across on-premises and AWS public cloud,” he said. “While ECS offers more operational simplicity across a hybrid environment, the EKS offering extends Kubernetes into customer datacenters.”

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

SC22 Unveils ACM Gordon Bell Prize Finalists

August 12, 2022

Courtesy of the schedule for the SC22 conference, we now have our first glimpse at the finalists for this year’s coveted Gordon Bell Prize. The Gordon Bell Prize, of course, comes with an award of $10,000 courtesy of H Read more…

Q&A with ORNL’s Bronson Messer, an HPCwire Person to Watch in 2022

August 12, 2022

HPCwire presents our interview with Bronson Messer, distinguished scientist and director of Science at the Oak Ridge Leadership Computing Facility (OLCF), ORNL, and an HPCwire 2022 Person to Watch. Messer recaps ORNL's journey to exascale and sheds light on how all the pieces line up to support the all-important science. Also covered are the role... Read more…

TACC Simulations Probe the First Days of Stars, Black Holes

August 12, 2022

The stunning images produced by the James Webb Space Telescope and recent supercomputer-enabled black hole imaging efforts have brought the early days of the universe quite literally into sharp focus. Researchers from th Read more…

Google Program to Free Chips Boosts University Semiconductor Design

August 11, 2022

A Google-led program to design and manufacture chips for free is becoming popular among researchers and computer enthusiasts. The search giant's open silicon program is providing the tools for anyone to design chips, which then get manufactured. Google foots the entire bill, from a chip's conception to delivery of the final product in a user's hand. Google's... Read more…

Argonne Deploys Polaris Supercomputer for Science in Advance of Aurora

August 9, 2022

Argonne National Laboratory has made its newest supercomputer, Polaris, available for scientific research. The system, which ranked 14th on the most recent Top500 list, is serving as a testbed for the exascale Aurora system slated for delivery in the coming months. The HPE-built Polaris system (pictured in the header) consists of 560 nodes... Read more…

AWS Solution Channel

Shutterstock 1519171757

Running large-scale CFD fire simulations on AWS for Amazon.com

This post was contributed by Matt Broadfoot, Senior Fire Strategy Manager at Amazon Design and Construction, and Antonio Cennamo ProServe Customer Practice Manager, Colin Bridger Principal HPC GTM Specialist, Grigorios Pikoulas ProServe Strategic Program Leader, Neil Ashton Principal, Computational Engineering Product Strategy, Roberto Medar, ProServe HPC Consultant, Taiwo Abioye ProServe Security Consultant, Talib Mahouari ProServe Engagement Manager at AWS. Read more…

Microsoft/NVIDIA Solution Channel

Shutterstock 1689646429

Gain a Competitive Edge using Cloud-Based, GPU-Accelerated AI KYC Recommender Systems

Financial services organizations face increased competition for customers from technologies such as FinTechs, mobile banking applications, and online payment systems. To meet this challenge, it is important for organizations to have a deep understanding of their customers. Read more…

US CHIPS and Science Act Signed Into Law

August 9, 2022

Just a few days after it was passed in the Senate, the U.S. CHIPS and Science Act has been signed into law by President Biden. In a ceremony today, Biden signed and lauded the ambitious piece of legislation, which over the course of the legislative process broadened to include hundreds of billions in additional science and technology spending. He was flanked by Speaker... Read more…

Q&A with ORNL’s Bronson Messer, an HPCwire Person to Watch in 2022

August 12, 2022

HPCwire presents our interview with Bronson Messer, distinguished scientist and director of Science at the Oak Ridge Leadership Computing Facility (OLCF), ORNL, and an HPCwire 2022 Person to Watch. Messer recaps ORNL's journey to exascale and sheds light on how all the pieces line up to support the all-important science. Also covered are the role... Read more…

Google Program to Free Chips Boosts University Semiconductor Design

August 11, 2022

A Google-led program to design and manufacture chips for free is becoming popular among researchers and computer enthusiasts. The search giant's open silicon program is providing the tools for anyone to design chips, which then get manufactured. Google foots the entire bill, from a chip's conception to delivery of the final product in a user's hand. Google's... Read more…

Argonne Deploys Polaris Supercomputer for Science in Advance of Aurora

August 9, 2022

Argonne National Laboratory has made its newest supercomputer, Polaris, available for scientific research. The system, which ranked 14th on the most recent Top500 list, is serving as a testbed for the exascale Aurora system slated for delivery in the coming months. The HPE-built Polaris system (pictured in the header) consists of 560 nodes... Read more…

US CHIPS and Science Act Signed Into Law

August 9, 2022

Just a few days after it was passed in the Senate, the U.S. CHIPS and Science Act has been signed into law by President Biden. In a ceremony today, Biden signed and lauded the ambitious piece of legislation, which over the course of the legislative process broadened to include hundreds of billions in additional science and technology spending. He was flanked by Speaker... Read more…

12 Midwestern Universities Team to Boost Semiconductor Supply Chain

August 8, 2022

The combined stressors of Covid-19 and the invasion of Ukraine have sent every major nation scrambling to reinforce its mission-critical supply chains – including and in particular the semiconductor supply chain. In the U.S. – which, like much of the world, relies on Asia for its semiconductors – those efforts have taken shape through the recently... Read more…

Quantum Pioneer D-Wave Rings NYSE Bell, Begins Life as Public Company

August 8, 2022

D-Wave Systems, one of the early quantum computing pioneers, has completed its SPAC deal to go public. Its merger with DPCM Capital was completed last Friday, and today, D-Wave management rang the bell on the New York Stock Exchange. It is now trading under two ticker symbols – QBTS and QBTS WS (warrant shares), respectively. Welcome to the public... Read more…

Supercomputer Models Explosives Critical for Nuclear Weapons

August 6, 2022

Lawrence Livermore National Laboratory (LLNL) is one of the laboratories that operates under the auspices of the National Nuclear Security Administration (NNSA), which manages the United States’ stockpile of nuclear weapons. Amid major efforts to modernize that stockpile, LLNL has announced that researchers from its own Energetic Materials Center... Read more…

SEA Changes: How EuroHPC Is Preparing for Exascale

August 5, 2022

Back in June, the EuroHPC Joint Undertaking – which serves as the EU’s concerted supercomputing play – announced its first exascale system: JUPITER, set to be installed by the Jülich Supercomputing Centre (FZJ) in 2023. But EuroHPC has been preparing for the exascale era for a much longer time: eight months... Read more…

Nvidia R&D Chief on How AI is Improving Chip Design

April 18, 2022

Getting a glimpse into Nvidia’s R&D has become a regular feature of the spring GTC conference with Bill Dally, chief scientist and senior vice president of research, providing an overview of Nvidia’s R&D organization and a few details on current priorities. This year, Dally focused mostly on AI tools that Nvidia is both developing and using in-house to improve... Read more…

Royalty-free stock illustration ID: 1919750255

Intel Says UCIe to Outpace PCIe in Speed Race

May 11, 2022

Intel has shared more details on a new interconnect that is the foundation of the company’s long-term plan for x86, Arm and RISC-V architectures to co-exist in a single chip package. The semiconductor company is taking a modular approach to chip design with the option for customers to cram computing blocks such as CPUs, GPUs and AI accelerators inside a single chip package. Read more…

The Final Frontier: US Has Its First Exascale Supercomputer

May 30, 2022

In April 2018, the U.S. Department of Energy announced plans to procure a trio of exascale supercomputers at a total cost of up to $1.8 billion dollars. Over the ensuing four years, many announcements were made, many deadlines were missed, and a pandemic threw the world into disarray. Now, at long last, HPE and Oak Ridge National Laboratory (ORNL) have announced that the first of those... Read more…

US Senate Passes CHIPS Act Temperature Check, but Challenges Linger

July 19, 2022

The U.S. Senate on Tuesday passed a major hurdle that will open up close to $52 billion in grants for the semiconductor industry to boost manufacturing, supply chain and research and development. U.S. senators voted 64-34 in favor of advancing the CHIPS Act, which sets the stage for the final consideration... Read more…

Top500: Exascale Is Officially Here with Debut of Frontier

May 30, 2022

The 59th installment of the Top500 list, issued today from ISC 2022 in Hamburg, Germany, officially marks a new era in supercomputing with the debut of the first-ever exascale system on the list. Frontier, deployed at the Department of Energy’s Oak Ridge National Laboratory, achieved 1.102 exaflops in its fastest High Performance Linpack run, which was completed... Read more…

Newly-Observed Higgs Mode Holds Promise in Quantum Computing

June 8, 2022

The first-ever appearance of a previously undetectable quantum excitation known as the axial Higgs mode – exciting in its own right – also holds promise for developing and manipulating higher temperature quantum materials... Read more…

AMD’s MI300 APUs to Power Exascale El Capitan Supercomputer

June 21, 2022

Additional details of the architecture of the exascale El Capitan supercomputer were disclosed today by Lawrence Livermore National Laboratory’s (LLNL) Terri Read more…

PsiQuantum’s Path to 1 Million Qubits

April 21, 2022

PsiQuantum, founded in 2016 by four researchers with roots at Bristol University, Stanford University, and York University, is one of a few quantum computing startups that’s kept a moderately low PR profile. (That’s if you disregard the roughly $700 million in funding it has attracted.) The main reason is PsiQuantum has eschewed the clamorous public chase for... Read more…

Leading Solution Providers

Contributors

ISC 2022 Booth Video Tours

AMD
AWS
DDN
Dell
Intel
Lenovo
Microsoft
PENGUIN SOLUTIONS

Exclusive Inside Look at First US Exascale Supercomputer

July 1, 2022

HPCwire takes you inside the Frontier datacenter at DOE's Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tenn., for an interview with Frontier Project Direc Read more…

AMD Opens Up Chip Design to the Outside for Custom Future

June 15, 2022

AMD is getting personal with chips as it sets sail to make products more to the liking of its customers. The chipmaker detailed a modular chip future in which customers can mix and match non-AMD processors in a custom chip package. "We are focused on making it easier to implement chips with more flexibility," said Mark Papermaster, chief technology officer at AMD during the analyst day meeting late last week. Read more…

Intel Reiterates Plans to Merge CPU, GPU High-performance Chip Roadmaps

May 31, 2022

Intel reiterated it is well on its way to merging its roadmap of high-performance CPUs and GPUs as it shifts over to newer manufacturing processes and packaging technologies in the coming years. The company is merging the CPU and GPU lineups into a chip (codenamed Falcon Shores) which Intel has dubbed an XPU. Falcon Shores... Read more…

Nvidia, Intel to Power Atos-Built MareNostrum 5 Supercomputer

June 16, 2022

The long-troubled, hotly anticipated MareNostrum 5 supercomputer finally has a vendor: Atos, which will be supplying a system that includes both Nvidia and Inte Read more…

India Launches Petascale ‘PARAM Ganga’ Supercomputer

March 8, 2022

Just a couple of weeks ago, the Indian government promised that it had five HPC systems in the final stages of installation and would launch nine new supercomputers this year. Now, it appears to be making good on that promise: the country’s National Supercomputing Mission (NSM) has announced the deployment of “PARAM Ganga” petascale supercomputer at Indian Institute of Technology (IIT)... Read more…

Is Time Running Out for Compromise on America COMPETES/USICA Act?

June 22, 2022

You may recall that efforts proposed in 2020 to remake the National Science Foundation (Endless Frontier Act) have since expanded and morphed into two gigantic bills, the America COMPETES Act in the U.S. House of Representatives and the U.S. Innovation and Competition Act in the U.S. Senate. So far, efforts to reconcile the two pieces of legislation have snagged and recent reports... Read more…

AMD Lines Up Alternate Chips as It Eyes a ‘Post-exaflops’ Future

June 10, 2022

Close to a decade ago, AMD was in turmoil. The company was playing second fiddle to Intel in PCs and datacenters, and its road to profitability hinged mostly on Read more…

Exascale Watch: Aurora Installation Underway, Now Open for Reservations

May 10, 2022

Installation has begun on the Aurora supercomputer, Rick Stevens (associate director of Argonne National Laboratory) revealed today during the Intel Vision event keynote taking place in Dallas, Texas, and online. Joining Intel exec Raja Koduri on stage, Stevens confirmed that the Aurora build is underway – a major development for a system that is projected to deliver more... Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire