Esperanto, Silicon in Hand, Champions the Efficiency of Its 1,092-Core RISC-V Chip

By Oliver Peckham

August 27, 2021

Esperanto Technologies made waves last December when it announced ET-SoC-1, a new RISC-V-based chip aimed at machine learning that packed nearly 1,100 cores onto a package small enough to fit six times over on a single PCIe card. Now, Esperanto is back, silicon in-hand and taking aim at the competition with bold estimates of ET-SoC-1’s energy efficiency.

ET-SoC-1

First, a refresher: ET-SoC-1 can operate either as a standalone processor or as a PCIe-driven accelerator. At Hot Chips 33 this week, Esperanto Technologies founder and executive chairman Dave Ditzel called it “the world’s highest-performance commercial RISC-V chip.”

“On one 7nm chip,” he said, “we put 1,088 energy-efficient ET-Minion RISC-V processors, each with its own vector/tensor unit; four high-performance ET-Maxion RISC-V processors; over 160 million bytes of on-chip SRAM; and interfaces for external DRAM and flash memory.”

The chip is aimed at machine learning recommendation workloads, which Ditzel said were among the most important workloads for datacenters due to their increasing popularity coupled with their demanding performance memory requirements. Most of these datacenters, Ditzel explained, were running x86 servers with open PCIe slots, leaving Esperanto an opening to enter existing datacenters through a high-performing PCIe card.

Performance maximization through energy efficiency

Operating with existing datacenters meant adhering to a set of requirements. Perhaps most importantly, the PCIe slots have a limited power budget – around 75 to 120 watts. “Esperanto’s challenge was how to put the highest recommendation performance onto a single PCIe-based accelerator card using no more than six of our chips and no more than 120 watts,” Ditzel said. This left Esperanto at a fork in the road.

“Some of the other solutions use one giant hot chip that uses up the entire power budget of the accelerator card,” Ditzel said. “Single-chip solutions often push for the highest operating frequencies, but this comes with very high power and it’s not very energy efficient. Esperanto realized that transistors – particularly 7nm FinFETs – are much more energy-efficient when operated at low voltages.”

Ditzel illustrated this point with a graph showing the energy efficiency of ET-SoC-1 (described in terms of inferences per second per watt) across different operating voltages.

Image courtesy of Dave Ditzel/Esperanto Technologies.

At the highest operating frequency (0.9 volts), Ditzel said, just one of their chips could use 275 watts – well beyond the power budget of a PCIe slot. At 0.67 volts, they could come in just under the power budget.

But if, instead, they followed the peak of the energy efficiency graph, the entire chip would consume just 8.5 watts and the six chips would fit in a 120-watt budget with “room to spare.” With six chips, the performance would be two and a half times better than the one-chip solution, and the energy efficiency twenty times better than the 275-watt scenario.

Of course, they’d want to use up the entire power budget – and operating at about 0.4 volts, Ditzel said, one chip would take about 20 watts. Using six chips at that voltage yielded four times better performance than the one-chip solution.

An example of how three ET-SoC-1 chips could fit on each side of a Glacier Point v2 PCIe card. Image courtesy of Dave Ditzel/Esperanto Technologies.

Squaring up to the competition

Of course, Esperanto is facing down an increasingly crowded field of accelerators. So: after all this work on efficiency, how does ET-SoC-1 actually stack up?

For that, Esperanto turned to the MLPerf deep learning recommendation model benchmarks for the Intel Xeon Platinum 8380H 8S (an example of a server chip) and two Nvidia GPUs, the A10 and the T4. With respect to the ET-SoC-1, Ditzel said that “all the performance [metrics] for Esperanto shown so far are projections based upon gate-level simulations of the entire chip and a large Synopsys ZeBu hardware emulation system.”

Using the Xeon chip as the baseline, Esperanto presented comparisons of the relative performance and relative performance per watt of the four processors. The T4 operated at 11 times the performance and 39 times the performance per watt of the Xeon chip; the A10, at 31 times and 52 times, respectively; the ET-SoC-1, 59 times and 123 times.

Image courtesy of Dave Ditzel/Esperanto Technologies.

Using the ResNet-50 inference benchmark, the processors showed similar results, with the ET-SoC-1 again arriving in first place with 15.4 times the performance and 25.7 times the performance per watt of the Xeon chip.

Unsurprisingly, Esperanto is particularly interested in its projected performance-per-watt dominance.

“Our position is that performance-per-watt is actually a better metric of performance, since everyone ought to be measured at similar power usage,” Ditzel said.

The ET-SoC-1 silicon. Image courtesy of Dave Ditzel/Esperanto Technologies.

The future

Esperanto has big ambitions: in his presentation, Ditzel talked about how, using open-source Glacier Point designs, a single large datacenter could contain millions of ET-SoC-1 chips. For now, though, the Mountain View-based company is still laying its foundation. “At the time of this recording, we’ve recently received silicon in our labs and are in the process of bringing up chip testing,” Ditzel said, showing off the first 7nm ET-SoC-1 chip mounted in the package.

To learn more, read our previous coverage of Esperanto Technologies here.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Quantum Companies D-Wave and Rigetti Again Face Stock Delisting

October 4, 2024

Both D-Wave (NYSE: QBTS) and Rigetti (Nasdaq: RGTI) are again facing stock delisting. This is a third time for D-Wave, which issued a press release today following notification by the SEC. Rigetti was notified of delisti Read more…

Alps Scientific Symposium Highlights AI’s Role in Tackling Science’s Biggest Challenges

October 4, 2024

ETH Zürich recently celebrated the launch of the AI-optimized “Alps” supercomputer with a scientific symposium focused on the future possibilities of scientific AI thanks to increased compute power and a flexible ar Read more…

The New MLPerf Storage Benchmark Runs Without ML Accelerators

October 3, 2024

MLCommons is known for its independent Machine Learning (ML) benchmarks. These benchmarks have focused on mathematical ML operations and accelerators (e.g., Nvidia GPUs). Recently, MLCommons introduced the results of its Read more…

DataPelago Unveils Universal Engine to Unite Big Data, Advanced Analytics, HPC, and AI Workloads

October 3, 2024

DataPelago today emerged from stealth with a new virtualization layer that it says will allow users to move AI, data analytics, and ETL workloads to whatever physical processor they want, without making code changes, the Read more…

IBM Quantum Summit Evolves into Developer Conference

October 2, 2024

Instead of its usual quantum summit this year, IBM will hold its first IBM Quantum Developer Conference which the company is calling, “an exclusive, first-of-its-kind.” It’s planned as an in-person conference at th Read more…

Stayin’ Alive: Intel’s Falcon Shores GPU Will Survive Restructuring

October 2, 2024

Intel's upcoming Falcon Shores GPU will survive the brutal cost-cutting measures as part of its "next phase of transformation." An Intel spokeswoman confirmed that the company will release Falcon Shores as a GPU. The com Read more…

The New MLPerf Storage Benchmark Runs Without ML Accelerators

October 3, 2024

MLCommons is known for its independent Machine Learning (ML) benchmarks. These benchmarks have focused on mathematical ML operations and accelerators (e.g., Nvi Read more…

DataPelago Unveils Universal Engine to Unite Big Data, Advanced Analytics, HPC, and AI Workloads

October 3, 2024

DataPelago today emerged from stealth with a new virtualization layer that it says will allow users to move AI, data analytics, and ETL workloads to whatever ph Read more…

Stayin’ Alive: Intel’s Falcon Shores GPU Will Survive Restructuring

October 2, 2024

Intel's upcoming Falcon Shores GPU will survive the brutal cost-cutting measures as part of its "next phase of transformation." An Intel spokeswoman confirmed t Read more…

How GenAI Will Impact Jobs In the Real World

September 30, 2024

There’s been a lot of fear, uncertainty, and doubt (FUD) about the potential for generative AI to take people’s jobs. The capability of large language model Read more…

IBM and NASA Launch Open-Source AI Model for Advanced Climate and Weather Research

September 25, 2024

IBM and NASA have developed a new AI foundation model for a wide range of climate and weather applications, with contributions from the Department of Energy’s Read more…

Intel Customizing Granite Rapids Server Chips for Nvidia GPUs

September 25, 2024

Intel is now customizing its latest Xeon 6 server chips for use with Nvidia's GPUs that dominate the AI landscape. The chipmaker's new Xeon 6 chips, also called Read more…

Building the Quantum Economy — Chicago Style

September 24, 2024

Will there be regional winner in the global quantum economy sweepstakes? With visions of Silicon Valley’s iconic success in electronics and Boston/Cambridge� Read more…

How GPUs Are Embedded in the HPC Landscape

September 23, 2024

Grasping the basics of Graphics Processing Unit (GPU) architecture is crucial for understanding how these powerful processors function, particularly in high-per Read more…

Shutterstock_2176157037

Intel’s Falcon Shores Future Looks Bleak as It Concedes AI Training to GPU Rivals

September 17, 2024

Intel's Falcon Shores future looks bleak as it concedes AI training to GPU rivals On Monday, Intel sent a letter to employees detailing its comeback plan after Read more…

Nvidia Shipped 3.76 Million Data-center GPUs in 2023, According to Study

June 10, 2024

Nvidia had an explosive 2023 in data-center GPU shipments, which totaled roughly 3.76 million units, according to a study conducted by semiconductor analyst fir Read more…

Granite Rapids HPC Benchmarks: I’m Thinking Intel Is Back (Updated)

September 25, 2024

Waiting is the hardest part. In the fall of 2023, HPCwire wrote about the new diverging Xeon processor strategy from Intel. Instead of a on-size-fits all approa Read more…

AMD Clears Up Messy GPU Roadmap, Upgrades Chips Annually

June 3, 2024

In the world of AI, there's a desperate search for an alternative to Nvidia's GPUs, and AMD is stepping up to the plate. AMD detailed its updated GPU roadmap, w Read more…

Ansys Fluent® Adds AMD Instinct™ MI200 and MI300 Acceleration to Power CFD Simulations

September 23, 2024

Ansys Fluent® is well-known in the commercial computational fluid dynamics (CFD) space and is praised for its versatility as a general-purpose solver. Its impr Read more…

Shutterstock_1687123447

Nvidia Economics: Make $5-$7 for Every $1 Spent on GPUs

June 30, 2024

Nvidia is saying that companies could make $5 to $7 for every $1 invested in GPUs over a four-year period. Customers are investing billions in new Nvidia hardwa Read more…

Shutterstock 1024337068

Researchers Benchmark Nvidia’s GH200 Supercomputing Chips

September 4, 2024

Nvidia is putting its GH200 chips in European supercomputers, and researchers are getting their hands on those systems and releasing research papers with perfor Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Leading Solution Providers

Contributors

Everyone Except Nvidia Forms Ultra Accelerator Link (UALink) Consortium

May 30, 2024

Consider the GPU. An island of SIMD greatness that makes light work of matrix math. Originally designed to rapidly paint dots on a computer monitor, it was then Read more…

IBM Develops New Quantum Benchmarking Tool — Benchpress

September 26, 2024

Benchmarking is an important topic in quantum computing. There’s consensus it’s needed but opinions vary widely on how to go about it. Last week, IBM introd Read more…

Quantum and AI: Navigating the Resource Challenge

September 18, 2024

Rapid advancements in quantum computing are bringing a new era of technological possibilities. However, as quantum technology progresses, there are growing conc Read more…

Intel Customizing Granite Rapids Server Chips for Nvidia GPUs

September 25, 2024

Intel is now customizing its latest Xeon 6 server chips for use with Nvidia's GPUs that dominate the AI landscape. The chipmaker's new Xeon 6 chips, also called Read more…

Google’s DataGemma Tackles AI Hallucination

September 18, 2024

The rapid evolution of large language models (LLMs) has fueled significant advancement in AI, enabling these systems to analyze text, generate summaries, sugges Read more…

Microsoft, Quantinuum Use Hybrid Workflow to Simulate Catalyst

September 13, 2024

Microsoft and Quantinuum reported the ability to create 12 logical qubits on Quantinuum's H2 trapped ion system this week and also reported using two logical qu Read more…

IonQ Plots Path to Commercial (Quantum) Advantage

July 2, 2024

IonQ, the trapped ion quantum computing specialist, delivered a progress report last week firming up 2024/25 product goals and reviewing its technology roadmap. Read more…

US Implements Controls on Quantum Computing and other Technologies

September 27, 2024

Yesterday the Commerce Department announced export controls on quantum computing technologies as well as new controls for advanced semiconductors and additive Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire