Ansys Fluent® Adds AMD Instinct™ MI200 and MI300 Acceleration to Power CFD Simulations

By Martin Huarte, Developer Relations Manager, AMD and Wim Slagter, Director of Partner Programs, Ansys

September 23, 2024

Ansys Fluent® is well-known in the commercial computational fluid dynamics (CFD) space and is praised for its versatility as a general-purpose solver. Its impressive breadth of physics capabilities makes finding a problem that Fluent cannot solve seemingly impossible. Fluent can be found in use across numerous industry segments globally, particularly in the automotive and aerospace sectors, as well as in energy, materials and chemical processing, and high-tech industries.

Ansys recently integrated support for AMD Instinct™ MI200 and MI300 accelerators into Fluent, its Flagship CFD Solver, significantly enhancing simulation efficiency and power. With the Fluent GPU Solver, simulations that once took weeks or months can now be completed in hours or days.

Director Jeremy McCaslin of the Ansys Fluids Product Management team adds, “Simulation is not a commodity anymore; it’s a necessity. Fluent customers need to innovate designs faster, bring products to market more quickly, and do all of this more sustainably.” He emphasizes the energy efficiency of GPUs, noting, “Large compute centers consume substantial energy, and using AMD Instinct GPUs can achieve similar computational tasks with significantly less energy.”

As such, the release of Ansys Fluent 2024 R2 is a significant milestone. “This is our first version to have full AMD GPU support, enabling our users to run AMD Instinct GPUs out of the box, on MI200 and the MI300 series,” says Ansys Distinguished Engineer Rongguang Jia.

McCaslin describes Ansys’s long-term strategy with AMD GPU support, “We want to provide something that has all of your bases covered.” By fully supporting AMD Instinct GPUs in the new release and strategically prioritizing feature development, Fluent software aims to deliver a powerful, comprehensive, and efficient solution. “Ensuring that GPU-supported features perform on par with or exceed customer expectations is crucial,” McCaslin notes.
Travis Karr, AMD Corporate Vice President for HPC, adds, “We are pleased to closely partner with Ansys to integrate the industry’s leading Fluent software into AMD’s high-performance Instinct accelerators. Fluent users can now run AMD Instinct GPUs out of the box, increasing CFD efficiency and lowering energy consumption”.

AMD and Ansys share a long history of collaboration

The collaboration between AMD and Ansys is rooted in a shared commitment to an open ecosystem. Ansys excels in simulation technology but relies on partners like AMD for hardware expertise. “AMD is a strategic partner within our overall philosophy of providing software solutions to our customers,” says McCaslin. “We believe offering options like AMD Instinct GPUs is good for the overall pace of innovation in the market.”

This partnership has long roots, with AMD contributing to Fluent’s parallel performance and CPU core scaling capabilities for decades. Jia recalls, “When I started working at Ansys 20 years ago, AMD had just come out with the x86 64-bit, which transformed the industry.” He continues, “Today, AMD Instinct GPUs power Frontier, the world’s fastest supercomputer, offering substantial savings and contributing to corporate sustainability goals.”

Supporting simulations that get larger and more complex by the day

Of course, the primary objective for many customers is the ability to perform simulations much faster and handle more complex tasks. “A high-fidelity simulation that once took weeks to complete can now be finished in just a day or two with heterogeneous (CPU+GPU) computing. Simulations that used to require several days are now done in the time it takes to have lunch, allowing for faster iterations and greater innovation,” McCaslin shares. He notes that while some customers are excited to run faster simulations of existing models, others are excited to tackle larger and more detailed simulations. “Just a few years ago, running a billion cells felt really big. A billion cells seem smaller every day,” he says.

To that end, integrating AMD Instinct GPUs with Fluent software offers significant advancements. McCaslin explains, “Fluent’s coupled solver is very much the workhorse for customers doing steady state problems because it significantly improves the robustness and time to convergence. The high memory bandwidth and capacity of the AMD Instinct MI300X makes it a good choice for applications requiring steady-state analysis.” He adds, “But, in general, all the models are well suited for AMD Instinct GPUs.”
Karr agrees, “Ansys has effectively harnessed AMD Instinct GPUs’ industry leading memory capacity and bandwidth to significantly accelerate complex simulations”.
A standout feature of the AMD Instinct MI300A APU, is its shared GPU and CPU memory, which can allow logically complex models to benefit from continuing to run on AMD EPYC™ CPUs cores, while more computationally demanding tasks are handled by the GPU cores, without extensive data transfers. McCaslin says, “Having shared memory between CPU and GPU opens up a lot of interesting opportunities to get things onto the GPU where the compute will be faster.” The ability to perform tasks on the CPU without data movement issues can accelerate the delivery of hybrid CPU-GPU solutions to customers more quickly.

McCaslin emphasizes the practical benefits of such developments, noting that GPUs could efficiently utilize large meshes created and iteratively modified by CPUs without significant data movement. “We don’t want to be moving large amounts of data around just so that the GPUs can access the mesh that the CPUs created,” he explains.

“We have customers with decades of established CPU CFD usage built around workflows developed over many years,” says McCaslin. “If we can show a customer that’s been running legacy workflows on CPU clusters for 15 years, ‘Hey, look, I can get you that same outcome, but 10x faster, sometimes even more,’ they’ll make the jump,” referring to adopting AMD Instinct GPUs and heterogenous workloads.

Porting for performance

Reflecting on porting components to HIP to support AMD Instinct GPUs in Ansys, Jia notes, “The AMD ROCm™ platform is progressing rapidly, and the HIP APIs used by our GPU Solver are comparable to CUDA APIs,” adding, “despite differences, finding corresponding APIs are straightforward.” Jia adds that the HIP compilers, built on Clang, have been “friendly to use.”

Jia shared some impressive benchmarking results, saying, “For large or high-fidelity simulations, such as analyzing the impact of turbulence on the aerodynamic design of Formula 1 cars for example, we observed that one MI300X GPU is able to produce competitive performance for the coupled solver.”

Figure 1 – HPC Competitive CFD Performance (AMD MI300X vs NVIDIA H100)
Click to enlarge
Figure 2 – Formula 1 Race Car CFD Workload
Click to enlarge

Utilizing AI alongside Ansys Fluent with AMD Instinct GPUs

Ansys is leveraging AI capabilities alongside its core simulation tools. For example, Ansys SimAI is a new machine learning platform that enables engineers to leverage previous simulation data to assess the performance of new designs within a given design envelope—in minutes instead of hours. GPUs are crucial in enabling rapid execution of both high-fidelity simulations that define the design envelope, as well as AI-driven tasks that rapidly explore new designs within it. McCaslin emphasizes, “GPUs enable AI predictions to happen in seconds or minutes. But to explore entirely new design spaces, you need high-fidelity simulations to define the bounds of the design envelope—and running these simulations quickly on GPUs reduces the bottleneck for the overall design process.” Rapid heterogeneous simulations facilitated by AMD Instinct GPUs make the nearly instantaneous AI results more impactful, ensuring an efficient overall workflow.

Moving forward starts with preserving the familiar

Ansys designed Fluent 2024 R2 to help users transition smoothly to GPU-enhanced workflows. Jia explains, “We make the transition easier by keeping the running environments exactly the same for our traditional CPU solver and the GPU solver.”

McCaslin adds, “Selecting the GPU Solver is literally a checkbox in the Fluent launcher. Everything else is the same,” he explains. For batch runs, users only need to add a “-gpu” flag to their command line. This accessibility ensures that users can leverage AMD Instinct GPU acceleration without learning new tools or significantly altering their workflows.

The benefits of GPU acceleration speak for themselves

The combination of enhanced performance and cost efficiency is a powerful driver for GPU adoption. As McCaslin says, “Performance, cost efficiency… it’s a no-brainer.” As more customers experience the benefits of GPU acceleration, he expects the shift towards GPU-enhanced workflows to continue accelerating.

Highlighting the overwhelmingly positive response, McCaslin says, “The user feedback around Fluent software running on CPUs AND GPUs is like, ‘Wow, this is impressive. We can do the same workloads, but we can do them so much faster, mind-blowingly faster.'” Such an enthusiastic reception has only reinforced Ansys’ commitment to prioritize GPU development for the Fluids product family.
Karr adds, “AMD is very excited by these results as well, providing the community access to the best technologies available. We look forward to continuing our strong partnership with Ansys, delivering best-in-class solutions to our users.”

Find recommendations on installing Ansys Fluent on AMD Instinct™ systems at AMD’s Infinity Hub Github

For details go to AMD Community Site

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

AMD Announces Flurry of New Chips

October 10, 2024

AMD today announced several new chips including its newest Instinct GPU — the MI325X — as it chases Nvidia. Other new devices announced at the company event in San Francisco included the 5th Gen AMD EPYC processors, Read more…

NSF Grants $107,600 to English Professors to Research Aurora Supercomputer

October 9, 2024

The National Science Foundation has granted $107,600 to English professors at US universities to unearth the mysteries of the Aurora supercomputer. The two-year grant recipients will write up what the Aurora supercompute Read more…

VAST Looks Inward, Outward for An AI Edge

October 9, 2024

There’s no single best way to respond to the explosion of data and AI. Sometimes you need to bring everything into your own unified platform. Other times, you lean on friends and neighbors to chart a way forward. Those Read more…

Google Reports Progress on Quantum Devices beyond Supercomputer Capability

October 9, 2024

A Google-led team of researchers has presented more evidence that it’s possible to run productive circuits on today’s near-term intermediate scale quantum devices that are beyond the reach of classical computing. � Read more…

At 50, Foxconn Celebrates Graduation from Connectors to AI Supercomputing

October 8, 2024

Foxconn is celebrating its 50th birthday this year. It started by making connectors, then moved to systems, and now, a supercomputer. The company announced it would build the supercomputer with Nvidia's Blackwell GPUs an Read more…

ZLUDA Takes Third Wack as a CUDA Emulator

October 7, 2024

The ZLUDA CUDA emulator is back in its third invocation. At one point, the project was quietly funded by AMD and demonstrated the ability to run unmodified CUDA applications with near-native performance on AMD GPUs. Cons Read more…

NSF Grants $107,600 to English Professors to Research Aurora Supercomputer

October 9, 2024

The National Science Foundation has granted $107,600 to English professors at US universities to unearth the mysteries of the Aurora supercomputer. The two-year Read more…

VAST Looks Inward, Outward for An AI Edge

October 9, 2024

There’s no single best way to respond to the explosion of data and AI. Sometimes you need to bring everything into your own unified platform. Other times, you Read more…

Google Reports Progress on Quantum Devices beyond Supercomputer Capability

October 9, 2024

A Google-led team of researchers has presented more evidence that it’s possible to run productive circuits on today’s near-term intermediate scale quantum d Read more…

At 50, Foxconn Celebrates Graduation from Connectors to AI Supercomputing

October 8, 2024

Foxconn is celebrating its 50th birthday this year. It started by making connectors, then moved to systems, and now, a supercomputer. The company announced it w Read more…

The New MLPerf Storage Benchmark Runs Without ML Accelerators

October 3, 2024

MLCommons is known for its independent Machine Learning (ML) benchmarks. These benchmarks have focused on mathematical ML operations and accelerators (e.g., Nvi Read more…

DataPelago Unveils Universal Engine to Unite Big Data, Advanced Analytics, HPC, and AI Workloads

October 3, 2024

DataPelago this week emerged from stealth with a new virtualization layer that it says will allow users to move AI, data analytics, and ETL workloads to whateve Read more…

Stayin’ Alive: Intel’s Falcon Shores GPU Will Survive Restructuring

October 2, 2024

Intel's upcoming Falcon Shores GPU will survive the brutal cost-cutting measures as part of its "next phase of transformation." An Intel spokeswoman confirmed t Read more…

How GenAI Will Impact Jobs In the Real World

September 30, 2024

There’s been a lot of fear, uncertainty, and doubt (FUD) about the potential for generative AI to take people’s jobs. The capability of large language model Read more…

Shutterstock_2176157037

Intel’s Falcon Shores Future Looks Bleak as It Concedes AI Training to GPU Rivals

September 17, 2024

Intel's Falcon Shores future looks bleak as it concedes AI training to GPU rivals On Monday, Intel sent a letter to employees detailing its comeback plan after Read more…

Granite Rapids HPC Benchmarks: I’m Thinking Intel Is Back (Updated)

September 25, 2024

Waiting is the hardest part. In the fall of 2023, HPCwire wrote about the new diverging Xeon processor strategy from Intel. Instead of a on-size-fits all approa Read more…

Ansys Fluent® Adds AMD Instinct™ MI200 and MI300 Acceleration to Power CFD Simulations

September 23, 2024

Ansys Fluent® is well-known in the commercial computational fluid dynamics (CFD) space and is praised for its versatility as a general-purpose solver. Its impr Read more…

AMD Clears Up Messy GPU Roadmap, Upgrades Chips Annually

June 3, 2024

In the world of AI, there's a desperate search for an alternative to Nvidia's GPUs, and AMD is stepping up to the plate. AMD detailed its updated GPU roadmap, w Read more…

Nvidia Shipped 3.76 Million Data-center GPUs in 2023, According to Study

June 10, 2024

Nvidia had an explosive 2023 in data-center GPU shipments, which totaled roughly 3.76 million units, according to a study conducted by semiconductor analyst fir Read more…

Shutterstock_1687123447

Nvidia Economics: Make $5-$7 for Every $1 Spent on GPUs

June 30, 2024

Nvidia is saying that companies could make $5 to $7 for every $1 invested in GPUs over a four-year period. Customers are investing billions in new Nvidia hardwa Read more…

Shutterstock 1024337068

Researchers Benchmark Nvidia’s GH200 Supercomputing Chips

September 4, 2024

Nvidia is putting its GH200 chips in European supercomputers, and researchers are getting their hands on those systems and releasing research papers with perfor Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Leading Solution Providers

Contributors

IBM Develops New Quantum Benchmarking Tool — Benchpress

September 26, 2024

Benchmarking is an important topic in quantum computing. There’s consensus it’s needed but opinions vary widely on how to go about it. Last week, IBM introd Read more…

Intel Customizing Granite Rapids Server Chips for Nvidia GPUs

September 25, 2024

Intel is now customizing its latest Xeon 6 server chips for use with Nvidia's GPUs that dominate the AI landscape. The chipmaker's new Xeon 6 chips, also called Read more…

Quantum and AI: Navigating the Resource Challenge

September 18, 2024

Rapid advancements in quantum computing are bringing a new era of technological possibilities. However, as quantum technology progresses, there are growing conc Read more…

Google’s DataGemma Tackles AI Hallucination

September 18, 2024

The rapid evolution of large language models (LLMs) has fueled significant advancement in AI, enabling these systems to analyze text, generate summaries, sugges Read more…

IonQ Plots Path to Commercial (Quantum) Advantage

July 2, 2024

IonQ, the trapped ion quantum computing specialist, delivered a progress report last week firming up 2024/25 product goals and reviewing its technology roadmap. Read more…

Microsoft, Quantinuum Use Hybrid Workflow to Simulate Catalyst

September 13, 2024

Microsoft and Quantinuum reported the ability to create 12 logical qubits on Quantinuum's H2 trapped ion system this week and also reported using two logical qu Read more…

US Implements Controls on Quantum Computing and other Technologies

September 27, 2024

Yesterday the Commerce Department announced export controls on quantum computing technologies as well as new controls for advanced semiconductors and additive Read more…

Everyone Except Nvidia Forms Ultra Accelerator Link (UALink) Consortium

May 30, 2024

Consider the GPU. An island of SIMD greatness that makes light work of matrix math. Originally designed to rapidly paint dots on a computer monitor, it was then Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire