New Thoughts on Leveraging Cloud for Advanced AI

May 25, 2023

Artificial intelligence (AI) is becoming critical to many operations within companies. As the use and sophistication of AI grow, there is a new focus on the infrastructure requirements to produce results fast and efficiently. Many companies find that firing up cloud instances is not enough. Instead, companies must take a more strategic view of their cloud adoption to have the IT foundation required to fully use state-of-the-art AI. Doing so can deliver significant results across a wide variety of industries.

Specifically, AI requires an infrastructure that can meet the constantly increasing demands for high-performance compute and specialized needs of AI applications and workloads such as natural language processing, machine learning, and deep learning. To that point, a suitable infrastructure to support advanced AI must easily scale up and out.

Cloud infrastructure purpose-built for advanced AI

The recent Harvard Business Review whitepaper Analytic Services Rethinking Cloud Strategies for Advance AI noted the benefits of such an AI-first infrastructure and quantified how companies in different industries benefit from its use. According to the white paper, advanced AI applications must be supported by a cutting-edge infrastructure that provides the performance, flexibility, and scalability that these applications demand. But not just any cloud will do.

The diversity of cloud offerings gives organizations many options for their AI needs. That is particularly the case with generative AI. So, the question has shifted from whether to use the cloud for AI applications to which cloud provider best aligns with a company’s strategic vision for AI. The selection will depend on the capabilities of the cloud vendor and the ecosystem of partners and vendors that is built around the vendor’s offerings.

These and other points were the subjects of a recent Harvard Business Review Analytic Services WebinarRethinking Cloud Strategies for Advanced AI. The webinar discussed cloud strategies to support advanced AI. (The webinar can be viewed on-demand here.) the speakers included IDC’s Ritu Jyoti and Nidhi Chappell, General Manager of Azure HPC for AI, SAP and Confidential Computing, at Microsoft. Their talk examined how advanced AI creates unprecedented growth opportunities, the problems companies face related to cloud and AI technologies, and how to choose the right cloud platform for your AI goals.

Let’s look at some examples from the leading companies in healthcare, automotive, fashion, and conservation featured in this Harvard Business Review Analytic Services whitepaper.

Innovative AI-led personalized cancer treatments

While radiology used to diagnose cancer has long embraced AI, Elekta, a Stockholm-based Swedish maker of precision radiation therapy solutions, focused on a related but more involved area: radiotherapy, which is used to treat cancer. Elekta found that many people worldwide do not have access to the needed personalized therapy, not because of a lack of technology but because of insufficient medical personnel from diverse disciplines that must collaborate to ensure the correct adjustments are made to treatment plans.

“We realized the tsunami of AI innovations that were happening in the computer vision and text recognition fields were eventually going to find their way into the medical field, as well,” said Rui Lopes, Director of New Technology Assessment at Elekta.

To address the problem, it embeds intelligence into devices to increase access to treatment for a larger swath of patients worldwide. “This provides not just personalization of care but democratization of a standard of care, allowing more advanced protocols to be deployed in regions of the world that lack the human capital to do so now,” said Lopes.

The models Elekta uses must easily scale. “You need to radically scale up the amount of data you use,” said Adam Moore, Director of Global Cloud Solutions for Elekta. “By training the models in the cloud, you can identify those problems earlier and build resilience into your compute infrastructure, so you avoid hardware failures.”

“We rely heavily on Azure cloud infrastructure. With Azure, we can create virtual machines on the fly with specific GPUs. If that’s not enough, we can cancel that virtual machine, create a new one, and then scale up as the project demands,” says Silvain Beriault, Lead Research Scientist at Elekta.

Developing a new generation of autonomous vehicles

Wayve, a London startup, is trying to bring deep learning and AI to the next generation of autonomous driving, something it calls AV2.0 (autonomous vehicles 2.0). In particular, the company wants to accelerate and scale autonomous vehicle development by using vision-based machine learning for rapid prototyping and quick iteration.

“Advanced AI, the latest and greatest, is absolutely pivotal to what we’re doing,” says Jamie Shotton, Chief Scientist at Wayve. “We have to train the algorithm on petabytes and potentially greater amounts of data that we’ve captured from our fleet of cars, which is a radically different approach to autonomous self-driving than anyone has done before.”

Moving to Azure infrastructure allows Wayve to rapidly improve its iteration speed and innovation rate for new autonomy features, which, in turn, helps cars drive better. Through its use of Azure Machine Learning, the company trains its AV2.0 models 90 percent faster compared to its previous data center environment.

“Using a managed platform gives us the ability to scale quickly and reliably. It also allows us to focus our efforts doing the research and solving problems around autonomous self-driving rather than building additional tools ourselves,” Shotton says.

Creating new fashions at the speed of the market

Fashion is one of the fastest-growing, most lucrative, and demanding industries, with high expectations of quick turnaround rates, creative designs, and a constant parade of new styles. As such, Portugal-based Fashable is trying to change the fashion industry with AI.

“In the near future, you will have a digital closet of clothing designs that you can ask a manufacturer to produce just for you,” says Orlando Ribas Fernandes, CEO and Co-founder of Fashable, ““We will use the metaverse to create physical goods that are exclusive to each person.”

Using Azure AI infrastructure, powered by NVIDIA GPUs for deep learning, Fashable built a generative AI application that can create dozens of original AI-generated clothing designs in minutes without the need for actual material. The algorithm ingests data from multiple sources to learn about trends, styles, and clothing types. Using social media to do A/B tests directly with customers lets designers gauge interest and forecast demand for their particular creations before going into production.

“We can share the collection with customers before they are produced, avoiding the problem of overstock,” says Orlando Ribas Fernandes, CEO and Co-founder of Fashable.

Protecting endangered species from wildlife crime

Wildlife Protection Solutions (WPS) use artificial intelligence on remote camera images for the conservation of endangered species and ecosystems. Its work helps recognize threats, classify species, and aids in anti-poaching to prevent human-wildlife conflict.

“Conservation is a huge challenge globally, and we’re not necessarily winning the war,” says Eric Schmidt, Executive Director of the Organization. To improve its odds in the fight, WPS is arming itself with AI models that search images from thousands of camera feeds, looking for humans and vehicles that may be engaged in suspicious activities or animals that may be encroaching on human populations.

For its AI needs, WPS uses Microsoft Azure’s purpose-built AI infrastructure powered by NVIDIA GPUs. For example, the group’s wpsWatch platform analyzes and monitors many inbound images from the remote cameras hosted in more than 100 sites across almost 20 counties. It is powered by Microsoft Azure VMs (virtual machines) with NVIDIA GPUs (graphics processing units) and was initially focused on the security and anti-poaching elements of the group’s mission.

A look to the future

These examples demonstrate the growing use of purpose-built infrastructure for AI. As companies increasingly adopt the latest AI technologies, like Generative AI, to transform their applications and derive business and economic value from AI, access to such an infrastructure will be critical for quickly getting value from AI economically.

Learn more

Read the Harvard Business Review Analytic Services whitepaper “Rethinking Cloud Strategies for Advance AI“ and watch the webinar.

Visit the Microsoft and NVIDIA HPCwire Solution Channel for more articles and insights.

#MakeAIYourReality
#AzureHPCAI
#NVIDIAonAzure

Return to Solution Channel Homepage
Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

MLPerf Inference 4.0 Results Showcase GenAI; Nvidia Still Dominates

March 28, 2024

There were no startling surprises in the latest MLPerf Inference benchmark (4.0) results released yesterday. Two new workloads — Llama 2 and Stable Diffusion XL — were added to the benchmark suite as MLPerf continues Read more…

Q&A with Nvidia’s Chief of DGX Systems on the DGX-GB200 Rack-scale System

March 27, 2024

Pictures of Nvidia's new flagship mega-server, the DGX GB200, on the GTC show floor got favorable reactions on social media for the sheer amount of computing power it brings to artificial intelligence.  Nvidia's DGX Read more…

Call for Participation in Workshop on Potential NSF CISE Quantum Initiative

March 26, 2024

Editor’s Note: Next month there will be a workshop to discuss what a quantum initiative led by NSF’s Computer, Information Science and Engineering (CISE) directorate could entail. The details are posted below in a Ca Read more…

Waseda U. Researchers Reports New Quantum Algorithm for Speeding Optimization

March 25, 2024

Optimization problems cover a wide range of applications and are often cited as good candidates for quantum computing. However, the execution time for constrained combinatorial optimization applications on quantum device Read more…

NVLink: Faster Interconnects and Switches to Help Relieve Data Bottlenecks

March 25, 2024

Nvidia’s new Blackwell architecture may have stolen the show this week at the GPU Technology Conference in San Jose, California. But an emerging bottleneck at the network layer threatens to make bigger and brawnier pro Read more…

Who is David Blackwell?

March 22, 2024

During GTC24, co-founder and president of NVIDIA Jensen Huang unveiled the Blackwell GPU. This GPU itself is heavily optimized for AI work, boasting 192GB of HBM3E memory as well as the the ability to train 1 trillion pa Read more…

MLPerf Inference 4.0 Results Showcase GenAI; Nvidia Still Dominates

March 28, 2024

There were no startling surprises in the latest MLPerf Inference benchmark (4.0) results released yesterday. Two new workloads — Llama 2 and Stable Diffusion Read more…

Q&A with Nvidia’s Chief of DGX Systems on the DGX-GB200 Rack-scale System

March 27, 2024

Pictures of Nvidia's new flagship mega-server, the DGX GB200, on the GTC show floor got favorable reactions on social media for the sheer amount of computing po Read more…

NVLink: Faster Interconnects and Switches to Help Relieve Data Bottlenecks

March 25, 2024

Nvidia’s new Blackwell architecture may have stolen the show this week at the GPU Technology Conference in San Jose, California. But an emerging bottleneck at Read more…

Who is David Blackwell?

March 22, 2024

During GTC24, co-founder and president of NVIDIA Jensen Huang unveiled the Blackwell GPU. This GPU itself is heavily optimized for AI work, boasting 192GB of HB Read more…

Nvidia Looks to Accelerate GenAI Adoption with NIM

March 19, 2024

Today at the GPU Technology Conference, Nvidia launched a new offering aimed at helping customers quickly deploy their generative AI applications in a secure, s Read more…

The Generative AI Future Is Now, Nvidia’s Huang Says

March 19, 2024

We are in the early days of a transformative shift in how business gets done thanks to the advent of generative AI, according to Nvidia CEO and cofounder Jensen Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

Nvidia Showcases Quantum Cloud, Expanding Quantum Portfolio at GTC24

March 18, 2024

Nvidia’s barrage of quantum news at GTC24 this week includes new products, signature collaborations, and a new Nvidia Quantum Cloud for quantum developers. Wh Read more…

Alibaba Shuts Down its Quantum Computing Effort

November 30, 2023

In case you missed it, China’s e-commerce giant Alibaba has shut down its quantum computing research effort. It’s not entirely clear what drove the change. Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Shutterstock 1285747942

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

DoD Takes a Long View of Quantum Computing

December 19, 2023

Given the large sums tied to expensive weapon systems – think $100-million-plus per F-35 fighter – it’s easy to forget the U.S. Department of Defense is a Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Baidu Exits Quantum, Closely Following Alibaba’s Earlier Move

January 5, 2024

Reuters reported this week that Baidu, China’s giant e-commerce and services provider, is exiting the quantum computing development arena. Reuters reported � Read more…

Leading Solution Providers

Contributors

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Shutterstock 1179408610

Google Addresses the Mysteries of Its Hypercomputer 

December 28, 2023

When Google launched its Hypercomputer earlier this month (December 2023), the first reaction was, "Say what?" It turns out that the Hypercomputer is Google's t Read more…

AMD MI3000A

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Shutterstock 1606064203

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

Google Introduces ‘Hypercomputer’ to Its AI Infrastructure

December 11, 2023

Google ran out of monikers to describe its new AI system released on December 7. Supercomputer perhaps wasn't an apt description, so it settled on Hypercomputer Read more…

China Is All In on a RISC-V Future

January 8, 2024

The state of RISC-V in China was discussed in a recent report released by the Jamestown Foundation, a Washington, D.C.-based think tank. The report, entitled "E Read more…

Intel Won’t Have a Xeon Max Chip with New Emerald Rapids CPU

December 14, 2023

As expected, Intel officially announced its 5th generation Xeon server chips codenamed Emerald Rapids at an event in New York City, where the focus was really o Read more…

IBM Quantum Summit: Two New QPUs, Upgraded Qiskit, 10-year Roadmap and More

December 4, 2023

IBM kicks off its annual Quantum Summit today and will announce a broad range of advances including its much-anticipated 1121-qubit Condor QPU, a smaller 133-qu Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire