Azure Edges AWS in Linpack Benchmark Study

By John Russell

February 15, 2017

The “when will clouds be ready for HPC” question has ebbed and flowed for years. It seems clear that for at least some workloads and on some clouds, the answer is now. HPC cloud specialist Nimbix, for example, focuses on providing fast interconnect, large memory, and heterogeneous architecture specifically tailored for HPC. The goliath public clouds have likewise steadily incorporated needed technology and (perhaps less decisively) pricing options.

A new study posted on arXiv.org last week – Comparative benchmarking of cloud computing vendors with High Performance Linpack – authored by Exabyte.io, an admittedly biased source, reports the answer is an unambiguous yes to the question of whether popular clouds can accommodate HPC and further examines some of the differences between a few of the major players.

“For high performance computing (HPC) workloads that traditionally required large and cost-intensive hardware procurement, the feasibility and advantages of cloud computing are still debated. In particular, it is often questioned whether software applications that require distributed memory can be efficiently run on ”commodity” compute infrastructure publicly available from cloud computing vendors,” write the authors, Mohammad Mohammadi, Timur Bazhirov of Exabyte.io.

“We benchmarked the performance of the best available computing hardware from public cloud providers with high performance Linpack. We optimized the benchmark for each computing environment and evaluated the relative performance for distributed memory calculations. We found Microsoft Azure to deliver the best results, and demonstrated that the performance per single computing core on public cloud to be comparable to modern traditional supercomputing systems.

“Based on our findings we suggest that the concept of high performance computing in the cloud is ready for a widespread adoption and can provide a viable and cost-efficient alternative to capital-intensive on- premises hardware deployments.”

Exabyte.io is a young company building a cloud-based environment to assist organizations with materials design – hence it has a horse in the race. Company marketing info on its website states, “Exabyte.io powers the adoption of high-performance cloud computing for design and discovery of advanced materials, devices and chemicals from nanoscale. We combine high fidelity simulation techniques, large-scale data analytics and machine learning tools into a hosted environment available for public, private and hybrid cloud deployments.”

Leaving its interest aside the study is interesting. Here’s a list of the cloud offerings evaluated:

The benchmarking was done using the High Performance Linpack (HPL) program, which solves a random system of linear equations, represented by a dense matrix, in double precision (64 bits) arithmetic on distributed-memory computers. “It does so through a two-dimensional block- cyclic data distribution, and right-looking variant of the LU factorization with row partial pivoting.” It is a portable and freely available software package.

Three different AWS scenarios were tested including– hyper-threaded, non-hyper-threaded, and non-hyper-threaded with placement groups. On Azure, three different instance types were used, F-series, A-series, and H-series. Compute1-60 instances were used on Rackspace. The benchmark was also run on NERSC Edison supercomputer with hyper-threading enabled. Edison, of course, is a Cray XC30, with a peak performance of 2.57 PFLOPS, 133,824 compute cores, 357 terabytes of memory, and 7.56 petabytes of disk, holding number 60 rank on the top500. Specific configurations shown below.

In many cases the performances were quite similar but each also had strengths and weaknesses. For example, network saturation at scale and slower processor clock speeds affected IBM Softlayer’s performance according to the study. The authors also noted: “AWS and Rackspace show a significant degree of parallel performance degradation, such that at 32 nodes the measured performance is about one-half of the peak value.”

The brief paper is best read in full for the details. The performance data for each of the clouds is presented. Below is a summary figure of cloud performances.

Figure 1: Speedup ratios (the ratios of maximum speedup Rmax to peak speedup Rpeak) against the number of nodes for all benchmarked cases. Speedup ratio for 1,2,4,8,16 and 32 nodes are investigated and given by points. Lines are drawn to guide the eye. The legend is as follows: AWS – Amazon Web Services in the default hyper-threaded regime; AWS-NHT – same, with hyperthreading disabled; AWS-NHT- PG – same, with placement group option enabled; AZ – Mi- crosoft Azure standard F16 instances; AZ-IB-A – same provider, A9 instances; AZ-IB-H – same provider, H16 instances; RS – Rackspace compute1-60 instances; SL – IBM/Softlayer virtual servers; NERSC – Edison computing facility of the National Energy Research Scientific Computing Center.

On balance, argue the authors, “Our results demonstrate that the current generation of publicly available cloud computing systems are capable of delivering comparable, if not better, performance than the top-tier traditional high performance computing systems. This fact confirms that cloud computing is already a viable and cost-effective alternative to traditional cost- intensive supercomputing procurement.”

Here is a link to the paper on arXiv.org: https://arxiv.org/pdf/1702.02968.pdf

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Nvidia Touts Strong Results on Financial Services Inference Benchmark

February 3, 2023

The next-gen Hopper family may be on its way, but that isn’t stopping Nvidia’s popular A100 GPU from leading another benchmark on its way out. This time, it’s the STAC-ML inference benchmark, produced by the Securi Read more…

Quantum Computing Firm Rigetti Faces Delisting

February 3, 2023

Quantum computing companies are seeing their market caps crumble as investors patiently await out the winner-take-all approach to technology development. Quantum computing firms such as Rigetti Computing, IonQ and D-Wave went public through mergers with blank-check companies in the last two years, with valuations at the time of well over $1 billion. Now the market capitalization of these companies are less than half... Read more…

US and India Strengthen HPC, Quantum Ties Amid Tech Tension with China

February 2, 2023

Last May, the United States and India announced the “Initiative on Critical and Emerging Technology” (iCET), aimed at expanding the countries’ partnerships in strategic technologies and defense industries across th Read more…

Pittsburgh Supercomputing Enables Transparent Medicare Outcome AI

February 2, 2023

Medical applications of AI are replete with promise, but stymied by opacity: with lives on the line, concerns over AI models’ often-inscrutable reasoning – and as a result, possible biases embedded in those models Read more…

Europe’s LUMI Supercomputer Has Officially Been Accepted

February 1, 2023

“LUMI is officially here!” proclaimed the headline of a blog post written by Pekka Manninen, director of science and technology for CSC, Finland’s state-owned IT center. The EuroHPC-organized supercomputer’s most Read more…

AWS Solution Channel

Shutterstock 2069893598

Cost-effective and accurate genomics analysis with Sentieon on AWS

This blog post was contributed by Don Freed, Senior Bioinformatics Scientist, and Brendan Gallagher, Head of Business Development at Sentieon; and Olivia Choudhury, PhD, Senior Partner Solutions Architect, Sujaya Srinivasan, Genomics Solutions Architect, and Aniket Deshpande, Senior Specialist, HPC HCLS at AWS. Read more…

Microsoft/NVIDIA Solution Channel

Shutterstock 1453953692

Microsoft and NVIDIA Experts Talk AI Infrastructure

As AI emerges as a crucial tool in so many sectors, it’s clear that the need for optimized AI infrastructure is growing. Going beyond just GPU-based clusters, cloud infrastructure that provides low-latency, high-bandwidth interconnects and high-performance storage can help organizations handle AI workloads more efficiently and produce faster results. Read more…

Intel’s Gaudi3 AI Chip Survives Axe, Successor May Combine with GPUs

February 1, 2023

Intel's paring projects and products amid financial struggles, but AI products are taking on a major role as the company tweaks its chip roadmap to account for more computing specifically targeted at artificial intellige Read more…

Quantum Computing Firm Rigetti Faces Delisting

February 3, 2023

Quantum computing companies are seeing their market caps crumble as investors patiently await out the winner-take-all approach to technology development. Quantum computing firms such as Rigetti Computing, IonQ and D-Wave went public through mergers with blank-check companies in the last two years, with valuations at the time of well over $1 billion. Now the market capitalization of these companies are less than half... Read more…

US and India Strengthen HPC, Quantum Ties Amid Tech Tension with China

February 2, 2023

Last May, the United States and India announced the “Initiative on Critical and Emerging Technology” (iCET), aimed at expanding the countries’ partnership Read more…

Intel’s Gaudi3 AI Chip Survives Axe, Successor May Combine with GPUs

February 1, 2023

Intel's paring projects and products amid financial struggles, but AI products are taking on a major role as the company tweaks its chip roadmap to account for Read more…

Roadmap for Building a US National AI Research Resource Released

January 31, 2023

Last week the National AI Research Resource (NAIRR) Task Force released its final report and roadmap for building a national AI infrastructure to include comput Read more…

PFAS Regulations, 3M Exit to Impact Two-Phase Cooling in HPC

January 27, 2023

Per- and polyfluoroalkyl substances (PFAS), known as “forever chemicals,” pose a number of health risks to humans, with more suspected but not yet confirmed Read more…

Multiverse, Pasqal, and Crédit Agricole Tout Progress Using Quantum Computing in FS

January 26, 2023

Europe-based quantum computing pioneers Multiverse Computing and Pasqal, and global bank Crédit Agricole CIB today announced successful conclusion of a 1.5-yea Read more…

Critics Don’t Want Politicians Deciding the Future of Semiconductors

January 26, 2023

The future of the semiconductor industry was partially being decided last week by a mix of politicians, policy hawks and chip industry executives jockeying for Read more…

Riken Plans ‘Virtual Fugaku’ on AWS

January 26, 2023

The development of a national flagship supercomputer aimed at exascale computing continues to be a heated competition, especially in the United States, the Euro Read more…

Leading Solution Providers

Contributors

SC22 Booth Videos

AMD @ SC22
Altair @ SC22
AWS @ SC22
Ayar Labs @ SC22
CoolIT @ SC22
Cornelis Networks @ SC22
DDN @ SC22
Dell Technologies @ SC22
HPE @ SC22
Intel @ SC22
Intelligent Light @ SC22
Lancium @ SC22
Lenovo @ SC22
Microsoft and NVIDIA @ SC22
One Stop Systems @ SC22
Penguin Solutions @ SC22
QCT @ SC22
Supermicro @ SC22
Tuxera @ SC22
Tyan Computer @ SC22
  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire