Heterogeneous Computing in Firing Range

By Nicole Hemsoth

April 8, 2013

There is nothing static for scientific and technical computing developers on the horizon as the industry pushes an ever-expanding bevy of specialized co-processors, GPUs, FPGAs and other elements into the mix. The problem, however, is how far users are willing to push themselves to wrap around new code for the simple sake of performance gains.

Despite developer hassle, this is a great problem from the perspective of companies who are finding ways to tailor clean layers around complex code for heterogeneous computing.

Take, for example, Atlanta-based AccelerEyes, which is seeing booming business because of the demand for GPU acceleration and interest in kicking the Xeon Phi co-processor tires. The company’s emphasis falls right in line with the needs of technical computing folks, with acceleration focused on C, C++ and Fortran codes, and it has a firm foothold for a company its size in the “big, small world” of HPC.

The Atlanta-based company was initially founded in 2007 to give MATLAB a performance boost with GPUs via their Jacket product, which is still a core offering that zips over some of the dev complexity. They recently worked with MathWorks again on bringing its Parallel Toolbox to life, which targets heterogeneous computing for scientific and technical users.

Their real emphasis going forward is likely to be on the multi-GPU and co-processor ArrayFire offering, which they unveiled in 2011 to extend a much longer arm to technical computing users interested in dipping their toes into GPU waters. As the company’s CEO, John Melonakos detailed for us in a recent conversation, ArrayFire aims at making CUDA, OpenCL and now Phi within closer developer range. ArrayFire has a broad library of functions for CUDA and to a lesser extent, OpenCL and now several users kicking the new tires on Phi.

So far ArrayFire has offered some significant speedups to research at NASA, which uses AccelerEyes to boost Mars Rover image compression via GPUs and genetic algorithms to the tune of 5x. They’ve also worked with financial services firms on the quant side for 37x speedups, geolocation for government agencies at a 17x boost, and a number of oil and gas companies, which are seeing big speedups for everything from ground water simulations to 3D mantle geodynamics applications.  

The use cases cited were all using GPU acceleration, but new innovations in heterogeneous computing will unleash some new examples of high-gear performance. In particular, he says Phi is a promising technology, even if Intel has a lot of work to do to catch up with the multi-year development of other vendors, especially NVIDIA. “There will be some advantages in terms of Intel’s software stack, which already has a solid user base…I think the Intel tools will be a big advantage for Phi but NVIDIA has taken a strong lead.”

Melonakos strongly believes that heterogeneous computing will be one of the biggest trends in the next decade of computing. “The parallelism that exists in workloads today can’t be ignored and GPUs really are a great immediate architecture to attack data parallel workloads in an energy efficiency manner. However, other heterogeneous options are on the rise and will play out over the coming decade. For instance, Phi hold great promise in terms of usability and technology roadmap, but Intel won’t be the immediate leader by any means—there are serious investments they need to make.”

At this point, however, CUDA is the real star of the GPU computing show because of NVIDIA’s commitment to seeing it through. Melonakos says that his company is still working on an OpenCL release for ArrayFire as they “wait for that software stack to mature a bit.” He notes that AMD has done a decent job of pushing OpenCL but they’re “tapped out on their push and have become just one of the players” but it has been getting a fresh amount of attention because of the buzz around Phi.

The AccelerEyes CEO says that the number of discussions about OpenCL his team has noted from around the ecosystem has definitely picked up in the wake of Phi. He expects this to continue since OpenCL lives something of a double life between the worlds of HPC and mobile, consumer computing. What drives both of these areas is that they’re equally committed to solving core problems around heterogeneous computing since they’re basically the same aside from the levels of computation and power considerations. In the end, whether for HPC or mobile users, the goal is to make use of every micrometer of hardware available.

Melonakos says that the industry is very fluid and kept healthy through active competition. However, he notes that for a small company like his, there’s no way that they can influence what is currently a dominant proprietary approach to GPU computing that makes developers have to commit to one or another. But open, portable standards are needed for developers to fully tap into the power of heterogeneous computing.

On that note, if you have a little time and don’t need a lot of eye candy with your media, there’s a pretty meaty presentation below on some of the tradeoffs between OpenCL and CUDA from AccelerEyes’ view. It also provides some solid points of comparison when it comes to using an abstraction layer over the two.

Although small now, AccelerEyes is set to grow in the wake of the rise of GPU and co-processor-driven projects. Melonakos said that they’re only a 20-person shop for now, but they’re adding new people to work on the ArrayFire offering weekly and are actively seeking new developers to join their ranks.

Related Articles

AccelerEyes Adds Support for Xeon Phi

CAPS and AccelerEyes Integrate ArrayFire Into DevDeck

AccelerEyes Intros New Version of Jacket

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

TACC Supercomputing Powers Climate Modeling for Fisheries

January 28, 2023

A tremendous portion of the world depends on the output of the oceans’ major fisheries, which have, in recent decades, found themselves under near-constant threat from mismanagement (e.g. overfishing). Climate change, Read more…

PFAS Regulations, 3M Exit to Impact Two-Phase Cooling in HPC

January 27, 2023

Per- and polyfluoroalkyl substances (PFAS), known as “forever chemicals,” pose a number of health risks to humans, with more suspected but not yet confirmed – and, as a result, PFAS are coming under increasing regu Read more…

Sweden Plans Expansion for Nvidia-Powered Berzelius Supercomputer

January 26, 2023

The Atos-built, Nvidia SuperPod-based Berzelius supercomputer – housed in and operated by Sweden’s Linköping-based National Supercomputer Centre (NSC) – is already no slouch. But now, Nvidia and NSC have announced Read more…

Multiverse, Pasqal, and Crédit Agricole Tout Progress Using Quantum Computing in FS

January 26, 2023

Europe-based quantum computing pioneers Multiverse Computing and Pasqal, and global bank Crédit Agricole CIB today announced successful conclusion of a 1.5-year POC study “to evaluate the contribution of an algorithmi Read more…

Critics Don’t Want Politicians Deciding the Future of Semiconductors

January 26, 2023

The future of the semiconductor industry was partially being decided last week by a mix of politicians, policy hawks and chip industry executives jockeying for influence at the World Economic Forum. Intel CEO Pat Gels Read more…

AWS Solution Channel

Shutterstock_1687123447

Numerix Scales HPC Workloads for Price and Risk Modeling Using AWS Batch

  • 180x improvement in analytics performance
  • Enhanced risk management
  • Decreased bottlenecks in analytics
  • Unlocked near-real-time analytics
  • Scaled financial analytics

Overview

Numerix, a financial technology company, needed to find a way to scale its high performance computing (HPC) solution as client portfolios ballooned in size. Read more…

Microsoft/NVIDIA Solution Channel

Shutterstock 1453953692

Microsoft and NVIDIA Experts Talk AI Infrastructure

As AI emerges as a crucial tool in so many sectors, it’s clear that the need for optimized AI infrastructure is growing. Going beyond just GPU-based clusters, cloud infrastructure that provides low-latency, high-bandwidth interconnects and high-performance storage can help organizations handle AI workloads more efficiently and produce faster results. Read more…

Riken Plans ‘Virtual Fugaku’ on AWS

January 26, 2023

The development of a national flagship supercomputer aimed at exascale computing continues to be a heated competition, especially in the United States, the European Union, China, and Japan. What is the value to be gained Read more…

PFAS Regulations, 3M Exit to Impact Two-Phase Cooling in HPC

January 27, 2023

Per- and polyfluoroalkyl substances (PFAS), known as “forever chemicals,” pose a number of health risks to humans, with more suspected but not yet confirmed Read more…

Critics Don’t Want Politicians Deciding the Future of Semiconductors

January 26, 2023

The future of the semiconductor industry was partially being decided last week by a mix of politicians, policy hawks and chip industry executives jockeying for Read more…

Riken Plans ‘Virtual Fugaku’ on AWS

January 26, 2023

The development of a national flagship supercomputer aimed at exascale computing continues to be a heated competition, especially in the United States, the Euro Read more…

Shutterstock 1134313550

Semiconductor Companies Create Building Block for Chiplet Design

January 24, 2023

Intel's CEO Pat Gelsinger last week made a grand proclamation that chips will be for the next few decades what oil and gas was to the world over the last 50 years. While that remains to be seen, two technology associations are joining hands to develop building blocks to stabilize the development of future chip designs. The goal of the standard is to set the stage for a thriving marketplace that fuels... Read more…

Royalty-free stock photo ID: 1572060865

Fujitsu Study Says Quantum Decryption Threat Still Distant

January 23, 2023

Global computer and chip manufacturer Fujitsu today reported that a new study performed on its 39-qubit quantum simulator suggests it will remain difficult for Read more…

At ORNL, Jeff Smith Becomes Interim Director, as Search for Permanent Lab Chief Continues

January 20, 2023

UT-Battelle, which manages Oak Ridge National Laboratory (ORNL) for the U.S. Department of Energy, has appointed Jeff Smith as interim director for the lab as t Read more…

Top HPC Players Creating New Security Architecture Amid Neglect

January 20, 2023

Security of high-performance computers is being neglected in the pursuit of horsepower, and there are concerns that the ignorance may be costly if safeguards ar Read more…

Ohio Supercomputer Center Debuts ‘Ascend’ GPU Cluster

January 19, 2023

Less than 10 months after it was announced, the Columbus-based Ohio Supercomputer Center (OSC) has debuted its Dell-built GPU cluster, “Ascend.” Designed to Read more…

Leading Solution Providers

Contributors

SC22 Booth Videos

AMD @ SC22
Altair @ SC22
AWS @ SC22
Ayar Labs @ SC22
CoolIT @ SC22
Cornelis Networks @ SC22
DDN @ SC22
Dell Technologies @ SC22
HPE @ SC22
Intel @ SC22
Intelligent Light @ SC22
Lancium @ SC22
Lenovo @ SC22
Microsoft and NVIDIA @ SC22
One Stop Systems @ SC22
Penguin Solutions @ SC22
QCT @ SC22
Supermicro @ SC22
Tuxera @ SC22
Tyan Computer @ SC22
  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire