Optimizing Codes for Heterogeneous HPC Clusters Using OpenACC

July 3, 2017

Looking at the Top500 and Green500 ranks, one clearly realizes that most HPC systems are heterogeneous architecture using COTS (Commercial Off-The-Shelf) hardware, combining traditional multi-core CPUs with massively parallel accelerators, such as GPUs and MICs. With processor frequencies now hitting a solid wall, the only truly open avenue for riding today the Moore’s law is increasing hardware parallelism in several different ways: more computing nodes, more processors in each node, more cores within each processor, and longer vector instructions in each core. Read more…

By Enrico Calore et. al.

Compilers and More: OpenACC to OpenMP (and back again)

June 29, 2016

In the last year or so, I’ve had several academic researchers ask me whether I thought it was a good idea for them to develop a tool to automatically convert OpenACC programs to OpenMP 4 and vice versa. In each case, the motivation was that some systems had OpenMP 4 compilers (x86 plus Intel Xeon Phi Knights Corner) and others had OpenACC (x86 plus NVIDIA GPU or AMD GPU), and someone wanting to run a program across both would need two slightly different programs. In each case, the proposed research sounded like a more-or-less mechanical translation process, something more like a sophisticated awk script, and that’s doomed from the start. I will explain below in more detail how I came to this conclusion. Read more…

By Michael Wolfe

A Comparison of Heterogeneous and Manycore Programming Models

March 2, 2015

The high performance computing (HPC) community is heading toward the era of exascale machines, expected to exhibit an unprecedented level of complexity and size Read more…

By Yonghong Yan, Barbara M. Chapman and Michael Wong

New Degrees of Parallelism, Old Programming Planes

August 28, 2014

Exploiting the capabilities of HPC hardware is now more a matter of pushing into deeper levels of parallelism versus adding more cores or overclocking. What thi Read more…

By Nicole Hemsoth

Parallel Programming with OpenMP

July 31, 2014

One of the most important tools in the HPC programmer's toolbox is OpenMP, a standard for expressing shared memory parallelism that was published in 1997. The c Read more…

By Tiffany Trader

A Data Locality Cure for Irregular Applications

February 18, 2014

Data locality plays a critical role in energy-efficiency and performance in parallel programs. For data-parallel algorithms where locality is abundant, it is a Read more…

By Carlo del Mundo

Photo courtesy of Nor-Tech

Unleashing The Potential of OpenMP via Bottleneck Analysis

February 13, 2014

To capitalize on the computational potential of parallel processors, programmers must identify bottlenecks that limit their application. These bottlenecks typic Read more…

By Carlo del Mundo, Contributing Editor

The Week in HPC Research

March 21, 2013

The top research stories of the week include an evaluation of sparse matrix multiplication performance on Xeon Phi versus four other architectures; a survey of HPC energy efficiency; performance modeling of OpenMP, MPI and hybrid scientific applications using weak scaling; an exploration of anywhere, anytime cluster monitoring; and a framework for data-intensive cloud storage. Read more…

By Tiffany Trader

  • arrow
  • Click Here for More Headlines
  • arrow

Leading Solution Providers

  • Off The Wire

  • Industry Headlines

More Off The Wire

Full & Complete coverage of SC17

Keynote Reviews, Analysts Write Ups, Booth Vidoes, Student Competition, Awards and so much

Did you miss out on Supercomputing 2017? Did you attend, but were stuck in meetings the whole time without an opportunity to walk the show floor and see what new announcements were being made? HPCwire's got you covered, we visited some of the hottest booths in the exhibit hall and spoke with their top executives to get the scoop on the latest solutions, partnerships, and product announcements.

Click here to view HPCwire Coverage of SC17


Avoiding the Storage Silo Trap; Best Practices for Data Storage in Scientific Research

From mismatches between compute and storage capabilities to colossal data volumes, data storage presents a number of challenges for scientific research. And as silos pop up and challenges expand, the pace of research often suffers.

Download this report

Sponsored by Quantum


Creating a Modular, Building-Block Architecture for Life Science Workflows

As genomic data becomes ubiquitous, infrastructure bottlenecks for life sciences organizations are narrowing. But speedy analysis and real-time decision making don't have to remain out of reach: modern end-to-end systems are emerging as flexible solutions for a competitive edge.

Download this report

Sponsored by Re-Store

Advanced Scale Career Development & Workforce Enhancement Center

Featured Advanced Scale Jobs:

Receive the Monthly
Advanced Computing Job Bank Resource:

HPCwire Resource Library

HPCwire Product Showcase

Subscribe to the Monthly
Technology Product Showcase: