Advancing Applications Toward Exascale Target

By Tiffany Trader

February 11, 2014

Exascale computers will employ tens of thousands of many-core nodes, leaving programmers with the challenging task of developing applications that can leverage tens of millions of threads. Over at the Cray blog, Dr. Jason Beech-Brandt, Manager Exascale Research for Cray Europe, writes about how the supercomputer maker is helping lay the groundwork for exascale applications through a collaborative effort known as CRESTA.

In order for exascale applications to be productive, all the supporting layers also need to be optimized: from the operating and runtime systems, through the communication and scientific libraries to the compilers and toolsets. The need for such an ecosystem led Cray to establish the Cray Research Initiative Europe back in 2009. As part of that endeavor, Cray is working with key high performance computing (HPC) centers and software and tools developers across Europe under the banner of CRESTA (Collaborative Research into Exascale Systemware).

“Co-design is at the project’s core,” explains Beech-Brandt, “real application requirements driving systemware developments and research, which then feed back into the applications in an ongoing, virtuous cycle.”

With funding from the European Union, CRESTA has selected six applications for exascale-focused development. The applications were chosen by CRESTA HPC center partners and represent a broad range of domains, including CFD, numerical weather prediction, biomolecular systems, fusion energy, and physiological flows.

The CRESTA team is exploring new programming models, such as PGAS languages and OpenACC, as well as enhanced libraries, e.g., FFTs and sparse matrix operations. Another technique is to introduce fault tolerance both in the applications and in the communication libraries. The team is also experimenting with improved compilers, workflow, and diagnostic tools, such as DDT and Vampir from partners Allinea and TU Dresden, which help developers address the bottlenecks that limit the progress from petascale and exascale.

The project relies on large Cray supercomputers installed at CRESTA partner sites in Europe the United States, including the 20-petaflop (peak) Cray XK7 Titan supercomputer, located at the Oak Ridge National Laboratory. Access is enabled by the U.S. Department of Energy INCITE program, and three of CRESTA’s partner co-design applications have leveraged the INCITE program.

The Cray rep cites several examples of how customers can benefit from CRESTA. The European Center for Medium range Weather Forecasting (ECMWF), for example, uses the Integrated Forecast System (IFS) model to provide medium-range weather forecasts to its 34 European member states. The global grid size for simulations, currently based on a 16 km resolution, is expected to be refined down to a 2.5 km global weather forecast model by 2030, when employed on an exascale system. This means IFS needs to run efficiently on a thousand times more cores. Advances achieved by CRESTA have enabled IFS to harness 200,000 CPU cores on Titan. This is the most cores ever used for a weather model and it marks the first use of the 5 km resolution model that will be needed in medium range forecasts in 2023.

The breakthrough was enabled by new programming models, which eliminated a performance bottleneck. More specifically, the Cray Compiler Environment (CCE) used nested Fortran coarrays within OpenMP, which allowed communication time to be absorbed into existing calculations.

Cray reports other successes in the field of CFD too, with CRESTA researchers using the OpenACC programming model to extend the existing Nek5000 code to exploit accelerators. Adding only one OpenACC directive per thousand lines of Fortran code has already enabled a Nek5000 test case to be scaled across more than 16,000 GPU nodes of Titan, with an almost threefold increase in performance compared to just using the CPUs.

CRESTA is also helping to boost the performance of Gromacs, a classical molecular dynamics package for simulating the behavior of millions of particles. The work involves using a hybrid approach (CPUs and GPUs) to understand the mechanism of membrane fusion in viruses.

The Cray rep notes that the R&D undertaken by CRESTA has led to improvements in the Cray Compiler Environment (CCE), something Cray users everywhere can benefit from.

In CRESTA’s last year, Beech-Brandt  says to “expect more big improvements in applications, systemware, and tools.” When the program ends, Cray’s work on the exascale front will continue with projects like EPiGRAM.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Cray+Azure: Can Cloud Propel Supercomputing?

October 23, 2017

Cray and Microsoft today announced they will offer dedicated Cray supercomputers (the XC and CS-Storm lines) inside the Azure platform allowing customers to run their HPC and AI applications alongside their other cloud w Read more…

By Tiffany Trader

2017 Gordon Bell Prize Finalists Named

October 23, 2017

The three finalists for this year’s Gordon Bell Prize in High Performance Computing have been announced. They include two papers on projects run on China’s Sunway TaihuLight system and a third paper on 3D image recon Read more…

By John Russell

Data Vortex Users Contemplate the Future of Supercomputing

October 19, 2017

Last month (Sept. 11-12), HPC networking company Data Vortex held its inaugural users group at Pacific Northwest National Laboratory (PNNL) bringing together about 30 participants from industry, government and academia t Read more…

By Tiffany Trader

HPE Extreme Performance Solutions

Transforming Genomic Analytics with HPC-Accelerated Insights

Advancements in the field of genomics are revolutionizing our understanding of human biology, rapidly accelerating the discovery and treatment of genetic diseases, and dramatically improving human health. Read more…

AI Self-Training Goes Forward at Google DeepMind

October 19, 2017

DeepMind, Google’s AI research organization, announced today in a blog that AlphaGo Zero, the latest evolution of AlphaGo (the first computer program to defeat a Go world champion) trained itself within three days to play Go at a superhuman level (i.e., better than any human) – and to beat the old version of AlphaGo – without leveraging human expertise, data or training. Read more…

By Doug Black

Cray+Azure: Can Cloud Propel Supercomputing?

October 23, 2017

Cray and Microsoft today announced they will offer dedicated Cray supercomputers (the XC and CS-Storm lines) inside the Azure platform allowing customers to run Read more…

By Tiffany Trader

Data Vortex Users Contemplate the Future of Supercomputing

October 19, 2017

Last month (Sept. 11-12), HPC networking company Data Vortex held its inaugural users group at Pacific Northwest National Laboratory (PNNL) bringing together ab Read more…

By Tiffany Trader

AI Self-Training Goes Forward at Google DeepMind

October 19, 2017

DeepMind, Google’s AI research organization, announced today in a blog that AlphaGo Zero, the latest evolution of AlphaGo (the first computer program to defeat a Go world champion) trained itself within three days to play Go at a superhuman level (i.e., better than any human) – and to beat the old version of AlphaGo – without leveraging human expertise, data or training. Read more…

By Doug Black

Student Cluster Competition Coverage New Home

October 16, 2017

Hello computer sports fans! This is the first of many (many!) articles covering the world-wide phenomenon of Student Cluster Competitions. Finally, the Student Read more…

By Dan Olds

Intel Delivers 17-Qubit Quantum Chip to European Research Partner

October 10, 2017

On Tuesday, Intel delivered a 17-qubit superconducting test chip to research partner QuTech, the quantum research institute of Delft University of Technology (TU Delft) in the Netherlands. The announcement marks a major milestone in the 10-year, $50-million collaborative relationship with TU Delft and TNO, the Dutch Organization for Applied Research, to accelerate advancements in quantum computing. Read more…

By Tiffany Trader

Fujitsu Tapped to Build 37-Petaflops ABCI System for AIST

October 10, 2017

Fujitsu announced today it will build the long-planned AI Bridging Cloud Infrastructure (ABCI) which is set to become the fastest supercomputer system in Japan Read more…

By John Russell

HPC Chips – A Veritable Smorgasbord?

October 10, 2017

For the first time since AMD's ill-fated launch of Bulldozer the answer to the question, 'Which CPU will be in my next HPC system?' doesn't have to be 'Whichever variety of Intel Xeon E5 they are selling when we procure'. Read more…

By Dairsie Latimer

Delays, Smoke, Records & Markets – A Candid Conversation with Cray CEO Peter Ungaro

October 5, 2017

Earlier this month, Tom Tabor, publisher of HPCwire and I had a very personal conversation with Cray CEO Peter Ungaro. Cray has been on something of a Cinderell Read more…

By Tiffany Trader & Tom Tabor

Reinders: “AVX-512 May Be a Hidden Gem” in Intel Xeon Scalable Processors

June 29, 2017

Imagine if we could use vector processing on something other than just floating point problems.  Today, GPUs and CPUs work tirelessly to accelerate algorithms Read more…

By James Reinders

NERSC Scales Scientific Deep Learning to 15 Petaflops

August 28, 2017

A collaborative effort between Intel, NERSC and Stanford has delivered the first 15-petaflops deep learning software running on HPC platforms and is, according Read more…

By Rob Farber

Oracle Layoffs Reportedly Hit SPARC and Solaris Hard

September 7, 2017

Oracle’s latest layoffs have many wondering if this is the end of the line for the SPARC processor and Solaris OS development. As reported by multiple sources Read more…

By John Russell

US Coalesces Plans for First Exascale Supercomputer: Aurora in 2021

September 27, 2017

At the Advanced Scientific Computing Advisory Committee (ASCAC) meeting, in Arlington, Va., yesterday (Sept. 26), it was revealed that the "Aurora" supercompute Read more…

By Tiffany Trader

How ‘Knights Mill’ Gets Its Deep Learning Flops

June 22, 2017

Intel, the subject of much speculation regarding the delayed, rewritten or potentially canceled “Aurora” contract (the Argonne Lab part of the CORAL “ Read more…

By Tiffany Trader

GlobalFoundries Puts Wind in AMD’s Sails with 12nm FinFET

September 24, 2017

From its annual tech conference last week (Sept. 20), where GlobalFoundries welcomed more than 600 semiconductor professionals (reaching the Santa Clara venue Read more…

By Tiffany Trader

Google Releases Deeplearn.js to Further Democratize Machine Learning

August 17, 2017

Spreading the use of machine learning tools is one of the goals of Google’s PAIR (People + AI Research) initiative, which was introduced in early July. Last w Read more…

By John Russell

Nvidia Responds to Google TPU Benchmarking

April 10, 2017

Nvidia highlights strengths of its newest GPU silicon in response to Google's report on the performance and energy advantages of its custom tensor processor. Read more…

By Tiffany Trader

Leading Solution Providers

Graphcore Readies Launch of 16nm Colossus-IPU Chip

July 20, 2017

A second $30 million funding round for U.K. AI chip developer Graphcore sets up the company to go to market with its “intelligent processing unit” (IPU) in Read more…

By Tiffany Trader

Amazon Debuts New AMD-based GPU Instances for Graphics Acceleration

September 12, 2017

Last week Amazon Web Services (AWS) streaming service, AppStream 2.0, introduced a new GPU instance called Graphics Design intended to accelerate graphics. The Read more…

By John Russell

EU Funds 20 Million Euro ARM+FPGA Exascale Project

September 7, 2017

At the Barcelona Supercomputer Centre on Wednesday (Sept. 6), 16 partners gathered to launch the EuroEXA project, which invests €20 million over three-and-a-half years into exascale-focused research and development. Led by the Horizon 2020 program, EuroEXA picks up the banner of a triad of partner projects — ExaNeSt, EcoScale and ExaNoDe — building on their work... Read more…

By Tiffany Trader

Delays, Smoke, Records & Markets – A Candid Conversation with Cray CEO Peter Ungaro

October 5, 2017

Earlier this month, Tom Tabor, publisher of HPCwire and I had a very personal conversation with Cray CEO Peter Ungaro. Cray has been on something of a Cinderell Read more…

By Tiffany Trader & Tom Tabor

Cray Moves to Acquire the Seagate ClusterStor Line

July 28, 2017

This week Cray announced that it is picking up Seagate's ClusterStor HPC storage array business for an undisclosed sum. "In short we're effectively transitioning the bulk of the ClusterStor product line to Cray," said CEO Peter Ungaro. Read more…

By Tiffany Trader

Intel Launches Software Tools to Ease FPGA Programming

September 5, 2017

Field Programmable Gate Arrays (FPGAs) have a reputation for being difficult to program, requiring expertise in specialty languages, like Verilog or VHDL. Easin Read more…

By Tiffany Trader

IBM Advances Web-based Quantum Programming

September 5, 2017

IBM Research is pairing its Jupyter-based Data Science Experience notebook environment with its cloud-based quantum computer, IBM Q, in hopes of encouraging a new class of entrepreneurial user to solve intractable problems that even exceed the capabilities of the best AI systems. Read more…

By Alex Woodie

HPC Chips – A Veritable Smorgasbord?

October 10, 2017

For the first time since AMD's ill-fated launch of Bulldozer the answer to the question, 'Which CPU will be in my next HPC system?' doesn't have to be 'Whichever variety of Intel Xeon E5 they are selling when we procure'. Read more…

By Dairsie Latimer

  • arrow
  • Click Here for More Headlines
  • arrow
Share This