SYCL 2020 Launches with New Name, New Features, and High Ambition

By John Russell

February 9, 2021

The Khronos Group today formally launched SYCL 2020, the parallel programming framework based on ISO standard C++ that has been gaining traction in HPC and will, for example, be supported on the forthcoming exascale supercomputer Aurora (ANL) and pre-exascale system Perlmutter (NERSC/LBNL). SYCL 2020 builds on the functionality of SYCL 1.2.1, adding 40-plus new features, and introduces a new naming convention based on the year. SYCL 2020 is based on C++17.

Parallel programming and associated tools are hardly new, but the recent rise of heterogeneous computing has spurred development of several parallel programming frameworks targeting not just multicore CPUs but a whole array of diverse accelerators (GPUs, FPGAs, etc.) and domains. SYCL was introduced by the Khronos Group (consortium) in 2014 as a high-level, C++-based programming model for OpenCL that targets heterogeneous platforms. OpenCL was introduced in 2009 by Khronos.

Loosely, one can think of SYCL as playing a role similar to OpenMP as an HPC language for C++, but with significant technical differences and distinct strengths, drawbacks, and roots. OpenMP first supported Fortran (1997) and then C/C++ (2000). OpenMP has always had strength in incremental parallelism, specifically in C and Fortran. SYCL’s strength is its focus on modern C++ and its support for parameterization and dynamic composition of algorithms, making it suitable to compose directly with C++ template libraries such as TensorFlow.

SYCL is described as:

“[A] royalty-free, cross-platform abstraction layer that builds on the underlying concepts, portability and efficiency of OpenCL that enables code for heterogeneous processors to be written in a “single-source” style using completely standard C++. SYCL enables single source development where C++ template functions can contain both host and device code to construct complex algorithms that use OpenCL acceleration, and then re-use them throughout their source code on different types of data.”

“While originally developed for use with OpenCL and SPIR, it is actually a more general heterogeneous framework able to target other systems. For example, the hipSYCL implementation targets ROCm and CUDA via AMD’s cross-vendor HIP. While the SYCL standard started as the higher-level programming model sub-group of the OpenCL working group, it is a Khronos Group workgroup independent from the OpenCL working group since September 20, 2019.”

Calling SYCL 2020 a significant advance, Michael Wong, Codeplay distinguished engineer, ISO C++ Directions Group and SYCL working group chair, told HPCwire in a briefing, “We’re seeing significant adoption in embedded, desktop and HPC markets. We think that it can improve programmability, it will allow smaller code size and have faster performance. It’s based on C++17, and is backwards compatible with SYCL 1.2.1. It should ease porting of standard C++ applications to SYCL and it should enable closer alignment and integration with ISO C++. The other thing, of course, is we now enable multiple different kinds of back-end accelerators.” (Khronos also posted a blog today describing SYCL 2020.)

Here’s a snapshot of SYCL’s major new features:

  • Unified Shared Memory (USM) enables code with pointers to work naturally without buffers or accessors.
  • Parallel reductions add a built-in reduction operation to avoid boilerplate code and enable maximum performance on hardware with built-in reduction operation acceleration.
  • Work group and subgroup algorithms enable efficient parallel operations between work items.
  • Class template argument deduction (CTAD) and template deduction guides simplify class template instantiation.
  • Simplified use of Accessors with a built-in reduction operation, reducing boilerplate code and simplifying use of C++ software design patterns.
  • Expanded interoperability for efficient acceleration by diverse backend acceleration APIs.
  • Atomic operations are now closer to standard C++ atomics to enhance parallel programming freedom.
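
To make the CTAD item in the list above concrete, here is a minimal sketch in plain C++17 rather than SYCL itself. `ToyBuffer` is a hypothetical class invented for illustration and is not part of the SYCL API; the sketch only shows how class template argument deduction, plus one user-written deduction guide, lets the compiler infer template arguments from constructor arguments.

```cpp
#include <cassert>
#include <cstddef>
#include <iterator>
#include <vector>

// Hypothetical buffer-like class template (an assumption for illustration,
// NOT a SYCL class). It simply owns a copy of the data it is given.
template <typename T>
class ToyBuffer {
public:
    // CTAD can deduce T directly from this constructor's parameter.
    explicit ToyBuffer(const std::vector<T>& values) : data_(values) {}

    // T does not appear in this signature, so CTAD needs the guide below.
    template <typename It>
    ToyBuffer(It first, It last) : data_(first, last) {}

    std::size_t size() const { return data_.size(); }

    const T& operator[](std::size_t i) const {
        assert(i < data_.size());  // simple bounds check for the sketch
        return data_[i];
    }

private:
    std::vector<T> data_;
};

// User-written deduction guide: take the element type from the iterator.
template <typename It>
ToyBuffer(It, It) -> ToyBuffer<typename std::iterator_traits<It>::value_type>;
```

Given a `std::vector<double> v`, writing `ToyBuffer a(v);` deduces `ToyBuffer<double>` without spelling out the element type, and `ToyBuffer b(v.begin(), v.end());` relies on the explicit guide.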

The latest version represents three years of effort, said Wong, who emphasized user input was key in determining new features. For example, the simplified use of accessors with a built-in reduction operator was important, he said, “because our users have asked us to get to a point where hello world no longer looks like it has lots of accessors and buffers. It just looks like plain hello world that you would see in C++.”

Apart from feature growth, it is interesting to look at the SYCL ecosystem. There are many pieces to the parallel programming puzzle. Wong has packed a lot into the next slide.

“This slide shows how SYCL fits within the larger framework of C++ programs, libraries, C++ application codes, and machine learning frameworks. [It also] shows how SYCL can work within those fairly complex applications that do complex machine learning,” said Wong. “There are libraries that involve oneMKL [and] oneDNN – these are just names from oneAPI – and also SYCL BLAS libraries and Eigen libraries. Even though these are used in fairly complex C++ template operations, they can be easily ingested by SYCL,” said Wong.

“The differentiation here is that these libraries would not be easily ingested by OpenMP because OpenMP cannot adapt to C++ template operations as easily. These template libraries, they can be absorbed by the SYCL compiler and separately by the CPU host compiler. The host compiler can be any compiler, could be LLVM, could be GCC, could be Visual C++.

“Now the SYCL compiler would take a pass over the code and send a device code to an OpenCL back-end, or now with SYCL 2020, we can send it to other kinds of back-ends such as a PTX back-end for CUDA or OpenMP back-end [or] even a Vulkan back-end. Each of these back-ends can selectively distribute [code] to any number of heterogeneous devices,” he said.

“The real beauty here and the idea with using a C++ based language with SYCL is that it will enable things like kernel fusion, which gives you better performance on complex applications and libraries than hand-coding. SYCL is basically ideal for accelerating large C++ based engines and applications for performance portability.”
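
As a rough illustration of the kernel fusion idea Wong mentions, the sketch below uses plain C++ and no SYCL calls. The names `fuse` and `run_kernel` are hypothetical, and a real SYCL implementation would perform fusion in its compiler or runtime rather than through a helper like this. The point is only that, when both stages are visible as C++ callables in a single source, they can be composed into one pass over the data instead of two passes with a temporary array in between.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Compose two element-wise stages into one callable: x -> g(f(x)).
// Hypothetical helper for illustration; not a SYCL facility.
template <typename F, typename G>
auto fuse(F f, G g) {
    return [f, g](auto x) { return g(f(x)); };
}

// Stand-in for launching an element-wise kernel: a single loop over the data.
template <typename Kernel>
void run_kernel(const std::vector<float>& in, std::vector<float>& out,
                Kernel kernel) {
    assert(in.size() == out.size());  // caller must size the output
    for (std::size_t i = 0; i < in.size(); ++i) {
        out[i] = kernel(in[i]);
    }
}
```

Calling `run_kernel(in, out, fuse(scale, shift))` reads and writes each element once. The unfused equivalent would call `run_kernel` twice and store the intermediate result in a temporary vector.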

Perhaps the most prominent new addition to the SYCL ecosystem is Intel’s oneAPI effort, which is built on what Intel calls data parallel C++ or DPC++ and is being presented by Intel as an open standard for programming a variety of processor types. It will, for example, be the preferred method for porting code to Intel’s Xe GPU line. (See HPCwire coverage, Intel Debuts oneAPI Gold and Provides More Details on GPU Roadmap)

Wong is a oneAPI fan and has blogged about oneAPI. He told HPCwire, “I’ve been dreaming of something like oneAPI for a long time, basically, something that allows you to program to any device kind, any device workloads, across many different companies. Having said that, if there’s too much of an Intel label attached to it to the point where people aren’t aware that [it’s] for anybody, that’s going to be a challenge.”

Intel is hardly alone. In fact, Wong argues the number of SYCL development efforts is one of the clearest measures of SYCL’s growing traction. Xilinx has an effort, as does AMD (with the University of Heidelberg), and it’s natural to wonder if those efforts could be merged if/when AMD’s acquisition of Xilinx is completed. Wong doesn’t think so. There’s a neoSYCL that is quite new targeting NEC and Intel processors. Wong packed a chart showing SYCL implementations. Take a moment to look at SYCL’s growing family tree and then read Wong’s comments.

“The SYCL implementations in development are now ballooning. Actually, we just put one in just in the last couple of weeks. Traditionally, there has always been Codeplay’s ComputeCpp. That’s the company I work for, which generates codes for any number of CPUs. GPUs have gone through OpenCL and SPIR-V that can work for Intel, AMD, Arm, Mali, IMG PowerVR, and the Renesas R-Car [devices]. But we also have one that goes through PTX to generate code for Nvidia’s GPUs,” said Wong.

“Then the big player that came in was Intel with their oneAPI. Inside oneAPI is a compiler called data parallel C++ (DPC++). They are doing that so they can generate code for Intel CPUs, GPUs, FPGAs, and I think in future for AI processors. They are using a Clang (compiler) implementation [and] so is Codeplay.

“We will also have the triSYCL from Xilinx, which is specifically for Xilinx FPGAs, and the hipSYCL, which has the support for AMD GPUs and Nvidia GPUs and they do it through an OpenMP back end. So implementers were already using different back-ends and OpenCL. So it just makes sense for us to legitimize that in the specification (as is done in SYCL 2020). On the far right is something we just added in the last couple of weeks, based on an announcement from HPC Asia, called neoSYCL, for NEC for the vector engine. So it [also] supports x86 Intel CPUs, and the NEC vector engines. We’re very excited about that. That will be open source soon as they have an implementation. We don’t put things on unless there’s a confirmed implementation,” he noted.

You get the picture. There is a lot of activity around SYCL at the moment. This is noticeably so at the Department of Energy and in advanced systems generally. Wong argues the need for portable performance and multiple vendor support are driving factors. He contends the science project development path is changing in HPC. Again, he’s packed a lot into a single slide (based on a 2020 SYCLCon keynote by Hal Finkel, the newly promoted computer science program manager for the DOE Office of Advanced Scientific Computing Research in October). Check it out before reading Wong’s description.

“As you are well aware, OpenMP has had staying power in HPC for a long time. So why use SYCL here?” said Wong. “HPC workloads persist usually for 20 or more years. But the hardware can change every five years with new exascale or petascale projects from DOE and they often could go to different vendors. They also basically need to serve three pillars of science problems. One is simulation [which] needs a high-performance computing language with solvers and parallel runtimes. [Second] one is data science that needs a high productivity language for big data. The third pillar is learning, training and inference and that needs a high productivity language for machine learning and deep learning,” said Wong.

“These have been supported by the top languages. The idea is that there’s OpenMP that’s mostly for C and Fortran. There’s a mix of CUDA, OpenACC, OpenCL, pthreads (POSIX threads). And now for C++ there’s SYCL, and the National Labs’ frameworks like Kokkos and RAJA. Now, the development workflow over time has changed. These days a science project usually starts with choosing an algorithm. Unlike before when once you chose the algorithm you were done, now they’re finding the choice of the algorithm needs feedback – knowledge of the system architecture and the tool chain. These tools need to have control of the data layout, data movement, data locality, data affinity, so they can be optimized for portable performance.

“SYCL and these other C++ frameworks enable these parameterizations and dynamically configure the algorithms through the C++ capabilities like C++ templates, and inlining. The second thing that they do, after they choose the algorithm for the target, is they implement and test the algorithm. The third thing, of course, is optimizing the algorithm and traditionally this was the only step that needed feedback from the architecture and tool knowledge. Today, it’s no longer the case. The choice of the algorithm also now needs that feedback. Languages like SYCL, Kokkos, and RAJA do that especially well because their template static polymorphism allows them to change the algorithm depending on the type of the parameters,” he explained.
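
The static polymorphism Wong describes can be sketched in standard C++17 without any SYCL, Kokkos, or RAJA code. In the hypothetical `dot` function below, the algorithm is selected at compile time from the element type: floating-point inputs get Kahan compensated summation, while integer inputs get a plain accumulation loop. The function name and the choice of Kahan summation are assumptions made for illustration, not something taken from those frameworks.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>
#include <vector>

// Dot product whose summation algorithm depends on the element type.
// Hypothetical example; the branch is resolved at compile time.
template <typename T>
T dot(const std::vector<T>& a, const std::vector<T>& b) {
    static_assert(std::is_arithmetic_v<T>, "dot requires an arithmetic type");
    assert(a.size() == b.size());  // inputs must have equal length

    if constexpr (std::is_floating_point_v<T>) {
        // Kahan compensated summation to limit floating-point rounding error.
        T total = T(0);
        T compensation = T(0);
        for (std::size_t i = 0; i < a.size(); ++i) {
            const T term = a[i] * b[i] - compensation;
            const T next = total + term;
            compensation = (next - total) - term;
            total = next;
        }
        return total;
    } else {
        // Integer arithmetic has no rounding error, so a plain loop suffices.
        T total = T(0);
        for (std::size_t i = 0; i < a.size(); ++i) {
            total += a[i] * b[i];
        }
        return total;
    }
}
```

Because `if constexpr` discards the untaken branch, each instantiation contains only the code path chosen for its type, with no runtime dispatch.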

“All these steps enable you to reach high performance portable code, but it needs to be using an open standard that everybody can collaborate in. So they are basically required to reach exascale computing for these four major systems of which two are now using SYCL. Aurora and NERSC’s Perlmutter are both adapting to SYCL and there are other ones and it’s not just SYCL. They will also use OpenMP and CUDA and OpenCL and OpenACC and pthreads. The other two systems coming in 2021, of course, Frontier (ORNL) and El Capitan (LLNL) are both AMD systems and SYCL has been demonstrated to work on AMD systems as well. The key is this – parameterization and dynamically composed algorithms, along with compiler optimizations using an open standard programming model, is what we think will enable performance portability. That’s why DOE labs are adapting to it. They know that they need this to reach performance portability,” Wong said.

Wong is realistic but hopeful: “No language is perfect. I’ve been a language designer for the better part of 20 years of my professional life, starting with C++, then OpenMP, and then SYCL. Every language is trying to serve a community, balancing between performance, portability and productivity.”

SYCL will need to determine how it balances those goals. Wong said the growing similarity of workflows in science to those in industry, largely driven by AI, should help SYCL expand its footprint further. The European Processor Initiative and RISC-V represent opportunities along with the embedded market such as automotive. “I think SYCL can do more in the embedded space, as well as some of the FPGA space. And that depends on having more of those vendors being on board, and that’s coming.”

It will be interesting to watch SYCL’s growth. SYCL 2020 seems an important step forward technically and from a market position. Its release cycle going forward, said Wong, will closely mirror the C++ cycle with a major release every three years. He said work has already started on SYCL 2023, which will be based on the just released C++ 2020. The three-year lag, he said, was a necessary element in making sure all the released code was robust. Moreover, he said safety issues are becoming more important, such as in automotive.

As if to hammer home SYCL’s growing strength, Khronos released an unusually large number of testimonials with SYCL 2020. They are included below. Stay tuned.

 

TESTIMONIALS PROVIDED BY KHRONOS

“Our users will benefit from features in the SYCL 2020 specification. New features, such as support for unified memory (USM) and reductions, are important capabilities for programming high-performance-computing hardware. In addition, support for C++17 will allow our users to write better C++ code, with both language features (such as deduction guides) and library features (such as std::optional). Other new features (such as softening the requirements on kernel functions and sharing data between host and devices) are an important step for implementing backend support for SYCL in the Kokkos and RAJA performance portability ecosystems,” said Nevin Liber, computer scientist, Argonne National Laboratory’s Leadership Computing Facility.

“At Cineca, based on our experience, we confirm the value that SYCL is bringing to the development of high-performance computing in a hybrid environment. In fact, through SYCL, it is possible to build a common and portable environment for the development of computing-intensive applications to be executed on HPC architectures configured with floating point accelerators, which allows industries and scientific communities to use the common availability of development tools, libraries of algorithms, accumulated experience,” said Sanzio Bassini, director of supercomputing, Application Innovation Dept, Cineca. “Cineca is already running the distributed Celerity runtime on top of several SYCL implementations on the new Marconi100 cluster, ranked no. 11 in the Top500, providing users with a unified API for both about 4,000 NVIDIA Volta V100 GPUs and IBM Power9 host processors. SYCL 2020 is a big step towards a much leaner API that unlocks all the potential provided by modern C++ standards for accelerated data-parallel kernels, making the development of large-scale scientific software easier and more sustainable, both for industry-oriented domain applications and for science-oriented domain applications.”

“Codeplay has been deeply involved in SYCL from its original definition and we are now enabling the standard on a range of systems with our ComputeCpp product. We strongly believe SYCL is the only software standard to link all the high performance processors to a unified programming solution,” said Andrew Richards, founder and CEO, Codeplay Software. “Developers will find that SYCL 2020 refines the standard to streamline their development and adds some crucial new enhancements to improve productivity.”

“Imagination recognizes the benefit of SYCL across multiple markets. Our software stacks have been designed to improve SYCL performance, enabling a straightforward path to exploit the teraflops of compute performance in our latest IP,” said Mark Butler, Vice President of Software Engineering, Imagination Technologies. “The ability to quickly port workloads from other proprietary APIs is a huge benefit, easing the transition from development on desktop to deployment on embedded systems. SYCL 2020 is a positive step forward for this API, enabling higher levels of performance, which will benefit developers and platform creators.”

“SYCL 2020 final specification brings significant features to the industry that enable C++ developers to more productively build high-performance heterogeneous applications with unified programming across XPU architectures,” said Jeff McVeigh, Intel vice president, Datacenter XPU Products and Solutions. “Several capabilities pioneered in the open source oneAPI C++/DPC++ compiler, such as unified shared memory, group algorithms, and sub-groups, contributed to this community effort. Open, cross-architecture programming is required for accelerated distributed computing; we look forward to continuing our collaboration to address the needs of the developer ecosystem.”

“With thousands of users and a wide range of applications using NERSC’s resources, we must support a wide range of programming models. In addition to directive-based approaches, we see modern C++ language-based approaches to accelerator programming, such as SYCL, as an important component of our programming environment offering for users of Perlmutter,” said Brandon Cook, application performance specialist at NERSC. “Further, this work supports the productivity of scientific application developers and users through performance portability of applications between Aurora and Perlmutter.”

“NSITEXE supports the SYCL 2020 technology, which is gaining attention in embedded applications,” said Hideki Sugimoto, CTO, NSITEXE, Inc. “SYCL is very important to increase productivity by hiding complexities from users. We are considering adopting this technology in our next generation of IP platforms.”

“For Renesas, SYCL is a key enabler for automotive ADAS/AD software developers that allows them to easily use the highly-efficient, heterogeneous accelerators of the R-Car SoC Series through the open Khronos standard,” said Cyril Cordoba, Director of ADAS Segment Marketing Department, Renesas.

“We are excited about the extensive list of features and improvements released with the new SYCL 2020 specification,” said Thomas Fahringer, head of the Distributed and Parallel Systems Group at the University of Innsbruck. “The API becomes terser and more developer friendly, while also introducing new ways for expert users to exercise fine-grained control over state-of-the-art hardware features. The move to a generalized backend model opens up new possibilities to integrate with existing legacy solutions, which is especially important in scientific research environments. As co-developers of the Celerity project, together with the University of Salerno, we are welcoming these changes and look forward to applying them within distributed-memory research and industry applications, for example as part of the recently launched EuroHPC LIGATE project.”

“Xilinx is excited about the progress achieved with SYCL 2020,” said Ralph Wittig, fellow, Xilinx. “This single-source C++ framework unifies host and device code for various kinds of accelerators in the same C++ program. With host-fallback device execution, developers can emulate device code on a CPU, exploring hardware-software co-design for adaptable computing devices. SYCL is now extensible via customizable back-ends, enabling device plug-ins for FPGAs and ACAPs.”
