Momentum Builds for Intel’s HPC Scalable System Framework

By Tiffany Trader

July 13, 2015

At a press conference on Friday, Intel delivered a progress report on the state of its unfolding HPC portfolio. With key products like Omni-Path and Knights Landing set to launch later this year, Intel is focusing on the momentum that is building for its HPC Scalable System Framework. The company also announced that it is entering into a deeper alliance with system partner HP to support HPC at a full range of scales.

Intel sees the framework as a configurable design path that is customizable for a wide range of workloads. The framework combines compute (Xeon and Phi processors), fabric, silicon photonics, innovative memory technologies, and the Intel Lustre parallel file system, plus the ability to efficiently integrate these elements into a broad spectrum of system solutions, optimized for both compute- and data-intensive workloads.

Intel HPC Scalable System Framework ISC 2015 slide

“It’s an organizing principle for us and for our OEM partners,” said Charlie Wuischpard, vice president of Intel’s data center group and general manager. “Rather than just building a big one-off system it’s really to take those learnings and that technology to make sure it’s usable across the entire industry and that it can be dialed-in to cover both compute and data-centric workloads.”

“From a half rack, full rack, row, up to the largest supercomputer in the world, this technology should be usable and productive,” he added. “And it’s all based on standards-based programmability on the x86 ISA, and can handle different business models from on-premise to cloud.”

A bolstered alliance between Intel and HP illustrates how the Scalable System Framework is being used by the industry. Under the renewed partnership, covered in greater detail here, HP is building a set of industry and workload optimized solutions on top of the Intel framework. The agreement is essentially a merging of Intel and HP innovation with a strong focus on code modernization and rapid deployment that will be facilitated via a new Center of Excellence in Houston, Texas. Intel has a similar partnership in place with Cray, which in April announced that it is basing its future Shasta architecture on the Intel framework.

Although these are the only renewed partnerships that have been revealed so far, the system framework is an essential part of Intel’s strategy and the company sees opportunities for all its partners to provide differentiation on top of this framework with regard to their respective markets and customers.

Diving into some of the building blocks for this framework, Intel is putting the spotlight on its Omni-Path fabric, which is gearing up to hit its official general availability launch date later this year. Wuischpard said Intel has already won quite a few opportunities that it will be installing in Q4 of 2015, and it has been sampling with all the major HPC and OEM vendors. There is also a pipeline of opportunities that have been bid and priced, the VP noted. He added that Intel has been pleased with the health of the silicon and the progress that has been made from an engineering perspective. Some of the testing and performance has been “super encouraging,” according to Wuischpard, which is to say that while GA is still to come the firm is “essentially open for business and taking pre-orders.”

The other bit of information that Intel has let out about Omni-Path is that it will have single port adapter cards, plus a pair of edge switches for inside of racks or to be used as spines and a pair of director switches for aggregating across racks. The edge switches will come with 24 or 48 ports, and the director switches will come with 192 or 768 ports.

What most want to know is how Omni-Path will compare to InfiniBand, and particularly EDR InfiniBand. Wuischpard highlighted the high message rate, low latency and the benefit of strong resiliency that doesn’t introduce more latency. Other features include traffic flow optimization and dynamic lane scaling, and Intel’s commitment to “top to bottom product line coverage.” The chipmaker said it will be coming to market with single-port Host Fabric Adapter (PCIe) cards, 24- and 48-port edge switches to handle the low-to-mid range, and for very large systems, 192-port and 768-port (QSFP-based leaf switch) director switches.

Intel Omni-Path 100 Series ISC 2015 slide cropped

 

“Whether you are a small half-rack or storage focused solution up through the very largest supercomputers, you now have a range of options and a range of price points to work with. In the aggregate, this will be the best TCO and the best ROI solution on the market in this time frame,” Wuischpard stated.

When it comes to the new Phi Knights Landing chips, Wuischpard said the company has been focusing on ecosystem readiness and sampling. As has been previously mentioned in other reports, the SKUs will be up to 72 cores with 2 vector processing units per core. Intel expects to have more than 50 systems providers at general availability with first systems shipping by year’s end. The firm is also putting a lot of effort into the software development kits, something that Wuischpard acknowledged may not have had enough attention for the Knights Corner release. The aim is for the product to be easy to use out of the box, and intuitive. “[We are creating] the iPhone experience for Knights Landing, with scripts, sample codes and the like,” said Wuischpard.

Intel Lustre-based solutions were also covered in the briefing. The business that Intel acquired, formerly known as Whamcloud, was first resident in Intel’s software division. Two months ago, based on the concern that it wasn’t fully being leveraged, Lustre was moved over into the datacenter group. Intel says it is increasing its investment in Lustre, and its has a roadmap for the next two editions as well as a cloud edition with additional features being announced over the next six months. “More than anything we want to be able to make the product easy to use, more enterprise-ready and at the same time build the features that allow you to do extreme-scale I/O,” the VP stated.

Intel Lustre roadmap ISC 2015 slide

Another effort that Intel previewed is the extension of its Intel Cluster Ready program to develop a product built from open source components that will have free and for-fee versions. Intel says its primary goal is to enable a high quality solution at all scales by providing an appliance-like experience targeting smaller or commercial industry sites to support easier HPC stack building and testing.

Intel is also heavily invested in code modernization through its Intel Parallel Computing Centers. To date, Intel has 50 centers around the world including one recently opened in China. In line with this mission, Intel is creating an online community to reach approximately 400,000 developers and partners with tools, trainings and support by the end of the year. Wuischpard observed that when Intel pairs its “Black Belts in parallelism from inside Intel and industry” with subject matter experts to work on specific codes they achieve great results, so Intel is in the process of hiring more Black Belts and formalizing this process to scale coding skills broadly. The firm will also be doing Hackathon-like events with prizes to stimulate more interest.

An expanded HPC Developer Conference, where developers share proven techniques and best practices, is in the works for the US, Asia and Europe. A one-day session held the Sunday before Supercomputing last November drew more than 500 attendees, and Intel expects future events will draw many times this number. The company is also working to cultivate additional Intel code modernization partners. The program is aiming to train 10,000 developers and partners with remote access to Intel clusters and hardware.

For those attending ISC in Frankfurt, Germany, this week, Knights Landing and Omni-Path will be on display at Intel’s booth.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Why HPC Storage Matters More Now Than Ever: Analyst Q&A

September 17, 2021

With soaring data volumes and insatiable computing driving nearly every facet of economic, social and scientific progress, data storage is seizing the spotlight. Hyperion Research analyst and noted storage expert Mark No Read more…

GigaIO Gets $14.7M in Series B Funding to Expand Its Composable Fabric Technology to Customers

September 16, 2021

Just before the COVID-19 pandemic began in March 2020, GigaIO introduced its Universal Composable Fabric technology, which allows enterprises to bring together any HPC and AI resources and integrate them with networking, Read more…

What’s New in HPC Research: Solar Power, ExaWorks, Optane & More

September 16, 2021

In this regular feature, HPCwire highlights newly published research in the high-performance computing community and related domains. From parallel programming to exascale to quantum computing, the details are here. Read more…

Cerebras Brings Its Wafer-Scale Engine AI System to the Cloud

September 16, 2021

Five months ago, when Cerebras Systems debuted its second-generation wafer-scale silicon system (CS-2), co-founder and CEO Andrew Feldman hinted of the company’s coming cloud plans, and now those plans have come to fruition. Today, Cerebras and Cirrascale Cloud Services are launching... Read more…

AI Hardware Summit: Panel on Memory Looks Forward

September 15, 2021

What will system memory look like in five years? Good question. While Monday's panel, Designing AI Super-Chips at the Speed of Memory, at the AI Hardware Summit, tackled several topics, the panelists also took a brief glimpse into the future. Unlike compute, storage and networking, which... Read more…

AWS Solution Channel

Supporting Climate Model Simulations to Accelerate Climate Science

The Amazon Sustainability Data Initiative (ASDI), AWS is donating cloud resources, technical support, and access to scalable infrastructure and fast networking providing high performance computing (HPC) solutions to support simulations of near-term climate using the National Center for Atmospheric Research (NCAR) Community Earth System Model Version 2 (CESM2) and its Whole Atmosphere Community Climate Model (WACCM). Read more…

ECMWF Opens Bologna Datacenter in Preparation for Atos Supercomputer

September 14, 2021

In January 2020, the European Centre for Medium-Range Weather Forecasts (ECMWF) – a juggernaut in the weather forecasting scene – signed a four-year, $89-million contract with European tech firm Atos to quintuple its supercomputing capacity. With the deal approaching the two-year mark, ECMWF... Read more…

Why HPC Storage Matters More Now Than Ever: Analyst Q&A

September 17, 2021

With soaring data volumes and insatiable computing driving nearly every facet of economic, social and scientific progress, data storage is seizing the spotlight Read more…

Cerebras Brings Its Wafer-Scale Engine AI System to the Cloud

September 16, 2021

Five months ago, when Cerebras Systems debuted its second-generation wafer-scale silicon system (CS-2), co-founder and CEO Andrew Feldman hinted of the company’s coming cloud plans, and now those plans have come to fruition. Today, Cerebras and Cirrascale Cloud Services are launching... Read more…

AI Hardware Summit: Panel on Memory Looks Forward

September 15, 2021

What will system memory look like in five years? Good question. While Monday's panel, Designing AI Super-Chips at the Speed of Memory, at the AI Hardware Summit, tackled several topics, the panelists also took a brief glimpse into the future. Unlike compute, storage and networking, which... Read more…

ECMWF Opens Bologna Datacenter in Preparation for Atos Supercomputer

September 14, 2021

In January 2020, the European Centre for Medium-Range Weather Forecasts (ECMWF) – a juggernaut in the weather forecasting scene – signed a four-year, $89-million contract with European tech firm Atos to quintuple its supercomputing capacity. With the deal approaching the two-year mark, ECMWF... Read more…

Quantum Computer Market Headed to $830M in 2024

September 13, 2021

What is one to make of the quantum computing market? Energized (lots of funding) but still chaotic and advancing in unpredictable ways (e.g. competing qubit tec Read more…

Amazon, NCAR, SilverLining Team for Unprecedented Cloud Climate Simulations

September 10, 2021

Earth’s climate is, to put it mildly, not in a good place. In the wake of a damning report from the Intergovernmental Panel on Climate Change (IPCC), scientis Read more…

After Roadblocks and Renewals, EuroHPC Targets a Bigger, Quantum Future

September 9, 2021

The EuroHPC Joint Undertaking (JU) was formalized in 2018, beginning a new era of European supercomputing that began to bear fruit this year with the launch of several of the first EuroHPC systems. The undertaking, however, has not been without its speed bumps, and the Union faces an uphill... Read more…

How Argonne Is Preparing for Exascale in 2022

September 8, 2021

Additional details came to light on Argonne National Laboratory’s preparation for the 2022 Aurora exascale-class supercomputer, during the HPC User Forum, held virtually this week on account of pandemic. Exascale Computing Project director Doug Kothe reviewed some of the 'early exascale hardware' at Argonne, Oak Ridge and NERSC (Perlmutter), while Ti Leggett, Deputy Project Director & Deputy Director... Read more…

Ahead of ‘Dojo,’ Tesla Reveals Its Massive Precursor Supercomputer

June 22, 2021

In spring 2019, Tesla made cryptic reference to a project called Dojo, a “super-powerful training computer” for video data processing. Then, in summer 2020, Tesla CEO Elon Musk tweeted: “Tesla is developing a [neural network] training computer called Dojo to process truly vast amounts of video data. It’s a beast! … A truly useful exaflop at de facto FP32.” Read more…

Berkeley Lab Debuts Perlmutter, World’s Fastest AI Supercomputer

May 27, 2021

A ribbon-cutting ceremony held virtually at Berkeley Lab's National Energy Research Scientific Computing Center (NERSC) today marked the official launch of Perlmutter – aka NERSC-9 – the GPU-accelerated supercomputer built by HPE in partnership with Nvidia and AMD. Read more…

Esperanto, Silicon in Hand, Champions the Efficiency of Its 1,092-Core RISC-V Chip

August 27, 2021

Esperanto Technologies made waves last December when it announced ET-SoC-1, a new RISC-V-based chip aimed at machine learning that packed nearly 1,100 cores onto a package small enough to fit six times over on a single PCIe card. Now, Esperanto is back, silicon in-hand and taking aim... Read more…

Enter Dojo: Tesla Reveals Design for Modular Supercomputer & D1 Chip

August 20, 2021

Two months ago, Tesla revealed a massive GPU cluster that it said was “roughly the number five supercomputer in the world,” and which was just a precursor to Tesla’s real supercomputing moonshot: the long-rumored, little-detailed Dojo system. “We’ve been scaling our neural network training compute dramatically over the last few years,” said Milan Kovac, Tesla’s director of autopilot engineering. Read more…

CentOS Replacement Rocky Linux Is Now in GA and Under Independent Control

June 21, 2021

The Rocky Enterprise Software Foundation (RESF) is announcing the general availability of Rocky Linux, release 8.4, designed as a drop-in replacement for the soon-to-be discontinued CentOS. The GA release is launching six-and-a-half months after Red Hat deprecated its support for the widely popular, free CentOS server operating system. The Rocky Linux development effort... Read more…

Google Launches TPU v4 AI Chips

May 20, 2021

Google CEO Sundar Pichai spoke for only one minute and 42 seconds about the company’s latest TPU v4 Tensor Processing Units during his keynote at the Google I Read more…

Intel Completes LLVM Adoption; Will End Updates to Classic C/C++ Compilers in Future

August 10, 2021

Intel reported in a blog this week that its adoption of the open source LLVM architecture for Intel’s C/C++ compiler is complete. The transition is part of In Read more…

AMD-Xilinx Deal Gains UK, EU Approvals — China’s Decision Still Pending

July 1, 2021

AMD’s planned acquisition of FPGA maker Xilinx is now in the hands of Chinese regulators after needed antitrust approvals for the $35 billion deal were receiv Read more…

Leading Solution Providers

Contributors

Hot Chips: Here Come the DPUs and IPUs from Arm, Nvidia and Intel

August 25, 2021

The emergence of data processing units (DPU) and infrastructure processing units (IPU) as potentially important pieces in cloud and datacenter architectures was Read more…

Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?

January 13, 2021

The rapid adoption of Julia, the open source, high level programing language with roots at MIT, shows no sign of slowing according to data from Julialang.org. I Read more…

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

HPE Wins $2B GreenLake HPC-as-a-Service Deal with NSA

September 1, 2021

In the heated, oft-contentious, government IT space, HPE has won a massive $2 billion contract to provide HPC and AI services to the United States’ National Security Agency (NSA). Following on the heels of the now-canceled $10 billion JEDI contract (reissued as JWCC) and a $10 billion... Read more…

Quantum Roundup: IBM, Rigetti, Phasecraft, Oxford QC, China, and More

July 13, 2021

IBM yesterday announced a proof for a quantum ML algorithm. A week ago, it unveiled a new topology for its quantum processors. Last Friday, the Technical Univer Read more…

Intel Launches 10nm ‘Ice Lake’ Datacenter CPU with Up to 40 Cores

April 6, 2021

The wait is over. Today Intel officially launched its 10nm datacenter CPU, the third-generation Intel Xeon Scalable processor, codenamed Ice Lake. With up to 40 Read more…

Frontier to Meet 20MW Exascale Power Target Set by DARPA in 2008

July 14, 2021

After more than a decade of planning, the United States’ first exascale computer, Frontier, is set to arrive at Oak Ridge National Laboratory (ORNL) later this year. Crossing this “1,000x” horizon required overcoming four major challenges: power demand, reliability, extreme parallelism and data movement. Read more…

Intel Unveils New Node Names; Sapphire Rapids Is Now an ‘Intel 7’ CPU

July 27, 2021

What's a preeminent chip company to do when its process node technology lags the competition by (roughly) one generation, but outmoded naming conventions make it seem like it's two nodes behind? For Intel, the response was to change how it refers to its nodes with the aim of better reflecting its positioning within the leadership semiconductor manufacturing space. Intel revealed its new node nomenclature, and... Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire