Intel Carves Mainstream Highway for Lustre

By Nicole Hemsoth

June 12, 2013

It’s been just a tick under a year since Intel’s acquisition of Whamcloud and its Lustre braintrust. While it was a rather quiet transition, the chipmaker made some noteworthy strides toward pushing Lustre further into the mainstream–without ignoring its roots in HPC.

The company kicked two announcements about Lustre into the arena with high hopes of appealing to the enterprise crowd. Both news items are meant to strike a chord with the big business crowd–a market that’s increasingly interested in (you guessed it) “big data” and Hadoop, as well as general usability for ever-mounting scalability troubles.

Brent Gorda, founder and former CEO of Whamcloud, who now manages all things Lustre for Intel, said that the file system has a dangerous reputation as being hard to handle, despite its recognized benefits. This bad rap, however, spins from a misconception that’s fed from the stream of news out of multi-petabyte installations where Lustre really can require some serious expertise, according to Gorda.

Still, businesses are looking to plug in Lustre, at least according to Intel, which tends to pay attention to customer demand. While they want Lustre benefits, their hangup is that they don’t want to make new hires to run the file system. Hence the rollout today of Intel Enterprise Edition for Lustre, which comes with a new management tool called Intel Manager for Lustre.

As Gorda explains, it’s all about opening access to Lustre for the “rest of the world.” He says that currently, it’s the big national labs that are hiring people to run it since they need the scalability and reliability it offers. What Intel decided, based on customer demand, is that they want to offer approachable tools for Lustre to anyone who can manage a Linux box. The key here, argues Gorda, is not just the benefits of the management interface– it’s the Lustre support that’s now backed by a tier-1 vendor (versus little old Whamcloud peddling its support that made the big shops shy).

As with any software project, this mainstreaming of Lustre will be a process of evolution. But at this point, the management tool simplifies interacting with Lustre so users can wick away the need to deal with command lining, finding configuration files all over the place and are freed from watching logs scroll by. The management tool does all of this, thus Intel argues, opening access to Lustre without making companies add more bodies to deal with it–making it far more attractive to some key areas that are scaling at HPC levels (oil and gas, financial services…the usual suspects).

In addition to the management tool to facilitate wider adoption of Lustre in new environments, the company made a Hadoop announcement set to sing to those frustrated with the native Hadoop file system (HDFS), at least for those who have tapped Intel’s distro. Gorda described how one talented Lustre and HDFS expert steamrolled Hadoop with a new Java class to fully swap in Lustre over HDFS. The impetus here was that many of the traditional HPC sites were talking to Intel about their Hadoop plans. Of course, the needs of a file system for Hadoop at the Web 2.0 scale wouldn’t necessitate such a swap-out since it runs on local disk for most. For the HPC’ers, however, the disks were probably lifted from the boxes a decade ago, so the challenge is convincing Hadoop to run off a global parallel file system–a trick he says they’ve pulled off nicely.

Further, Gorda said that in a conversation with an oil and gas company about the pending HDFS news, the business did the math and found that Hadoop’s triple-replication was going to cost them quite a bit more than Lustre.

When it comes to Hadoop, “Lustre is performing faster than local disk because we’re feeding you data at network bandwidths, which are generally faster. When it comes time for the sort/merge stage, you already have a global scratch space, which means you don’t have to do all the communication steps that are necessary with local disk,” Gorda explained.

During our chat yesterday, Gorda seemed genuinely enthusiastic about how Intel’s might has led to increasing adoption and interest in Lustre. He said that prior to coming Intel’s wing, they would talk to potential big name customers in financial services and other areas who were in need of a robust file system for large node-counts but were put off by the fact that Whamcloud had only a small team. With Intel’s namepower and funding–both through the Fast Forward program and the company’s own ambition to deliver Lustre to new markets based on demand–Gorda says Lustre will continue to meet the eventual needs of exascale and the current demands from enterprise users who have smacked into the scaling wall with their current file systems.

Outside of today’s news items around Lustre, Gorda said there are some cool things brewing for the coming months, particularly powered by Fast Forward program funds.

For instance, he pointed to work being done on so-called burst buffers and new innovations fed by solid state. . As one can imagine, a file system that’s looking toward such technologies might be laden with some interesting features that could dramatically speed data and focus on latency as the data moves ever closer to the compute.

He also talked about how his team is adding an object storage model to Lustre. As he noted, “POSIX is getting long in the tooth; it was not designed to go this far due to locking and semantic issues. We’re going down the path of building out–specifically for HPC–an object-based interface that is different than today’s cloud object  models. The goal is to offer a way to communicate directly to the application about where its data is self-consistent. So if the application is humming along, clicking in at checkpoints, you’ll know, for example, that your fifth checkpoint is consistent when all our tasks check in and move on to the next iteration. Intel’s Lustre team is refining this development currently.

As for the future of Lustre in environments outside of the traditional purview of HPC, Gorda remains confident that the power of Intel will be enough to add some new sway. The new developments to add to usability help boost Lustre’s profile and once word hits the mainstream Hadoop-obsessed press with news of a super-scalable HDFS replacement, it could mean big things for the file system’s future.

Related Articles

Intel to Manufacture Its Own Destiny?

Intel and Hadoop Help Power Pecan Street

Intel Seeks Exascale Program Lead

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

South Africa CHPC: Home Grown Dynasty

October 22, 2018

Before the build up to the final event in the 2018 Student Cluster Competition season (the SC18 competition in Dallas), I want to take a moment to write about one of the great inspirational stories of these competitions. Read more…

By Dan Olds

NSF Launches Quantum Computing Faculty Fellows Program

October 22, 2018

Efforts to expand quantum computing research capacity continue to accelerate. The National Science Foundation today announced a Quantum Computing & Information Science Faculty Fellows (QCIS-FF) program aimed at devel Read more…

By John Russell

Democratization of HPC Part 3: Ninth Graders Tap HPC in the Cloud to Design Flying Boats

October 18, 2018

This is the third in a series of articles demonstrating the growing acceptance of high-performance computing (HPC) in new user communities and application areas. In this article we present UberCloud use case #208 on how Read more…

By Wolfgang Gentzsch and Håkon Bull Hove

HPE Extreme Performance Solutions

One Small Step Toward Mars: One Giant Leap for Supercomputing

Since the days of the Space Race between the U.S. and the former Soviet Union, we have continually sought ways to perform experiments in space. Read more…

IBM Accelerated Insights

Join IBM at SC18 and Learn to Harness the Next Generation of AI-focused Supercomputing

Blurring the lines between HPC and AI

Today’s high performance computers are helping clients gain insights at an unprecedented pace. The intersection of artificial intelligence (AI) and HPC can transform industries while solving some of the world’s toughest challenges. Read more…

Penguin Computing Launches Consultancy for Piecing AI Strategies Together

October 18, 2018

AI stands before the HPC industry as a beacon of great expectations, yet market research repeatedly shows that AI adoption is commonly stuck in the talking phase, on the near side of a difficult chasm to cross. In respon Read more…

By Tiffany Trader

South Africa CHPC: Home Grown Dynasty

October 22, 2018

Before the build up to the final event in the 2018 Student Cluster Competition season (the SC18 competition in Dallas), I want to take a moment to write about o Read more…

By Dan Olds

Penguin Computing Launches Consultancy for Piecing AI Strategies Together

October 18, 2018

AI stands before the HPC industry as a beacon of great expectations, yet market research repeatedly shows that AI adoption is commonly stuck in the talking phas Read more…

By Tiffany Trader

When Water Quality—Not Quantity—Hinders HPC Cooling

October 18, 2018

Attention has been paid to the sheer quantity of water consumed by supercomputers’ cooling towers – and rightly so, as they can require thousands of gallons per minute to cool. But in the background, another factor can emerge, bottlenecking efficiency and raising costs: water quality. Read more…

By Oliver Peckham

Paper Offers ‘Proof’ of Quantum Advantage on Some Problems

October 18, 2018

Is quantum computing worth all the effort being poured into it or should we just wait for classical computing to catch up? An IBM blog today posed those questio Read more…

By John Russell

Dell EMC to Supply U Michigan’s Great Lakes Cluster

October 16, 2018

The University of Michigan (U-M) today announced Dell EMC is the lead vendor for U-M’s $4.8 million Great Lakes HPC cluster scheduled for deployment in first Read more…

By John Russell

Houston to Field Massive, ‘Geophysically Configured’ Cloud Supercomputer

October 11, 2018

Based on some news stories out today, one might get the impression that the next system to crack number one on the Top500 would be an industrial oil and gas mon Read more…

By Tiffany Trader

Nvidia Platform Pushes GPUs into Machine Learning, High Performance Data Analytics

October 10, 2018

GPU leader Nvidia, generally associated with deep learning, autonomous vehicles and other higher-end enterprise and scientific workloads (and gaming, of course) Read more…

By Doug Black

Federal Investment in Exascale – What It Really Means

October 10, 2018

Earlier this month, the EuroHPC JU (Joint Undertaking) reached critical mass, and it seems all EU and affiliated member states, bar the UK (unsurprisingly), have or will sign on. The EuroHPC JU was born from a recognition that individual EU member states, and the EU as a whole, were significantly underinvesting in HPC compared to the US, China and Japan, who all have their own exascale investment and delivery strategies (NSCI, 13th 5 Year Plan, Post-K, etc). Read more…

By Dairsie Latimer

TACC Wins Next NSF-funded Major Supercomputer

July 30, 2018

The Texas Advanced Computing Center (TACC) has won the next NSF-funded big supercomputer beating out rivals including the National Center for Supercomputing Ap Read more…

By John Russell

IBM at Hot Chips: What’s Next for Power

August 23, 2018

With processor, memory and networking technologies all racing to fill in for an ailing Moore’s law, the era of the heterogeneous datacenter is well underway, Read more…

By Tiffany Trader

Requiem for a Phi: Knights Landing Discontinued

July 25, 2018

On Monday, Intel made public its end of life strategy for the Knights Landing "KNL" Phi product set. The announcement makes official what has already been wide Read more…

By Tiffany Trader

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learni Read more…

By Rob Farber

House Passes $1.275B National Quantum Initiative

September 17, 2018

Last Thursday the U.S. House of Representatives passed the National Quantum Initiative Act (NQIA) intended to accelerate quantum computing research and developm Read more…

By John Russell

Summit Supercomputer is Already Making its Mark on Science

September 20, 2018

Summit, now the fastest supercomputer in the world, is quickly making its mark in science – five of the six finalists just announced for the prestigious 2018 Read more…

By John Russell

New Deep Learning Algorithm Solves Rubik’s Cube

July 25, 2018

Solving (and attempting to solve) Rubik’s Cube has delighted millions of puzzle lovers since 1974 when the cube was invented by Hungarian sculptor and archite Read more…

By John Russell

D-Wave Breaks New Ground in Quantum Simulation

July 16, 2018

Last Friday D-Wave scientists and colleagues published work in Science which they say represents the first fulfillment of Richard Feynman’s 1982 notion that Read more…

By John Russell

Leading Solution Providers

HPC on Wall Street 2018 Booth Video Tours Playlist

Arista

Dell EMC

IBM

Intel

RStor

VMWare

TACC’s ‘Frontera’ Supercomputer Expands Horizon for Extreme-Scale Science

August 29, 2018

The National Science Foundation and the Texas Advanced Computing Center announced today that a new system, called Frontera, will overtake Stampede 2 as the fast Read more…

By Tiffany Trader

HPE No. 1, IBM Surges, in ‘Bucking Bronco’ High Performance Server Market

September 27, 2018

Riding healthy U.S. and global economies, strong demand for AI-capable hardware and other tailwind trends, the high performance computing server market jumped 28 percent in the second quarter 2018 to $3.7 billion, up from $2.9 billion for the same period last year, according to industry analyst firm Hyperion Research. Read more…

By Doug Black

Intel Announces Cooper Lake, Advances AI Strategy

August 9, 2018

Intel's chief datacenter exec Navin Shenoy kicked off the company's Data-Centric Innovation Summit Wednesday, the day-long program devoted to Intel's datacenter Read more…

By Tiffany Trader

GPUs Power Five of World’s Top Seven Supercomputers

June 25, 2018

The top 10 echelon of the newly minted Top500 list boasts three powerful new systems with one common engine: the Nvidia Volta V100 general-purpose graphics proc Read more…

By Tiffany Trader

Germany Celebrates Launch of Two Fastest Supercomputers

September 26, 2018

The new high-performance computer SuperMUC-NG at the Leibniz Supercomputing Center (LRZ) in Garching is the fastest computer in Germany and one of the fastest i Read more…

By Tiffany Trader

MLPerf – Will New Machine Learning Benchmark Help Propel AI Forward?

May 2, 2018

Let the AI benchmarking wars begin. Today, a diverse group from academia and industry – Google, Baidu, Intel, AMD, Harvard, and Stanford among them – releas Read more…

By John Russell

Aerodynamic Simulation Reveals Best Position in a Peloton of Cyclists

July 5, 2018

Eindhoven University of Technology (TU/e) and KU Leuven research group conducts the largest numerical simulation ever done in the sport industry and cycling discipline. The goal was to understand the aerodynamic interactions in the peloton, i.e., the main pack of cyclists in a race. Read more…

Houston to Field Massive, ‘Geophysically Configured’ Cloud Supercomputer

October 11, 2018

Based on some news stories out today, one might get the impression that the next system to crack number one on the Top500 would be an industrial oil and gas mon Read more…

By Tiffany Trader

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This