The State of the Lustre Community

By Brent Gorda

August 1, 2011

A year ago the Lustre community was stunned by Oracle’s message at the 2010 Lustre User Group (LUG). Lustre was no longer a vendor neutral platform; you had to buy Sun/Oracle storage hardware to get future versions of the software. The community uproar was strong to the threat HPC’s most popular file system going away. As a result, a flurry of activity ensued with the formation of multiple community groups and startups, including the founding of the Lustre-focused startup Whamcloud.

The Lustre community quickly organized meetings in Europe at the Jülich Supercomputing Centre to self organize and react to the issue. Because the Lustre code is under GPL (GNU General Public License), the threat of a fork, or multiple forks was in the air and everyone wondered what would be the fate of the technology.

Out of the meetings, not-for-profit community groups were formed including:

  • EOFS — The European Open File System Cooperative (Europe-based)
  • OpenSFS –The Open Scalable File System (US-based)
  • HPCFS — The High Performance Cluster File System (US-based)

Everyone had the best interests of Lustre in mind and opinions on how to best care for this critical bit of technology that so many were dependent upon.

The two US-based groups initially differed in their approach, a concern for those not paying close attention to the players and their goals. OpenSFS raised a significant amount of funds and pledged to lead the community through continued investment by the DOE labs (LLNL, ORNL) and interested HPC vendors (DataDirect Networks, Cray), the original members of the organization. Continued support from these large players would, OpenSFS stated, ensure the longevity of the technology and its availability to all. The argument made good sense as that was the environment in which Lustre was born and matured.

The HPCFS approach was to organize the community and leverage the resources that exist at member sites. By having minimal fees, the membership would thrive and self-organize to do the work necessary to preserve and move the technology forward. This focus on grassroots strengths saw that the large outpouring of interest in helping Lustre would carry the weight of caring for the source code, making releases and providing support.

In Europe, the community debated the issues and formed EOFS after the well-planned and attended meetings in Jülich. The organization took time to ensure that all voices were heard and in December of 2010 the founders gathered in Munich to officially sign the documents necessary for a non-profit cooperative to be headquartered in any of the European Union participating countries.

For several months leading up to the 2011 Lustre User Group, the three groups communicated to Lustre sites, within their organizations and across organizations. The feeling was that there was a bit of an overreaction and, as time passed and people communicated, everyone began to realize that we are all on the same team with the same goal of preserving the Lustre technology for the community worldwide.

At this year’s 2011 Lustre User Group, the two US-based groups announced a merger that was completed a few weeks later. At the International Supercomputing (ISC) show in Germany, shortly thereafter, the two existing community groups, OpenSFS and EOFS, signed a memorandum of understanding to show their allegiance.

Naysayers had been predicting a fork in the code and a fracturing of the community. Alternate solution providers were putting the scare on — spreading fear and doubt as to the future of the technology. In the end, however, they have all been proven wrong. There has been no fork, there is one base tree from which everyone has agreed to make their releases. And the community has not fractured. Instead, it has pulled more closely together than ever.

It’s important to mention the major companies that have been responsible for developing and spreading Lustre. In the past few years, the technology has seen quite a bit of change in corporate stewardship. In the early days, it had been closely held in a small company Cluster File Systems. Sun pledged support for HPC and acquired CFS but their big push into the HPC market faded with the economy. Finally at Oracle, community concern was based on a perceived lack of interest in anything HPC.

I think that we in the community owe these companies credit for the excellent state that Lustre is in and the fact that it is still fully available as open source. Lustre is, today, a mature and stable technology due in large part to the significant investment made in the Lustre engineering teams over the past several years. As just one example, through the Hyperion consortium at LLNL, Sun and Oracle invested in equipment and, significantly, in the engineering time necessary to mature the technology. This should not be overlooked.

Today the HPC community finds itself in excellent position. Through the efforts of all the groups, the community at large and the steadfast confidence of the engineering teams, Lustre has emerged in fine form and stronger than ever. As we look forward to embarking on the long path to exascale, Lustre is an obvious technology choice for the journey. At the end of this estimated 8 to 10 year effort, we may not recognize the code –it may all be replaced by then — and the name may be different, but the platform we are starting with is second to none in the industry.

—–

About the author

Brent Gorda, Whamcloud CEO and President, joined Whamcloud from the US Department of Energy where he was involved in program funding and strategic adoption of the Lustre File System at the Lawrence Livermore National Laboratory and other ASCI labs.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

2017 Gordon Bell Prize Finalists Named

October 23, 2017

The three finalists for this year’s Gordon Bell Prize in High Performance Computing have been announced. They include two papers on projects run on China’s Sunway TaihuLight system and a third paper on 3D image recon Read more…

By John Russell

Data Vortex Users Contemplate the Future of Supercomputing

October 19, 2017

Last month (Sept. 11-12), HPC networking company Data Vortex held its inaugural users group at Pacific Northwest National Laboratory (PNNL) bringing together about 30 participants from industry, government and academia t Read more…

By Tiffany Trader

AI Self-Training Goes Forward at Google DeepMind

October 19, 2017

DeepMind, Google’s AI research organization, announced today in a blog that AlphaGo Zero, the latest evolution of AlphaGo (the first computer program to defeat a Go world champion) trained itself within three days to play Go at a superhuman level (i.e., better than any human) – and to beat the old version of AlphaGo – without leveraging human expertise, data or training. Read more…

By Doug Black

HPE Extreme Performance Solutions

Transforming Genomic Analytics with HPC-Accelerated Insights

Advancements in the field of genomics are revolutionizing our understanding of human biology, rapidly accelerating the discovery and treatment of genetic diseases, and dramatically improving human health. Read more…

Researchers Scale COSMO Climate Code to 4888 GPUs on Piz Daint

October 17, 2017

Effective global climate simulation, sorely needed to anticipate and cope with global warming, has long been computationally challenging. Two of the major obstacles are the needed resolution and prolonged time to compute Read more…

By John Russell

Data Vortex Users Contemplate the Future of Supercomputing

October 19, 2017

Last month (Sept. 11-12), HPC networking company Data Vortex held its inaugural users group at Pacific Northwest National Laboratory (PNNL) bringing together ab Read more…

By Tiffany Trader

AI Self-Training Goes Forward at Google DeepMind

October 19, 2017

DeepMind, Google’s AI research organization, announced today in a blog that AlphaGo Zero, the latest evolution of AlphaGo (the first computer program to defeat a Go world champion) trained itself within three days to play Go at a superhuman level (i.e., better than any human) – and to beat the old version of AlphaGo – without leveraging human expertise, data or training. Read more…

By Doug Black

Student Cluster Competition Coverage New Home

October 16, 2017

Hello computer sports fans! This is the first of many (many!) articles covering the world-wide phenomenon of Student Cluster Competitions. Finally, the Student Read more…

By Dan Olds

Intel Delivers 17-Qubit Quantum Chip to European Research Partner

October 10, 2017

On Tuesday, Intel delivered a 17-qubit superconducting test chip to research partner QuTech, the quantum research institute of Delft University of Technology (TU Delft) in the Netherlands. The announcement marks a major milestone in the 10-year, $50-million collaborative relationship with TU Delft and TNO, the Dutch Organization for Applied Research, to accelerate advancements in quantum computing. Read more…

By Tiffany Trader

Fujitsu Tapped to Build 37-Petaflops ABCI System for AIST

October 10, 2017

Fujitsu announced today it will build the long-planned AI Bridging Cloud Infrastructure (ABCI) which is set to become the fastest supercomputer system in Japan Read more…

By John Russell

HPC Chips – A Veritable Smorgasbord?

October 10, 2017

For the first time since AMD's ill-fated launch of Bulldozer the answer to the question, 'Which CPU will be in my next HPC system?' doesn't have to be 'Whichever variety of Intel Xeon E5 they are selling when we procure'. Read more…

By Dairsie Latimer

Delays, Smoke, Records & Markets – A Candid Conversation with Cray CEO Peter Ungaro

October 5, 2017

Earlier this month, Tom Tabor, publisher of HPCwire and I had a very personal conversation with Cray CEO Peter Ungaro. Cray has been on something of a Cinderell Read more…

By Tiffany Trader & Tom Tabor

Intel Debuts Programmable Acceleration Card

October 5, 2017

With a view toward supporting complex, data-intensive applications, such as AI inference, video streaming analytics, database acceleration and genomics, Intel i Read more…

By Doug Black

Reinders: “AVX-512 May Be a Hidden Gem” in Intel Xeon Scalable Processors

June 29, 2017

Imagine if we could use vector processing on something other than just floating point problems.  Today, GPUs and CPUs work tirelessly to accelerate algorithms Read more…

By James Reinders

NERSC Scales Scientific Deep Learning to 15 Petaflops

August 28, 2017

A collaborative effort between Intel, NERSC and Stanford has delivered the first 15-petaflops deep learning software running on HPC platforms and is, according Read more…

By Rob Farber

Oracle Layoffs Reportedly Hit SPARC and Solaris Hard

September 7, 2017

Oracle’s latest layoffs have many wondering if this is the end of the line for the SPARC processor and Solaris OS development. As reported by multiple sources Read more…

By John Russell

US Coalesces Plans for First Exascale Supercomputer: Aurora in 2021

September 27, 2017

At the Advanced Scientific Computing Advisory Committee (ASCAC) meeting, in Arlington, Va., yesterday (Sept. 26), it was revealed that the "Aurora" supercompute Read more…

By Tiffany Trader

How ‘Knights Mill’ Gets Its Deep Learning Flops

June 22, 2017

Intel, the subject of much speculation regarding the delayed, rewritten or potentially canceled “Aurora” contract (the Argonne Lab part of the CORAL “ Read more…

By Tiffany Trader

Google Releases Deeplearn.js to Further Democratize Machine Learning

August 17, 2017

Spreading the use of machine learning tools is one of the goals of Google’s PAIR (People + AI Research) initiative, which was introduced in early July. Last w Read more…

By John Russell

Nvidia Responds to Google TPU Benchmarking

April 10, 2017

Nvidia highlights strengths of its newest GPU silicon in response to Google's report on the performance and energy advantages of its custom tensor processor. Read more…

By Tiffany Trader

GlobalFoundries Puts Wind in AMD’s Sails with 12nm FinFET

September 24, 2017

From its annual tech conference last week (Sept. 20), where GlobalFoundries welcomed more than 600 semiconductor professionals (reaching the Santa Clara venue Read more…

By Tiffany Trader

Leading Solution Providers

Graphcore Readies Launch of 16nm Colossus-IPU Chip

July 20, 2017

A second $30 million funding round for U.K. AI chip developer Graphcore sets up the company to go to market with its “intelligent processing unit” (IPU) in Read more…

By Tiffany Trader

Amazon Debuts New AMD-based GPU Instances for Graphics Acceleration

September 12, 2017

Last week Amazon Web Services (AWS) streaming service, AppStream 2.0, introduced a new GPU instance called Graphics Design intended to accelerate graphics. The Read more…

By John Russell

EU Funds 20 Million Euro ARM+FPGA Exascale Project

September 7, 2017

At the Barcelona Supercomputer Centre on Wednesday (Sept. 6), 16 partners gathered to launch the EuroEXA project, which invests €20 million over three-and-a-half years into exascale-focused research and development. Led by the Horizon 2020 program, EuroEXA picks up the banner of a triad of partner projects — ExaNeSt, EcoScale and ExaNoDe — building on their work... Read more…

By Tiffany Trader

Delays, Smoke, Records & Markets – A Candid Conversation with Cray CEO Peter Ungaro

October 5, 2017

Earlier this month, Tom Tabor, publisher of HPCwire and I had a very personal conversation with Cray CEO Peter Ungaro. Cray has been on something of a Cinderell Read more…

By Tiffany Trader & Tom Tabor

Cray Moves to Acquire the Seagate ClusterStor Line

July 28, 2017

This week Cray announced that it is picking up Seagate's ClusterStor HPC storage array business for an undisclosed sum. "In short we're effectively transitioning the bulk of the ClusterStor product line to Cray," said CEO Peter Ungaro. Read more…

By Tiffany Trader

Intel Launches Software Tools to Ease FPGA Programming

September 5, 2017

Field Programmable Gate Arrays (FPGAs) have a reputation for being difficult to program, requiring expertise in specialty languages, like Verilog or VHDL. Easin Read more…

By Tiffany Trader

IBM Advances Web-based Quantum Programming

September 5, 2017

IBM Research is pairing its Jupyter-based Data Science Experience notebook environment with its cloud-based quantum computer, IBM Q, in hopes of encouraging a new class of entrepreneurial user to solve intractable problems that even exceed the capabilities of the best AI systems. Read more…

By Alex Woodie

HPC Chips – A Veritable Smorgasbord?

October 10, 2017

For the first time since AMD's ill-fated launch of Bulldozer the answer to the question, 'Which CPU will be in my next HPC system?' doesn't have to be 'Whichever variety of Intel Xeon E5 they are selling when we procure'. Read more…

By Dairsie Latimer

  • arrow
  • Click Here for More Headlines
  • arrow
Share This