Spider II Emerges to Give ORNL a Big Speed Boost

By Alex Woodie

August 16, 2013

When the Jaguar supercomputer at Oak Ridge National Laboratory morphed into Titan in 2012, it delivered a huge increase in computational power. Recently, the ORNL’s parallel file system, called Spider, received a similar overhaul, and is in the process of emerging as Spider II.

When it goes online this fall, the Lustre-based Spider II file system will deliver more than 1TB per second of high-end bandwidth across the ORNL’s InfiniBand-based network, up from the 240GB per second delivered by the original Spider. The total storage capacity of the file system increased from 10PB to 32PB, according to a story on the Oak Ridge Leadership Computing Facility website.

“At that speed we expect Spider II to be safely in league with the top three parallel file systems in the world,” Sarp Oral, the task lead for File and Storage Systems projects in the Technology Integration Group, within the National Center for Computational Sciences (NCCS), told the ORLCF.

Spider was unique in that it was the first center-wide shared resource that served all major OLCF platforms, including Jaguar (now Titan), the LENS visualization cluster, the Smoky development cluster, and the lab’s GridFTP servers. Data stored centrally in Spider was accessible to these and other systems–a total of 26,000-plus compute nodes in all.

The physical dimensions of Spider have increased with Spider II, which occupies 672 square feet across four rows of cabinets. Inside the cabinets are I/O servers and a high-end storage array that controls more than 20,000 disks.

The Spider II project also included an upgrade to Lustre 2.4, which should improve the lab’s scalability and metadata performance, and deliver other new features that will benefit the lab. 

For example, Lustre 2.4 expands the number of object storage targets for single shared files from 160 to 2,000. An enhancement to the distributed namespace system will support a greater number of users and improve overall metadata performance and scalability, the ORLCF says, while full recoveries of Titan will also be able to be performed in a matter of minutes–“a huge reduction from previous times.”

“Spider II allows our parallel file system to keep pace with the newly increased size and computational horsepower of Titan,” Bronson Messer of the NCCS Scientific Computing Group told the ORLCF. “The anticipated metadata improvement, in particular, should enable our users to produce and analyze the kind of large, complex datasets we anticipate being produced on Titan. Spider II should be both bigger, and better.” 

Spider II is the result of collaboration by many parties, including the OLCF staff, Data Direct Networks (DDN), Cray, Mellanox, and Dell.

Related Articles

Spider Up and Spinning Connections to All Computing Platforms at ORNL

Vampir Rises to the Occasion at ORNL

Raijin Debuts as Fastest Supercomputer in Australia

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

2022 HPC Road Trip: LBNL, NERSC, and ESnet Briefings

February 7, 2023

Time to finally(!) clear the 2022 decks and get the rest of the 2022 Great American Supercomputing Road Trip content out into the wild. The last part of the year was grueling with more than 5,000 miles of driving over Read more…

Decarbonization Initiative at NETL Gets Computing Boost

February 7, 2023

A major initiative by U.S. president Joe Biden called EarthShots to decarbonize the power grid by 2035 and the U.S. economy by 2050 is getting a major boost through a computing breakthrough at the National Energy Technol Read more…

Nvidia Touts Strong Results on Financial Services Inference Benchmark

February 3, 2023

The next-gen Hopper family may be on its way, but that isn’t stopping Nvidia’s popular A100 GPU from leading another benchmark on its way out. This time, it’s the STAC-ML inference benchmark, produced by the Securi Read more…

Quantum Computing Firm Rigetti Faces Delisting

February 3, 2023

Quantum computing companies are seeing their market caps crumble as investors patiently await out the winner-take-all approach to technology development. Quantum computing firms such as Rigetti Computing, IonQ and D-Wave went public through mergers with blank-check companies in the last two years, with valuations at the time of well over $1 billion. Now the market capitalization of these companies are less than half... Read more…

US and India Strengthen HPC, Quantum Ties Amid Tech Tension with China

February 2, 2023

Last May, the United States and India announced the “Initiative on Critical and Emerging Technology” (iCET), aimed at expanding the countries’ partnerships in strategic technologies and defense industries across th Read more…

AWS Solution Channel

Shutterstock 1072473599

Optimizing your AWS Batch architecture for scale with observability dashboards

AWS Batch is a fully managed service enabling you to run computational jobs at any scale without the need to manage compute resources. Customers often ask for guidance to optimize their architectures and make their workload to scale rapidly using the service. Read more…

 

Shutterstock 1453953692

Microsoft and NVIDIA Experts Talk AI Infrastructure

As AI emerges as a crucial tool in so many sectors, it’s clear that the need for optimized AI infrastructure is growing. Going beyond just GPU-based clusters, cloud infrastructure that provides low-latency, high-bandwidth interconnects and high-performance storage can help organizations handle AI workloads more efficiently and produce faster results. Read more…

Pittsburgh Supercomputing Enables Transparent Medicare Outcome AI

February 2, 2023

Medical applications of AI are replete with promise, but stymied by opacity: with lives on the line, concerns over AI models’ often-inscrutable reasoning – and as a result, possible biases embedded in those models Read more…

2022 HPC Road Trip: LBNL, NERSC, and ESnet Briefings

February 7, 2023

Time to finally(!) clear the 2022 decks and get the rest of the 2022 Great American Supercomputing Road Trip content out into the wild. The last part of the y Read more…

Decarbonization Initiative at NETL Gets Computing Boost

February 7, 2023

A major initiative by U.S. president Joe Biden called EarthShots to decarbonize the power grid by 2035 and the U.S. economy by 2050 is getting a major boost thr Read more…

Nvidia Touts Strong Results on Financial Services Inference Benchmark

February 3, 2023

The next-gen Hopper family may be on its way, but that isn’t stopping Nvidia’s popular A100 GPU from leading another benchmark on its way out. This time, it Read more…

Quantum Computing Firm Rigetti Faces Delisting

February 3, 2023

Quantum computing companies are seeing their market caps crumble as investors patiently await out the winner-take-all approach to technology development. Quantum computing firms such as Rigetti Computing, IonQ and D-Wave went public through mergers with blank-check companies in the last two years, with valuations at the time of well over $1 billion. Now the market capitalization of these companies are less than half... Read more…

US and India Strengthen HPC, Quantum Ties Amid Tech Tension with China

February 2, 2023

Last May, the United States and India announced the “Initiative on Critical and Emerging Technology” (iCET), aimed at expanding the countries’ partnership Read more…

Intel’s Gaudi3 AI Chip Survives Axe, Successor May Combine with GPUs

February 1, 2023

Intel's paring projects and products amid financial struggles, but AI products are taking on a major role as the company tweaks its chip roadmap to account for Read more…

Roadmap for Building a US National AI Research Resource Released

January 31, 2023

Last week the National AI Research Resource (NAIRR) Task Force released its final report and roadmap for building a national AI infrastructure to include comput Read more…

PFAS Regulations, 3M Exit to Impact Two-Phase Cooling in HPC

January 27, 2023

Per- and polyfluoroalkyl substances (PFAS), known as “forever chemicals,” pose a number of health risks to humans, with more suspected but not yet confirmed Read more…

Leading Solution Providers

Contributors

SC22 Booth Videos

AMD @ SC22
Altair @ SC22
AWS @ SC22
Ayar Labs @ SC22
CoolIT @ SC22
Cornelis Networks @ SC22
DDN @ SC22
Dell Technologies @ SC22
HPE @ SC22
Intel @ SC22
Intelligent Light @ SC22
Lancium @ SC22
Lenovo @ SC22
Microsoft and NVIDIA @ SC22
One Stop Systems @ SC22
Penguin Solutions @ SC22
QCT @ SC22
Supermicro @ SC22
Tuxera @ SC22
Tyan Computer @ SC22
  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire