Keeping Big Data Cool at SDSC

June 29, 2016

June 29 — When most people think of a supercomputer center, they may think of one massive computer performing a single task. Inside the data center at the San Diego Supercomputer Center (SDSC) at the University of California San Diego, however, there are several large supercomputer systems, each performing multiple tasks simultaneously across a wide range of science domains that include genome sequencing to help pave the way to personalized medical treatment, coming up with new drug designs for conditions such as Parkinson’s and Alzheimer’s disease, or creating detailed fluid dynamics simulations for hypersonic aircraft.

Keeping SDSC’s main data center cool enough so that its Comet and Gordon supercomputers, among smaller clusters, don’t overheat is a complex yet mission-critical task, according to Todor Milkov, SDSC’s senior project engineer. A computing architecture such as the one found in Comet, SDSC’s newest supercomputer, requires one megawatt of power to operate the system. Using that much electricity generates a tremendous amount of heat, so SDSC, with the help of outside experts, developed three cooling system prototypes and conducted research to determine the most efficient system.

Each prototype system was designed using vendor-specific technology controlling five air handlers as a baseline to evaluate system performance. One of the prototypes used wireless temperature sensors that read the temperature of the hot and cold aisles every three minutes to increase battery life.

SDSC Datacenter AisleMany data centers use a standard hot aisle/cold aisle design. This design involves lining up server racks in alternating rows, with cold air intakes facing one way and hot air exhausts facing the other. The rows composed of rack fronts are called cold aisles. Typically, cold aisles face air conditioner output ducts. The rows that the heated exhausts pour into are called hot aisles. Typically, hot aisles face air conditioner return ducts.

Containment systems can help isolate hot aisles and cold aisles from each other and prevent hot and cold air from mixing. Such systems started out as using physical barriers that simply separated the hot and cold aisles with vinyl plastic sheeting or Plexiglas covers. Modern containment systems offer plenums and other commercial options that combine containment with variable fan drives (VFDs) to prevent cold air and hot air from mixing.

At SDSC, however, the entire area under the raised floor is used for the supply plenum, and the entire area above the ceiling is for the return plenum. Cold aisles use perforated floor tiles with specifically designed hole sizes to control the air flow volume from the space below the floor, while the hot aisles use ceiling grates that allow heated air to enter the space above the ceiling.

Controlling the air flow from all air handlers discharging into one common plenum presents a difficult problem, especially since these spaces also contain obstructions such as pipes and conduits. Moreover, not all of the compute clusters run at full capacity at any given time, and systems loads also change regularly as research projects start up or stop. These constantly changing factors cause the amount of heat dissipated from the supercomputer systems to fluctuate from minute to minute. The data center cooling system has to quickly adjust to accommodate these fluctuations in temperature.

“We learned a lot during the prototype and research phase of the cooling system design,” said Milkov. “We started by collecting a lot of data on how air flowed through the data center. We found that three minutes between temperature readings was too long an interval to keep the data center within the desired temperature ranges. Because of the longer interval, we used more electricity bringing the data center back to its temperature set points than we needed if we took temperature readings over shorter intervals and could make changes to the cooling system sooner.”

Realizing that a different approach was needed, Milkov put together a vendor evaluation process for an updated data center management system with the objective of reducing energy use while increasing the level of control capability available to the SDSC operations staff.

After extensive research, Milkov selected three companies for prototype installations. At the conclusion of a detailed evaluation, systems integration company Earth Base One (EBO) Corporation and a SNAP PAC-based control system were chosen for providing extensive control capabilities and energy savings.

Milkov and Michael Hyde, EBO’s president, approached the project with the same vision. “Rather than adapting an off-the-shelf data center management system to SDSC, we designed a tailor-built system for SDSC’s unique challenges,” said Hyde.

Opto 22, which develops and manufactures hardware and software products for applications in industrial automation, remote monitoring, and data acquisition, was chosen as the primary controls manufacturer. “The Opto 22 hardware and software not only won the competition for control and energy savings, but was also the least expensive vendor solution,” said Hyde. “The software’s excellent historical data collection and trending abilities allowed SDSC engineers to continue improving the system based on real data.”

“We appreciated the outstanding technical support SDSC received from Opto 22 during our design and prototype phase,” said Milkov. “When you’re trying to protect millions of dollars’ worth of research, you need a control system you can rely on.”

The full case study is available here.

About SDSC

As an Organized Research Unit of UC San Diego, SDSC is considered a leader in data-intensive computing and cyberinfrastructure, providing resources, services, and expertise to the national research community, including industry and academia. Cyberinfrastructure refers to an accessible, integrated network of computer-based resources and expertise, focused on accelerating scientific inquiry and discovery. SDSC supports hundreds of multidisciplinary programs spanning a wide variety of domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. SDSC’s Comet joins the Center’s data-intensive Gordon cluster, and are both part of the National Science Foundation’s XSEDE (eXtreme Science and Engineering Discovery Environment) program, the most advanced collection of integrated digital resources and services in the world.

About Opto 22

Opto 22 develops and manufactures hardware and software products for applications in industrial automation, remote monitoring, and data acquisition. Using standard, commercially available Internet, networking, and computer technologies, Opto 22’s input/output and control systems allow customers to monitor, control, and acquire data from all of the mechanical, electrical, and electronic assets that are key to their business operations. More information is at www.opto22.com.


Source: SDSC

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

NREL ‘Eagle’ Supercomputer to Advance Energy Tech R&D

August 14, 2018

The U.S. Department of Energy (DOE) National Renewable Energy Laboratory (NREL) has contracted with Hewlett Packard Enterprise (HPE) for a new 8-petaflops (peak) supercomputer that will be used to advance early-stage R&a Read more…

By Tiffany Trader

Training Time Slashed for Deep Learning

August 14, 2018

Fast.ai, an organization offering free courses on deep learning, claimed a new speed record for training a popular image database using Nvidia GPUs running on public cloud infrastructure. A pair of researchers trained Read more…

By George Leopold

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learning. The CERN team demonstrated that AI-based models have the Read more…

By Rob Farber

HPE Extreme Performance Solutions

Introducing the First Integrated System Management Software for HPC Clusters from HPE

How do you manage your complex, growing cluster environments? Answer that big challenge with the new HPC cluster management solution: HPE Performance Cluster Manager. Read more…

IBM Accelerated Insights

Super Problem Solving

You might think that tackling the world’s toughest problems is a job only for superheroes, but at special places such as the Oak Ridge National Laboratory, supercomputers are the real heroes. Read more…

Rigetti Eyes Scaling with 128-Qubit Architecture

August 10, 2018

Rigetti Computing plans to build a 128-qubit quantum computer based on an equivalent quantum processor that leverages emerging hybrid computing algorithms used to test programs and potential applications. Founded in 2 Read more…

By George Leopold

NREL ‘Eagle’ Supercomputer to Advance Energy Tech R&D

August 14, 2018

The U.S. Department of Energy (DOE) National Renewable Energy Laboratory (NREL) has contracted with Hewlett Packard Enterprise (HPE) for a new 8-petaflops (peak Read more…

By Tiffany Trader

CERN Project Sees Orders-of-Magnitude Speedup with AI Approach

August 14, 2018

An award-winning effort at CERN has demonstrated potential to significantly change how the physics based modeling and simulation communities view machine learni Read more…

By Rob Farber

Intel Announces Cooper Lake, Advances AI Strategy

August 9, 2018

Intel's chief datacenter exec Navin Shenoy kicked off the company's Data-Centric Innovation Summit Wednesday, the day-long program devoted to Intel's datacenter Read more…

By Tiffany Trader

SLATE Update: Making Math Libraries Exascale-ready

August 9, 2018

Practically-speaking, achieving exascale computing requires enabling HPC software to effectively use accelerators – mostly GPUs at present – and that remain Read more…

By John Russell

Summertime in Washington: Some Unexpected Advanced Computing News

August 8, 2018

Summertime in Washington DC is known for its heat and humidity. That is why most people get away to either the mountains or the seashore and things slow down. H Read more…

By Alex R. Larzelere

NSF Invests $15 Million in Quantum STAQ

August 7, 2018

Quantum computing development is in full ascent as global backers aim to transcend the limitations of classical computing by leveraging the magical-seeming prop Read more…

By Tiffany Trader

By the Numbers: Cray Would Like Exascale to Be the Icing on the Cake

August 1, 2018

On its earnings call held for investors yesterday, Cray gave an accounting for its latest quarterly financials, offered future guidance and provided an update o Read more…

By Tiffany Trader

Google is First Partner in NIH’s STRIDES Effort to Speed Discovery in the Cloud

July 31, 2018

The National Institutes of Health, with the help of Google, last week launched STRIDES - Science and Technology Research Infrastructure for Discovery, Experimen Read more…

By John Russell

Leading Solution Providers

SC17 Booth Video Tours Playlist

Altair @ SC17

Altair

AMD @ SC17

AMD

ASRock Rack @ SC17

ASRock Rack

CEJN @ SC17

CEJN

DDN Storage @ SC17

DDN Storage

Huawei @ SC17

Huawei

IBM @ SC17

IBM

IBM Power Systems @ SC17

IBM Power Systems

Intel @ SC17

Intel

Lenovo @ SC17

Lenovo

Mellanox Technologies @ SC17

Mellanox Technologies

Microsoft @ SC17

Microsoft

Penguin Computing @ SC17

Penguin Computing

Pure Storage @ SC17

Pure Storage

Supericro @ SC17

Supericro

Tyan @ SC17

Tyan

Univa @ SC17

Univa

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This