Bull Makes Big Push Into HPC with New Supercomputer Blades

By Michael Feldman

June 16, 2009

French-owned computer maker Bull has unveiled a new family of HPC servers based on a novel blade architecture. Branded as “bullx,” for extreme computing, the blades are designed for speed, density, energy efficiency and ease of management. The new offering will take over the HPC mantle from Bull’s NovaScale servers, which will continue to be sold to enterprise customers for more standard computing workloads.

Bull has been building its HPC capabilities for the last few years, and in the past 18 months has acquired two companies, Serviware and science + computing ag (s+c), to add to its portfolio. Paris-based Serviware brought its integration expertise in deploying complex cluster systems, while Stuttgart-based s+c contributes its ability to help customers manage complex HPC infrastructure. Now, with bullx, the company has a purpose-built HPC architecture to distinguish itself from commodity cluster vendors.

Unlike many other blade-based architectures, which are designed to handle both enterprise and HPC workloads, the new servers were built specifically with high performance computing in mind. “This system was designed from the start to support HPC applications…with no compromise on performance,” said Fabio Gallo, vice president of the extreme computing business unit at Bull. The architecture is meant to scale from single-chassis systems all the way up to top-of-the-line supercomputers, where just 100 bullx racks can deliver a petaflop of computing horsepower.

The bullx blades come in two flavors: CPU-only and GPU-accelerated. Both versions are based on dual-socket Nehalem EP (Xeon 5500) nodes, but the accelerator blades include up to two NVIDIA Tesla M1060 GPUs on board. CPU-only and CPU-GPU blades can be mixed within a system, but only the CPU blade is currently available. Bull plans to launch the GPU-equipped version in November.

The basic building block for a bullx system is a 7U 18-blade chassis. A CPU-only configuration delivers up to 1.7 teraflops and an entire 42U rack will yield 10 teraflops (108 nodes of dual-socket, quad-core). The interconnects are managed through the backplane, so there are no external cables save for the power supplies. An optional 36-port InfiniBand switch can be slotted into the chassis for cluster connectivity.
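The chassis and rack figures above can be roughly reproduced with a back-of-the-envelope peak calculation. The sketch below is illustrative only: the 2.93 GHz clock is an assumption (Bull's announcement as reported here does not name a processor bin), as is the premise that each blade is one dual-socket node and that the rack is filled entirely with chassis. The four double-precision operations per core per cycle reflect Nehalem's SSE units.

```python
# Rough peak double-precision estimate for a CPU-only bullx configuration.
# Figures marked ASSUMPTION are not taken from the article.

CHASSIS_HEIGHT_U = 7        # from the article
BLADES_PER_CHASSIS = 18     # from the article; one dual-socket node per blade
RACK_HEIGHT_U = 42          # from the article
SOCKETS_PER_NODE = 2        # dual-socket Nehalem EP
CORES_PER_SOCKET = 4        # quad-core Xeon 5500
FLOPS_PER_CYCLE = 4         # Nehalem SSE: 2-wide add + 2-wide multiply per cycle
CLOCK_GHZ = 2.93            # ASSUMPTION: a high-bin Xeon 5500 clock speed


def peak_teraflops(nodes, clock_ghz=CLOCK_GHZ):
    """Theoretical peak in teraflops for a given number of nodes."""
    gflops = (nodes * SOCKETS_PER_NODE * CORES_PER_SOCKET
              * FLOPS_PER_CYCLE * clock_ghz)
    return gflops / 1000.0


# ASSUMPTION: the rack holds nothing but chassis.
chassis_per_rack = RACK_HEIGHT_U // CHASSIS_HEIGHT_U
nodes_per_rack = chassis_per_rack * BLADES_PER_CHASSIS

if __name__ == "__main__":
    print(f"chassis per rack: {chassis_per_rack}")
    print(f"nodes per rack:   {nodes_per_rack}")
    print(f"chassis peak:     {peak_teraflops(BLADES_PER_CHASSIS):.2f} TF")
    print(f"rack peak:        {peak_teraflops(nodes_per_rack):.2f} TF")
    print(f"100-rack peak:    {peak_teraflops(100 * nodes_per_rack):.0f} TF")
```

Under these assumptions the per-chassis and per-rack results land close to the quoted 1.7 and 10 teraflops, and 100 racks come out at roughly a petaflop. These are theoretical peaks, not sustained application performance.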

The Bull engineers maxed out just about every system component for a dual-socket set-up, especially I/O. A single node incorporates two Intel Tylersburg chipsets, which each provide a PCIe x16 and PCIe x8 interface. This enables each node to drive up to two on-board QDR InfiniBand ConnectX chips as well as two GPU accelerators. The ability to support dual on-board QDR and dual on-board GPUs is probably the most distinguishing hardware feature of the bullx design, and places the servers among the most advanced being sold into the HPC market.

The memory subsystem is also high-end. Each socket supports three channels of DDR3 memory for a total of six, and up to 12 DIMMs can be loaded on each node, for a maximum memory capacity of 96 GB (using 8GB DIMMs).
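The memory arithmetic is simple enough to check directly. This short sketch uses only the numbers quoted above. The rack-level total is an extrapolation that assumes all 108 nodes in a rack are fully populated, which the article does not state.

```python
# Memory capacity per node, using the figures quoted in the article.

MEM_SOCKETS_PER_NODE = 2
MEM_CHANNELS_PER_SOCKET = 3   # DDR3 channels per Nehalem EP socket
MEM_DIMMS_PER_NODE = 12
MEM_DIMM_SIZE_GB = 8

channels_per_node = MEM_SOCKETS_PER_NODE * MEM_CHANNELS_PER_SOCKET
dimms_per_channel = MEM_DIMMS_PER_NODE // channels_per_node
max_memory_gb = MEM_DIMMS_PER_NODE * MEM_DIMM_SIZE_GB

# ASSUMPTION: every node in a 108-node rack carries the maximum memory.
NODES_PER_RACK = 108
rack_memory_gb = NODES_PER_RACK * max_memory_gb

if __name__ == "__main__":
    print(f"channels per node: {channels_per_node}")
    print(f"DIMMs per channel: {dimms_per_channel}")
    print(f"max memory/node:   {max_memory_gb} GB")
    print(f"max memory/rack:   {rack_memory_gb} GB")
```

Twelve DIMMs across six channels works out to two DIMMs per channel.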

A Bull-engineered ultra capacitor module (UCM) can also be included with each chassis to protect the system from 250ms power brown-outs. The UCM also eliminates the requirement for UPS on each individual node. By doing this, the module will save 15 percent in power costs, according to Bull. A water-cooled rack door can be installed to save even more power. Bull estimates water cooling will save about 75 percent of the cooling costs, which can be a third to a half of the total power consumption for a typical system. Racks can be air cooled as well, but for denser configurations (and especially when GPU accelerators are present) water cooling is going to be the way to go.
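To put the water-cooling estimate in terms of total power, one can multiply the two fractions Bull cites. The sketch below assumes that cooling cost scales directly with cooling power, which is a simplification and not something Bull states; the function name is purely illustrative.

```python
# Share of total system power saved by water cooling, from Bull's two
# quoted fractions. ASSUMPTION: cooling cost is proportional to cooling power.

COOLING_REDUCTION = 0.75  # Bull's estimate: water cooling saves ~75% of cooling costs


def total_power_saving(cooling_share, cooling_reduction=COOLING_REDUCTION):
    """Fraction of total power saved, given cooling's share of the total."""
    return cooling_share * cooling_reduction


low_saving = total_power_saving(1 / 3)   # cooling is a third of total power
high_saving = total_power_saving(1 / 2)  # cooling is half of total power

if __name__ == "__main__":
    print(f"saving when cooling is 1/3 of total: {low_saving:.1%}")
    print(f"saving when cooling is 1/2 of total: {high_saving:.1%}")
```

On those figures, the water-cooled door would trim somewhere between a quarter and three-eighths of total power draw for a typical system.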

On the system software side, bullx comes with a Linux-based cluster suite, derived mainly from open source components (although Microsoft Windows HPC Server is also an option). The bullx cluster suite provides the usual job scheduling and resource management, libraries, Lustre file system support, and interconnect access. It also offers installation/configuration support as well as cluster diagnostics, monitoring and control. The cluster suite was designed for fast installation and updates, and provides seamless management of systems using a mixture of CPU-only and GPU-accelerated nodes.

Bull is positioning the new HPC blades at the high end of the HPC market, covering the top third of the departmental HPC segment, and all of the divisional and supercomputing segments. At this point, the company is confining most of its efforts to Western Europe, where it’s already made inroads selling HPC systems to customers like Airbus, Dassault Aviation, Total, CEA, and the Jülich Supercomputing Center, among others. Bull estimates revenue in the segments it is going after at EUR1.2 billion in 2009, growing to EUR1.6 billion in 2013. The company’s goal is to grab 10 percent of that market. “We definitely think we’re on track in reaching that objective,” said Bruno Pinna, Bull’s group marketing director.

Bull is also targeting HPC opportunities in South America and Africa, and will be looking to expand its presence there and in other emerging markets. But for the time being, the company appears reluctant to expand into North America and tackle the likes of IBM, HP, and Dell on their home turf.

Two customers have already signed up for bullx machines: CEA, the French authority for nuclear energy, and the University of Cologne in Germany. Both are CPU-only deployments, although CEA recently completed installation of a 300-teraflop Bull supercomputer based on NovaScale servers and NVIDIA GPUs.

Pricing on the new gear is not publicly available, although Bull says the new bullx systems will generally cost from a few tens of thousands of euros for departmental, single-chassis configurations, to several tens of millions of euros for petascale-class systems.
