SARA Opens Gate for HPC Cloud Researchers

By Nicole Hemsoth

May 10, 2010

Researchers in the Netherlands are being granted the opportunity to take part in a grand HPC experiment over the coming year as the limits of BiG Grid are pushed into the cloud. If the full test succeeds, other national and international grid and research organizations could launch similar efforts in the coming years.

As one of the national grid projects in Europe, BiG Grid in the Netherlands hosts “four core centers providing large scale data storage and compute facilities with over twelve distributed seed clusters for Life Sciences, and supporting more than 35 research communities.” One of the critical core centers, SARA, was founded in 1970 and is a national high-performance computing and e-Science support center as well as a supernode in the International Science Grid. SARA currently supports researchers with state-of-the-art integrated services, facilities and infrastructure as well as advanced networking, storage, visualization and broader e-Science services.

In 2009, SARA conducted a small-scale pilot experiment with five groups of scientific users to explore the possibilities of cloud computing in an HPC environment. The experiment proved to be a success, and BiG Grid decided that it was time to usher in a new phase for the Dutch scientific community by offering a still-experimental but much larger-scale HPC cloud as a service. The catch, of course, is that it is open only to members of the Dutch scientific community.

The new HPC cloud environment will give researchers a chance to operate within their very own virtual private HPC cluster, which can be fully and individually configured to each scientific team’s needs. The most attractive part of the offer for those who are selected is, of course, on-demand scalability. Participants will be able to start from images or build their own cluster from the bottom up. As an added bonus, users can copy their current software environment (from small or personal machines) and weave it into an HPC cluster operating within the cloud, without any expensive rewriting or dramatic changes between their development and production environments.

It won’t just be SARA evaluating the success of the HPC cloud service, of course. The large bevy of researchers from the Netherlands lining up to take part in the experiment will have their eyes peeled for challenges presented by the shift to the cloud, including possible performance strains. Once the experimental stage of migration and actual application use in the cloud is complete, it will no doubt be fascinating to read what issues emerged for both the infrastructure providers and the scientific community.

While the group states that this experimental phase is open to all Dutch scientific researchers, some members of the community will be given priority consideration. As the announcement reveals, the SARA team is particularly interested “in applications which are difficult or nearly impossible to run on the existing HPC platforms (Huygens, Lisa, Grid) but do run in one’s local environment.”

If SARA’s experiment with a larger and more diverse group of users is a success, it could prove to be a valuable proof of concept for other scientific HPC communities around the world. In fact, even if it’s not a total success and there are rampant complaints about any number of hurdles for either side, it’s still a success in that the program is setting the course for other national and large-scale research institutions. The challenges the participants faced can be clearly mapped out, and solutions to the barriers can be addressed in a manner that is focused and specific to the needs of scientific HPC users who want to take their capability to the next level.

Recently, HPC in the Cloud spoke with two of the leaders of the SARA cloud experiment, Tom Visser and Floris Sluiter. The following material includes some highlights from the discussion.

Currently, we are involving the Dutch scientific community to evaluate this service. At SARA we are convinced that the involvement of the community in the development of new types of infrastructure and services is a key success factor.

We are currently only offering this evaluation service to the Dutch community and people affiliated with them. We have received international requests for access in this phase, which we currently cannot accommodate because of the scale of the infrastructure and of funding structures. We have set up partnerships with sister institutes around the globe with whom we exchange experiences and development efforts. This has already led to the SARA-developed graphical management interface for OpenNebula (called ONE-MC), which is available as open source for the community. We will continue to actively share insights, experiences and results.

What are some of the scientific applications that could be selected for this beta, and what types of submissions are you seeing?

We are open to all kinds of suitable applications from the Dutch scientific community. We are especially interested in applications which are difficult or nearly impossible to run on our existing HPC platforms (supercomputers, compute clusters, GPU clusters, or grids), but do run on users' local systems. Examples include applications that need specific and custom libraries that are difficult to offer and maintain in a shared environment.

We don’t want to limit the kind of applications that users can evaluate; we have already seen new scientific approaches evolving from this new infrastructure. However, we will have to make a selection because of the limited scale of the current infrastructure. We strive for a good mix of different usage models, e.g. cloning laptops, large databases, simulation clusters, data mining, hybrid HPC, virtual private networking, etc.

Using cloud computing in our HPC setting has already elicited new modes of scientific research. For example, a scientist will be able to develop and fine-tune models on his own workstation. This workstation can be cloned and started as a virtual cluster in the HPC cloud. Now the scientist can run these models seamlessly on an HPC system.

Describe in some detail this HPC cloud environment and the virtual private cluster. What tools and technologies are you using in particular, and what level of configurability do you provide for researchers?

The cloud is hosted on a cluster with 128 cores with the following characteristics:

    * 16 compute nodes with dual quad-core CPUs
    * backed-up storage: 100 TB
    * Host Software
          o OpenNebula
          o Virtual machine software: KVM
          o Multicore/multiprocess is possible, also MPI and OpenMP.
          o Virtual Private Compute Cluster(c): starting multiple VMs in their own private network (vlan)
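
As a quick sanity check on the figures above, the following minimal Python sketch reproduces the 128-core total from the node count and CPU layout. The `vms_that_fit` helper and the 8-core VM size are illustrative assumptions, not part of SARA's tooling, and the estimate ignores memory limits and any cores reserved for the host.

```python
# Hardware figures taken from the list above.
COMPUTE_NODES = 16
SOCKETS_PER_NODE = 2   # "dual"
CORES_PER_SOCKET = 4   # "quad-core"


def total_cores(nodes: int, sockets: int, cores: int) -> int:
    """Physical cores in the cluster: nodes x sockets x cores per socket."""
    return nodes * sockets * cores


def vms_that_fit(cluster_cores: int, cores_per_vm: int) -> int:
    """Hypothetical upper bound on equally sized VMs, with no overcommit."""
    if cores_per_vm <= 0:
        raise ValueError("cores_per_vm must be positive")
    return cluster_cores // cores_per_vm


cluster = total_cores(COMPUTE_NODES, SOCKETS_PER_NODE, CORES_PER_SOCKET)
print(f"Total cores: {cluster}")
print(f"8-core VMs that fit (assumed VM size): {vms_that_fit(cluster, 8)}")
```

The arithmetic confirms that 16 nodes with dual quad-core CPUs account for the stated 128 cores.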

In this beta phase we strive to offer a small production grade environment and we are continuously improving and expanding the technical environment.

The management software we use is OpenNebula, and we have developed a web-based user interface (ONE-MC) on top of that. This software is also available under an open source license. As a community support system we use Redmine.

We have developed additional software to manage clusters of virtual machines to enable Virtual Private Compute Clusters.

Security is a major concern, particularly in a scientific setting. We’ve put various mechanisms in place to ensure that users are protected from the outside world and from other users, and vice versa.

Users are limited as little as possible. They will have their own Virtual Private Compute Cluster, which can be configured from scratch. They can actually start with an install CD of the operating system of their choice. It is also possible to start by uploading their own pre-configured virtual machine image. As a service we provide configuration templates and a community repository for virtual machines. In this repository people can share their own configuration templates and virtual machine images.

Because the HPC infrastructure and computing environment are fully configurable on demand to specific needs, users can save the time and effort of porting their applications to a specific HPC platform. In some cases porting can be impossible because the source code is unavailable to the user. In those cases especially, virtualisation can provide a solution. In general we are convinced that a shorter time from scientific question to computational solution will be facilitated by the use of our HPC Cloud.

We are convinced that HPC cloud computing provides a flexible solution to scientists and provides added value to the HPC ecosystem.
