Maria Grazia Giuffreda Discusses Leading the User Support Team at CSCS

By Simone Ulmer

February 27, 2019

After her Postdoc in Chemistry at ETH Zurich, Maria Grazia Giuffreda started working at CSCS in 2006 as a support specialist in the group responsible for User Support. In 2010 she was promoted to Group Leader of the User Support team, and then she became Associate Director and head of the User Engagement and Support Unit in 2013. As a group leader, she has now served the user community for about 9 years. In this interview, Maria Grazia Giuffreda gives insight into her challenging work, into the work of her team and, last but not least, into user development during the last decade.

Interview conducted by Simone Ulmer.

Maria Grazia, are you bored after nearly a decade of user support?

Maria Grazia Giuffreda: It is difficult to be bored in this role (laughs). CSCS has always been a very dynamic workplace. With enthusiasm and readiness to get involved and be challenged, there is really no space for boredom. I cannot remember a single quiet year since I have taken over User Support. Every couple of years there is a new flagship system being installed or upgraded, at other times there is a new user community coming in, a new software stack, new services. Frequent changes are common in High-Performance Computing and data centers that are at the forefront of innovation and technology, like CSCS.

What does a Head of User Engagement and Support at CSCS do? What does your normal day look like?

I am responsible for the User Lab Program, including proposal submissions, and I am the liaison with the User Lab scientific community. Furthermore, I have account managers reporting to me on paying customers, and I am responsible for PRACE Tier-0 proposal calls. On a typical day I receive several emails from PIs and users asking me questions that need my attention, and I am involved in multiple meetings with colleagues and other leadership team members. I coordinate the activities of two groups, Scientific Computing Support and Compute and Data Services Support, with weekly discussions with the group leaders. I am also responsible for the account management team who is taking care of the relationships between CSCS and the paying customers, making sure that needs and expectations from both parties are met. Additionally, I supervise help desk activities, problem intervention, and user communication.

You mentioned PRACE, the Partnership for Advanced Scientific Computing in Europe, where CSCS is a hosting member of a so-called Tier-0 system. What does this mean?

The objective of PRACE is to enable high-impact scientific discovery and engineering research and development across all disciplines, to enhance European competitiveness for the benefit of society, and to provide a persistent pan-European High-Performance Computing service and infrastructure. Being a hosting member in this organization means that CSCS is offering world class computing and data management resources to scientists all over Europe and seeking to promote challenging and ambitious science. In PRACE, Swiss scientists can receive access to extreme-scale computing resources of different architectures. Together with Switzerland, currently the other hosting members are France, Italy, Germany and Spain.

Are there other European collaborations involving CSCS?

The beauty of working in this field is that science has no borders. Scientists and experts in extreme computing and data science thrive on collaborations and joint ventures. This is why CSCS is involved in a number of European and international collaborations, such as the European Centers of Excellence for HPC applications, MaX (Materials at eXascale), ESiWACE2 (Excellence in Simulation of Weather and Climate in Europe), the Human Brain Project, MAESTRO (Middleware for memory and data-awareness in workflows), and PLAN-E (Platform of National eScience/Data Research Centers in Europe), to name a few.

Users from the User Lab scientific community apply for free resources, but customers are users buying computing time without application?

Academic users can get access to computational and data resources for free, but they have to present high-quality projects that their peers deem to be worth pursuing. In particular, CSCS organizes two national calls for proposals each year and participates in two annual European PRACE Tier-0 calls. Proposals submitted to the national calls are first scrutinized by in-house experts for their technical soundness and feasibility and then sent to two scientific reviewers from academic institutions abroad. Based on these assessments, an independent expert committee ranks the proposals and makes recommendations on allocations of computer time, which the Director of CSCS has so far always followed in making final decisions. The painstaking procedure is designed to guarantee that all projects be treated equally and that all promising projects can be implemented on high-performance computers. Alternatively, users have the choice to buy resources and so become paying customers of CSCS. Allocations are then granted without peer review; however, used funds will typically come from funding organizations that implement their own selection process.

We are talking about 1,500 users. How many people take care of them, or in other words, how large is your team?

There are 20 members in the User Engagement and Support Unit. This might look like a lot but, actually, being part of this team does not mean that all we do is answer tickets from users. The team does tremendous work to keep the system healthy from a user’s point of view. The team recently assembled a professional regression suite, known as Reframe, to check the status of the system, and they have been so successful that other centers are starting to adopt their work flow and other teams within CSCS are starting to use it for their own daily work. The team has automated the installation of applications and scientific libraries such that, whenever there are major upgrades, we can easily reinstall and recompile our supported software stack. The team also prepares procedures for users to help them install their own applications easily. Furthermore, team members are working on benchmark suites for the production system and they are looking into new cloud services that CSCS is starting to provide, including continuous integration and interactive computing. What perhaps is not clear is that being part of the support team comes with a huge responsibility; after all, we take care of the core business of CSCS: If users are happy, we are happy, and we can consider ourselves successful.

“User Engagement” sounds like a challenge.

It depends on what we mean by User Engagement. For me it means to have an open channel with the user community. I am very excited by the User Lab Day, which we re-introduced in 2018. It is important that members of our user community know that there is an opportunity to come and discuss with us their wishes and their requests, and to make sure that they understand that we value their opinion. On the other hand, it is also of absolute importance for CSCS to reach out and present new services being offered. In my opinion, we cannot detach ourselves from our users. We need to make sure that we convey our messages, inform about our strategies, visions, and services, and work together with users who play a vital role in ensuring our success through supporting their outstanding science in the most effective way.

I assume there are users with a lot of experience as well as newcomers. How specific to a particular person is the user support?

The very experienced users are often considered collaborators more than anything else. They come to us with very high-level issues that regularly require the effort of both parts to diagnose and to solve, but we also get in touch with them when we need their help, for example when testing new services in pre-production.  Newcomers are more likely to require our assistance to get started. There are numerous ways in which we support them, for instance with an interactive tool on our user portal that generates job scripts custom-tailored to their needs. We also offer webinars that help them get started at CSCS. Our webpage provides instructions and information on a range of topics. Furthermore, we offer courses that are relevant to new users. I think the bottom line is that all users are important. There are no silly or intelligent questions, there are just questions; and we are there to help our users to get the most out of our resources.

Has your job changed over the years, or has it stayed pretty much the same?

The job has certainly changed as it needs to adapt to the rapid advances in hardware and software technology. Responsibilities and even the strategy of CSCS as an organization may change whenever new services are implemented. I am always behind my team and find it very important to discuss changes in daily work as well as in medium-term goals. This may not be as obvious for other units, but whatever we do has an immediate impact on our user community. Whatever we deploy as tools, any new service has to be robust, well-thought out and well-planned. We do not have the freedom to simply test and see how it goes, because the impact on the users will be immediate and non-negligible. Our services are evolving, and therefore also are the responsibilities that come with it. Even in the user program, I face challenges at times when new scientific disciplines join our user community. Their requirements may be quite different from those that we are used to, and we may need to implement new tools, software, and services and even adapt proposal submissions.

In addition to user support, another challenging part of your work is your involvement in the distribution of computing time. You are a kind of “interface” between user and Scientific Advisory Board. What is the biggest challenge for you in that role?

This is something that I really enjoy doing. I like to look at proposals and find expert scientific reviewers, even though this requires a lot of time and concentration. We have an excellent Scientific Advisory Board who meticulously discuss every proposal based on technical assessment at CSCS and scientific review. The biggest challenge for me is to convey the right messages to the applicants concerning the outcome of their project proposals. There is a lot of competition for HPC resources on Piz Daint, and therefore only the very best projects are granted full allocation. Lower-ranked proposals need to be cut in allocation and some proposals need to be rejected. Proper response to the latter is not always easy, however, it is important to let applicants know why their proposals were cut or rejected so they rest assured that their proposal was considered seriously and they find ways to improve their chances in future calls.

In addition to the daily business of your group, CSCS offers a wide range of training courses for users. Can you briefly name the most important ones?

Every year we develop a training program that includes new offerings covering new tools and technology, as well as courses we have repeatedly offered over the years due to their proven importantance to many of our users. In 2019 we are offering courses on: Distributed TensorFlow, Scientific Python, GPU programming, OpenACC (in our Summer School), Interactive supercomputing (Jupyter and similar services), Advanced C++, and HPX as well as visualization.

Is the large spectrum of courses a consequence of the increasingly complex technologies and the increasingly complex scientific questions that researchers want to solve with the help of simulations, or are there other reasons?

We are trying to help our users to deploy our production systems in the most effective way, therefore we offer courses in parallel computing on GPUs among others. On the other hand, we also want to make users aware of new technologies and new services that they might not be aware of but may prove useful to them, or that they might know of but not in as much detail as is necessary to tap their full potential.

Has the user behavior changed during your many years of experience?

Of course, users always want to do their research as soon as possible, and, if possible, just “now”. Still I have noticed growing awareness of the complexity of running a computer center successfully, of offering stable and reliable HPC and storage resources. Users have definitely shown a growing readiness to collaborate with us and contribute to making our services as useful as possible. I think that these days, more than ever before, they see us as their peers, not on a scientific level, but for issues regarding the technical realization of their scientific projects.

As you mentioned earlier, CSCS re-introduced the user day in 2018. The annual user meeting offered — besides a scientific presentation of ETH-professor Vanessa Wood and insights into the work of CSCS behind the curtain — for the first time workshops on various topics, at which CSCS experts answered questions from the audience. This was very well received. What was the trigger for the new programme, which offered plenty of room for discussion?

We want to reach out to our users. We wish to make them aware of new services in place at CSCS. In other words, we need to move forward but we want the community to evolve with us and not be left behind to catch up only later. The new format establishes better communication needed for CSCS to know about changing user needs and for users to learn about future plans of CSCS. We also need to reach out to scientists that have not yet used HPC but might well benefit of it.  The User Lab Day is the day where everybody can “meet the Swiss National Supercomputing Centre” to openly discuss wishes, visions, and services.

Will next year’s programme again have such a broad spectrum of topics or has the success inspired even more new ideas?

We will certainly repeat the format with parallel sessions to cover those topics that are important for CSCS and for our users. We are finalizing the program based also on feedback from the participants and the users.

If you made a wish for the CSCS users, what would it be?

Looking at the future, I wish for a community of users and customers that continues to be open-minded, like only scientists and dreamers can be, that embraces whatever new technologies and evolutions come our way, and that willingly accepts the challenges and the opportunities rather than looking back and wishing for what can no longer be, as nice as it might have been.

Originally published by CSCS. Feature image by Alessandro Della Bella.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Anders Dam Jensen on HPC Sovereignty, Sustainability, and JU Progress

April 23, 2024

The recent 2024 EuroHPC Summit meeting took place in Antwerp, with attendance substantially up since 2023 to 750 participants. HPCwire asked Intersect360 Research senior analyst Steve Conway, who closely tracks HPC, AI, Read more…

AI Saves the Planet this Earth Day

April 22, 2024

Earth Day was originally conceived as a day of reflection. Our planet’s life-sustaining properties are unlike any other celestial body that we’ve observed, and this day of contemplation is meant to provide all of us Read more…

Intel Announces Hala Point – World’s Largest Neuromorphic System for Sustainable AI

April 22, 2024

As we find ourselves on the brink of a technological revolution, the need for efficient and sustainable computing solutions has never been more critical.  A computer system that can mimic the way humans process and s Read more…

Empowering High-Performance Computing for Artificial Intelligence

April 19, 2024

Artificial intelligence (AI) presents some of the most challenging demands in information technology, especially concerning computing power and data movement. As a result of these challenges, high-performance computing Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that have occurred about once a decade. With this in mind, the ISC Read more…

2024 Winter Classic: Texas Two Step

April 18, 2024

Texas Tech University. Their middle name is ‘tech’, so it’s no surprise that they’ve been fielding not one, but two teams in the last three Winter Classic cluster competitions. Their teams, dubbed Matador and Red Read more…

Anders Dam Jensen on HPC Sovereignty, Sustainability, and JU Progress

April 23, 2024

The recent 2024 EuroHPC Summit meeting took place in Antwerp, with attendance substantially up since 2023 to 750 participants. HPCwire asked Intersect360 Resear Read more…

AI Saves the Planet this Earth Day

April 22, 2024

Earth Day was originally conceived as a day of reflection. Our planet’s life-sustaining properties are unlike any other celestial body that we’ve observed, Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that ha Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use o Read more…

MLCommons Launches New AI Safety Benchmark Initiative

April 16, 2024

MLCommons, organizer of the popular MLPerf benchmarking exercises (training and inference), is starting a new effort to benchmark AI Safety, one of the most pre Read more…

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

April 15, 2024

As the AI revolution marches on, it is vital to continually reassess how this technology is reshaping our world. To that end, researchers at Stanford’s Instit Read more…

Intel’s Vision Advantage: Chips Are Available Off-the-Shelf

April 11, 2024

The chip market is facing a crisis: chip development is now concentrated in the hands of the few. A confluence of events this week reminded us how few chips Read more…

The VC View: Quantonation’s Deep Dive into Funding Quantum Start-ups

April 11, 2024

Yesterday Quantonation — which promotes itself as a one-of-a-kind venture capital (VC) company specializing in quantum science and deep physics  — announce Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Baidu Exits Quantum, Closely Following Alibaba’s Earlier Move

January 5, 2024

Reuters reported this week that Baidu, China’s giant e-commerce and services provider, is exiting the quantum computing development arena. Reuters reported � Read more…

Shutterstock 1179408610

Google Addresses the Mysteries of Its Hypercomputer 

December 28, 2023

When Google launched its Hypercomputer earlier this month (December 2023), the first reaction was, "Say what?" It turns out that the Hypercomputer is Google's t Read more…

AMD MI3000A

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Leading Solution Providers

Contributors

Shutterstock 1606064203

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

China Is All In on a RISC-V Future

January 8, 2024

The state of RISC-V in China was discussed in a recent report released by the Jamestown Foundation, a Washington, D.C.-based think tank. The report, entitled "E Read more…

Shutterstock 1285747942

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

Eyes on the Quantum Prize – D-Wave Says its Time is Now

January 30, 2024

Early quantum computing pioneer D-Wave again asserted – that at least for D-Wave – the commercial quantum era has begun. Speaking at its first in-person Ana Read more…

GenAI Having Major Impact on Data Culture, Survey Says

February 21, 2024

While 2023 was the year of GenAI, the adoption rates for GenAI did not match expectations. Most organizations are continuing to invest in GenAI but are yet to Read more…

The GenAI Datacenter Squeeze Is Here

February 1, 2024

The immediate effect of the GenAI GPU Squeeze was to reduce availability, either direct purchase or cloud access, increase cost, and push demand through the roof. A secondary issue has been developing over the last several years. Even though your organization secured several racks... Read more…

Intel’s Xeon General Manager Talks about Server Chips 

January 2, 2024

Intel is talking data-center growth and is done digging graves for its dead enterprise products, including GPUs, storage, and networking products, which fell to Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire