You Can Lure Unicorns to Water, but You Can’t Make Them Drink

By Elizabeth Leake, STEM-Trek

May 30, 2019

Lessons learned from Practice & Experience in Advanced Research Computing

If you’ve spent much time cruising employment ads lately, you’ve probably noticed that certain research computing specializations are in high demand. Some university-based centers have had positions open for months; others, for years. It’s the same in densely populated communities that compete with regional industries as it is elsewhere. This shortage has forced managers and human resource professionals to explore novel ways to fill the prospect pipeline.

Few academic programs provide the practical knowledge necessary to support research computing, so some universities have begun to incorporate advanced skills training into the curriculum. But what do you call the course, and where should it live? If you’re just starting out, there are things to consider that are even more critical than the name; how you approach this effort could make or break your program.

In smaller schools, if you do manage to get a for-credit course approved, it’s possible that an administrator with no background in computer science (CS) will someday judge it unfairly, based on economics alone. How many students were served, and could that classroom be better utilized with a course that draws more paying customers? That question may also invite scrutiny of the return on investment of your computational cluster itself. Some administrators are unlikely to prioritize something with such immense power, network and personnel costs, especially when budgets are tight and an athletic program might be on the chopping block!

If you envision a course structured around the use of a cluster, in other words, if you want to train advanced computer science students to run, maintain and optimize workflows on it, you might draw 5-15 students in a 400-level CS program. Frankly, depending on the size of your resources and data center, that’s about as many as you’d want in a hands-on lab. If you’re only communicating with CS students, it could be called “Distributed and Parallel Computing.” Uber-geeks will understand what they’re in for. But if you do this, don’t encumber a full classroom. Occupy a conference room and pull them into the data center when it’s appropriate. That’ll keep the space auditors happy.

But if you want your program to grow, you should call it something that denotes employment potential and economic prosperity, for example, “Performance Computing for Research and Industry.” That title will resonate favorably with a broader range of prospective stakeholders (and advocates). At the master’s level, plan to train 5-15 the first year, with the goal of doubling that number after two years. A worthy goal for the future would be to attract interdisciplinary students—that’s where the magic happens in terms of scientific and engineering discoveries.

And, if you’re all-in for the academic approach, you might want to create an undergraduate-level, general education course with the same title. This might convene in a small classroom the first year, and move to an auditorium as the number of registrants grows. Open that course to all majors, targeting computationally curious students, with the lion’s share in CS, engineering, physics, bio, and business (in terms of allocations awarded, those are big users). That could serve as a prerequisite for those who would ultimately pursue any computationally intensive graduate program.

If this type of course is not established as an undergraduate, interdisciplinary gen-ed course from the beginning, it will invariably get political. Each college will begin to sponsor its own as the demand for computational knowledge increases across disciplines, and departments will make a power grab for seats that would otherwise go to CS, if that’s where the course is initially housed.

A gen-ed survey course should present applications for HPC in the full spectrum of industries so that more can envision the economic outcomes, in terms of research advances, startups that employ regional workers, collaborating industrial partners, and grant awards. You might incorporate a lecture about cloud computing, and how to determine if it’s a good fit for the workflow. I visited a company recently (1,000 employees; mostly technical) that has a data-intensive mission for which AWS Lambda plus cloud-based GPU computing is performing quite well. They have no interest in supporting HPC. They pay as they go—much like you’d pay for a utility—and they don’t have the capital burden. That wouldn’t work for universities whose mission is to prepare the workforce for a range of occupations, however. Those that do this well support a diverse portfolio of systems and services, including cloud. But understanding when it’s appropriate would prepare students for a cloud-exclusive scenario upon graduation, especially those who take jobs in industries where it’s normalized and they don’t need to employ people who can spell HPC.

There’s a lot to be said for vocational training. In that case, you can bypass academic credit hurdles and politics altogether. Most of the senior sysadmins I know, especially generalists capable of handling a range of tasks well, earned their stripes as student employees at one point. But plan to focus on quantity in anticipation of attrition, even among student employees. Students with LinkedIn profiles showing two or three years of in-house experience are getting noticed by talent scouts from the 14 big tech companies that recently waived degree requirements. While the starting salary is tempting to an undergrad who thinks that dropping out would reduce student debt, they need sound advice when it comes to comparing the cost of living between communities. Also, it’s difficult to return to school after leaving, and student loan repayment begins soon after departure. Many tech companies offer a combination of salary and stock, but that grass isn’t always greener.

Someone recently explained to me that a year after joining a big tech company, he realized that it wasn’t all he had hoped it would be. Although the move doubled his salary, one-third of it came in the form of stock that isn’t currently performing well. At the same time, he had to move to a region that costs three times as much as the one he left behind. When comparing lifestyles, he said, “I can’t eat stocks; I live in cramped quarters and there’s nothing left at the end of the month.”
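
For anyone advising a student who is weighing a similar offer, the arithmetic behind that anecdote can be sketched in a few lines of Python. This is only a rough illustration under loud assumptions: the function name is hypothetical, “one-third” is read as one-third of the new, doubled compensation, the stock is ignored entirely, and every expense is assumed to scale with the threefold cost of living. Real budgets are messier.

```python
def relative_cash_purchasing_power(salary_multiple, stock_fraction, cost_of_living_multiple):
    """Return cash pay after the move, expressed in old-region buying power.

    A result of 1.0 means "same buying power as the old salary."
    Assumption: stock is excluded and all spending scales with cost of living.
    """
    # Portion of the new compensation that actually arrives as cash.
    cash_multiple = salary_multiple * (1.0 - stock_fraction)
    # Deflate by how much more expensive the new region is.
    return cash_multiple / cost_of_living_multiple


# Figures from the anecdote: pay doubled, one-third in stock, region costs 3x.
ratio = relative_cash_purchasing_power(2.0, 1.0 / 3.0, 3.0)
print(f"Cash buying power relative to the old job: {ratio:.2f}")
```

Under those assumptions, the doubled salary buys less than half of what the old one did, which is consistent with having nothing left at the end of the month.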

By 2025, three-fourths of the world’s workers will have been born between 1977 and 1995. According to BVK Marketing research, this demographic is impatient; 91 percent expect to change jobs every three years. The gig culture, which gained popularity after the 2008 economic downturn, affects both employee and employer loyalty. A 2017 study by Intuit (the company behind TurboTax) projected that by 2020, 43 percent of the workforce would be temporary. While I devoted 22 years of service to Illinois’ public university system, and STEM-Trek’s Vice President, David Stack, recently retired after a long career in Wisconsin’s, such commitment and loyalty will be extremely rare in the future.

Quality of life is important to this demographic; if the stars aren’t in alignment where they land, they won’t stick around for long. While a competitive salary is important, university employers who can’t compete with industry would do well to focus on fringe benefits that they may have more control over, such as professional development and related travel, and the ability to work from home. In many cases, this is institutionally frowned upon, so it’s incumbent upon technical leadership to drive positive change on their campuses, and to offer peer support for such changes to others through professional organizations.

Do you have experience to share, or suggestions that I haven’t thought of?

Many would love to hear from you. Please continue the dialogue during a Practice & Experience in Advanced Research Computing (PEARC19) panel titled “Stop Chasing Unicorns in the Global Gig Economy,” Wednesday, July 31, 2019 in Chicago. In this panel, five senior research computing center directors will share lessons learned and road-tested recruitment and retention strategies. PEARC19 is July 28-August 1, 2019 in Chicago, Illinois; early registration ends June 23.

About the Author

HPCwire Contributing Editor Elizabeth Leake is a consultant, correspondent and advocate who serves the global high performance computing (HPC) and data science industries. In 2012, she founded STEM-Trek, a global, grassroots nonprofit organization that supports workforce development opportunities for science, technology, engineering and mathematics (STEM) scholars from underserved regions and underrepresented groups.

As a program director, Leake has mentored hundreds of early-career professionals who are breaking cultural barriers in an effort to accelerate scientific and engineering discoveries. Her multinational programs have specific themes that resonate with global stakeholders, such as food security data science, blockchain for social good, cybersecurity/risk mitigation, and more. As a conference blogger and communicator, her work drew recognition when STEM-Trek received the 2016 and 2017 HPCwire Editors’ Choice Awards for Workforce Diversity Leadership.
