ALCF Summer Students Tackle Supercomputing and AI Research Projects

October 7, 2024

This summer, over 40 undergraduate and graduate students collaborated with ALCF mentors on innovative research projects, gaining firsthand experience with high-performance computing (HPC) and artificial intelligence (AI) for science.

ALCF intern Idunnuoluwa Adeniji discusses her research using virtual reality to explore large-scale scientific data at a poster session for students working in Argonne’s Computing, Environment, and Life Sciences directorate this summer. Image credit: Argonne Lab.

Oct. 7, 2024 — Every summer, the Argonne Leadership Computing Facility (ALCF), a U.S. Department of Energy (DOE) Office of Science user facility at DOE’s Argonne National Laboratory, hosts a new group of students to gain experience working on real-world scientific computing research.

“Our summer students get to work closely with experts in HPC on projects that harness powerful supercomputing and AI resources,” said Michael Papka, ALCF director and professor of computer science at the University of Illinois Chicago (UIC). “This collaborative environment helps these students advance their skills, preparing them to join the next-generation HPC and AI workforce.”

This year, over 40 students contributed to ALCF projects spanning from scaling up deep learning benchmark applications for exascale computing to compressing data for AI models that can give us insights into nuclear fusion. We spoke to four students about their work and experiences this summer at the ALCF.

Developing Digital Twins for Dexterous Robots

Athena Angara

Athena Angara, a rising senior studying data science at UIC, used virtual reality (VR) and augmented reality (AR) to help create digital twins of critical components of robotic arms. These models allow researchers to explore experiments that are generally considered too time-intensive or hazardous, and determine whether it’s possible to complete these experiments remotely and automatically.

Using Unreal Engine 5 and NVIDIA’s Unity, Omniverse, and Isaac Sim, Angara developed an immersive display system characterized by real-time, low latency, and high spatial resolution for remote control processes. She also converted data to ensure accurate representations between these digital platforms.

“I had several breakthrough moments during my research,” Angara said. “One of the most notable was figuring out how to send joint angles and all the data from the robot to Omniverse. I discovered that by implementing a custom data serialization protocol combined with a real-time data streaming framework, we could achieve seamless integration between the robot and the virtual environment. This allowed for accurate and responsive control within the simulation.”

For Angara, interdisciplinary collaboration was a cornerstone of her summer at the ALCF. Along with her ALCF mentor, Victor Mateevitsi, Angara worked with Silvio Rizzo, Joe Insley, Yunghuo Kim, Nicola Ferrier, and Papka. “Collaborating with experts from various fields has shown me the power of teamwork and diverse perspectives,” she said. “This experience has equipped me with the confidence and skills to tackle complex challenges in my future career. I now feel more prepared and inspired to pursue innovative projects that can make a real difference.”

Scaling Up Applications for Aurora

Colin Luangrath

Colin Luangrath’s work with the ALCF team centered on scaling the DLIO benchmark application, a tool that emulates the I/O patterns of modern scientific deep learning applications. He helped scale DLIO for use cases on Aurora, identified bottlenecks, and found solutions to mitigate issues with scaling to multiple nodes.

A rising sophomore studying computer science and psychology at the University of Wisconsin-Madison, Luangrath appreciated the opportunity to work with Argonne’s powerful supercomputing resources.

“Working on Polaris has been an incredible experience. I’ve learned a lot about HPC that I never would have experienced in any other field,” Luangrath said. “I felt like I had freedom to try things out and get hands-on experience with these systems and more time to learn about the technical side of things. It felt very collaborative.”

Luangrath said working with his ALCF mentor, Huihuo Zheng, challenged him to become a better problem solver. “Before this experience, I often made decisions without really thinking about what was going on, and just focused on fixing the problem. My mentor taught me a lot about debugging. I learned about investigating deeply and understanding why things are or aren’t working, rather than trying to find a quick workaround.”

Machine Learning for Molecular Dynamics Research

Hariharan Ramasubramanian

Hariharan Ramasubramanian, a Ph.D. student in mechanical engineering from Carnegie Mellon University (CMU), has focused his studies on computational materials modeling. This summer at the ALCF, he was able to explore new types of problems in his field, focusing on machine learning potentials (MLPs) for long-range systems.

In molecular dynamics simulations, MLPs are effective at modeling the structures and dynamics of complex systems. However, they struggle to handle charged species or systems with magnetization where the non-local effects become a major driving factor.

“It’s challenging to model a charged system with an interaction which has a land scale longer than its neighbors,” Ramasubramanian said. “In an ionic system, for instance, one might predict partial charges for each atom, which are then used to calculate long-range electrostatics. Similarly, various empirical electrostatics and dispersion baseline corrections can be incorporated. Our model can help address long-range interactions in such atomistic systems.”

Alongside his ALCF mentor, Álvaro Vázquez-Mayagoitia, Ramasubramanian researched introducing partial charges into the MLPs’ learnable descriptors.

“Working at the ALCF exposed me to new types of problems,” Ramasubramanian said. “Long-range interactions are not something we focus on at CMU. I got new understandings of materials modeling from this project.”

This summer also gave Ramasubramanian new insights into his career. “I got to see how national labs are different from academic settings. There’s more collaboration across fields, and it’s easier to meet new people,” he said.

Solving Problems in Nuclear Fusion with AI-enabled Disruption Prediction

Apollo Lee

In nuclear fusion research, the leading experimental device is the tokamak, which aims to achieve sustained reactions by using powerful magnets to confine hot plasma. Electron Cyclotron Emission Imaging (ECEI) is a diagnostic technique that provides detailed information on the confined plasma. Integrating ECEI data with machine learning models can enable researchers to predict future plasma states. This approach is crucial for minimizing disruptions that could damage the tokamak and halt experiments.

This summer, Apollo Lee, a rising junior studying electrical engineering at Stanford University, leveraged ALCF’s HPC resources to identify ways to better integrate ECEI data with predictive models. Alongside his ALCF mentor, Kyle Felker, Lee explored the application of both next-generation lossless and lossy compressors and signal filtering techniques.

“It’s a huge amount of data—terabytes upon terabytes,” Lee said. “If you’re looking to train machine learning models with it, it’s worth seeing how you could scale it down—of course, without losing any important information.”

Lee particularly enjoyed the HPC aspect of this project. “Working with Polaris has been awesome. It’s amazing to see what these supercomputers are capable of—they handle these huge datasets with ease.”

Reflecting on his summer, Lee said, “Working at Argonne this summer was an incredible experience. I was able to meet a lot of really talented and passionate people, and it kept things exciting every day. As someone who loves learning and tackling new challenges, and during my time here, I felt right at home.”


Source: Rachel Taub, Argonne

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Boffo HPC Conference Sessions: Leicester 2024

November 11, 2024

Leicester was the center of the HPC universe on October 15-16 as the HPC/AI Advisory Council and DiRAC hosted their sixth annual UK conference.The theme this year was “Democratising HPC and AI Opportunities” was co Read more…

Top500 Wild Cards Could Add Thrills to Supercomputing 2024 Show

November 11, 2024

The fantastic Supercomputing 2024 show is coming back to Atlanta this year. If last year was any indication, there will be plenty of coffee flowing and lots of discussions around power efficiency, storage, and the future Read more…

The Growing E-Waste Footprint of GenAI

November 7, 2024

The rapid advancement of digital technologies has led to the proliferation of electronic devices and systems, resulting in an alarming increase in electronic waste (e-waste).  GenAI, in particular, requires substantial Read more…

OSI Open AI Definition Stops Short of Requiring Open Data for LLMs

November 6, 2024

The movement toward open source AI made progress today when the Open Source Initiative released the first Open Source AI Definition (OSAID). While the OSAID provides one step forward, the lack of requirements around open Read more…

D-Wave Readies 4,400-plus-qubit Advantage2 System for Use

November 6, 2024

Quantum computing pioneer D-Wave today announced it had completed calibration and benchmarking the latest latest version of its Advantage2 quantum processor, a 4,400-plus-qubit device. D-Wave said that compared with the Read more…

Microsoft Azure & AMD Solution Channel

Join Microsoft Azure and AMD at SC24

Atlanta, Georgia is the place to be this fall as the high-performance computing (HPC) community convenes for Supercomputing 2024. SC24 will bring together an unparalleled mix of scientists, engineers, researchers, educators, programmers, and developers for a week of learning and sharing. Read more…

Bill Gropp on ‘Different Approaches to AI’

November 6, 2024

Around this same time last year, I expounded on what the “Future of AI” may entail. A lot has happened in the 12 months since then, including new approaches, new trends and, yes, new complications. A lot of the ne Read more…

Top500 Wild Cards Could Add Thrills to Supercomputing 2024 Show

November 11, 2024

The fantastic Supercomputing 2024 show is coming back to Atlanta this year. If last year was any indication, there will be plenty of coffee flowing and lots of Read more…

OSI Open AI Definition Stops Short of Requiring Open Data for LLMs

November 6, 2024

The movement toward open source AI made progress today when the Open Source Initiative released the first Open Source AI Definition (OSAID). While the OSAID pro Read more…

Bill Gropp on ‘Different Approaches to AI’

November 6, 2024

Around this same time last year, I expounded on what the “Future of AI” may entail. A lot has happened in the 12 months since then, including new approaches Read more…

Shutterstock 1179408610

Google Cloud Sporting a New Look in HPC and AI Hardware

November 5, 2024

It's raining hardware at Google Cloud, with the company making major upgrades in advance of bringing Nvidia's Blackwell GPUs into its fold next year. The upg Read more…

Go (Mountain) West, Quantum Workers! CU, CUbit, and Elevate Quantum Issue Workforce Roadmap

November 5, 2024

Last week the University of Colorado (Boulder), the CUbit Quantum Initiative, and the Elevate Quantum consortium released workforce roadmap for educating and bu Read more…

Collaboration Speeds Complex Chemical Modeling

November 4, 2024

A recent collaboration among researchers from HUN-REN Wigner Research Centre for Physics in Hungary and the Department of Energy's Pacific Northwest National La Read more…

High-Performance Storage for AI and Analytics Panel

October 31, 2024

When storage is mentioned in an AI or Big Data analytics context, it is assumed to be a high-performance system. In practice, it may not be, and the user eventu Read more…

Shutterstock_556401859

Role Reversal: Google Teases Nvidia’s Blackwell as It Softens TPU Rivalry

October 30, 2024

Customers now have access to Google's homegrown hardware -- its Axion CPU and latest Trillium TPU -- in its Cloud service.  At the same time, Google gave custo Read more…

Sorry, but nothing matches what you're looking for. Please try again with some different keywords.

Leading Solution Providers

Contributors

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire