ASC23: Meet the Teams

By Dan Olds

May 18, 2023

The ASC23 cluster competition was held in a basketball stadium on the campus of the University of Science and Technology of China, located in Hefei, China – a modest Chinese town of nine million.

The competition nearly filled up the floor of the stadium but gave the student teams a comfortable amount of room for their work tables and systems. With the size of the room, there wasn’t any problem with keeping the temperatures at a comfortable level, but with all of the systems running, the sound level was incredible.

We interviewed nearly all the teams and, aided greatly by our trusty translators, we were able to talk about their configuration choices, the competition, and the challenges they anticipated they’d be facing. We put in a couple of long days filming, but it was a lot of fun to meet the students and learn more about them.

Here are the teams we interviewed at the stadium:

Fuzhou University: This is the fourth ASC competition for Fuzhou, they previously competed at ASC18, ASC19, and ASC21. Their final configuration consisted of three nodes and six GPUs. We processed this one in black and white because our camera exposure was set way too high. So kind of a noir look on this one.

 

Jinan University: They are the defending champion from ASC21, but also participated at ASC19, ISC21 (scoring Bronze), and SC21. Sporting a three node, six GPU, cluster, they’re looking to repeat their success from the last ASC competition.

 

Lanzhou University: Competing in their second ASC event, Lanzhou is looking to drive their three node, six GPU cluster to glory. Well, no, they didn’t exactly say that, they basically said that they’d try to do their best, but I can read between the lines.

 

Peking University: This is the eighth cluster competition for Peking University and their third ASC appearance. They’re looking to unseat their Beijing cross-town rival Tsinghua University and take home some trophy hardware. The team is driving a cluster that is a departure from the three node, six GPU configurations we’ve been seeing so far. The Peking cluster is three nodes, accompanied by nine GPUs, which should certainly give them more performance on several of the tasks – but only if they can precisely control the power draw and stay under the 3,000 watt power cap.

 

Qilu Iniversity of Technology: In an extraordinarily washed out video, we talk to the team from Qilu University of Technology. They’re a first time competitor, but don’t seem intimidated by pressure at all. The team has configured a four node, eight GPU cluster which is significantly larger than the other competitors. Maybe this will give them the edge they need to make some waves at ASC23.

 

Qinghai University: Qinghai is making their third ASC appearance and is driving a three node, four GPU cluster. In the video, we interview the team leader, who discusses pre-competition nerves and the chance that they’re under-powered when it comes to hardware. The team might move to a dual-node, four GPU configuration, which should allow them to run full out on the applications but might not get them to the performance they need to triumph.

 

Shanghai Jiao Tong University: This is the 12th cluster competition appearance for Shanghai Jiao Tong University, making them one of the most experienced institutions in the competition. They have won three Silver medals, plus a Bronze medal in previous competitions, but haven’t brought home a trophy since ASC18. The team believes that the most difficult part of the challenge for them will be the AI-centric tasks.

 

Shanghai University: Another first-time competitor in student cluster competitions, Shanghai University has a mountain to climb. Not only are they here for the first time, but they’re outgunned on the hardware side with only four GPUs when most of the other teams are sporting six. To compensate, they’re running four nodes to give them a bit more CPU power, which should help on some of the applications. Which will help more is that some of their team members have had some real world HPC experience, which is something that most of the other teams lack. We’ll see if it pays off.

 

ShanghaiTech University: This is the seventh competition for the team from ShanghaiTech University. The team nailed down a Silver medal in their first competition at ASC18. They’re driving four nodes and eight GPUs. At the time of the taping, the team was trying to see if they could go with directly connecting the nodes together and thus be able to devote some extra power (as much as 300 watts) to their compute components. But in order to accomplish this, they’ll need to get their hands on some dual-port IB NICs, which, at this point, doesn’t seem to be in the cards. Gotta like the innovative thinking, right?

 

Shanxi University: Third time competitor Shanxi has settled on a four node, eight GPU, cluster after experimenting with several other configurations. Their biggest challenge in their mind will be the YLLM training task, which will require them to build a language model with 17.88 billion tokens. In the video, we discuss the challenges of power management and how important it is to practice their power management techniques before the competition begins.

 

Southern University of Science and Technology: At the time of filming, the team captain is uneasy about the status of their server. They spent a lot of time correcting a network and GPU configuration error, plus even more time getting their Spack packages up and running. They were relying on the ability to download the SW they needed off the web, but ran into trouble finding the specific  packages they needed. Ouch. However, they’re recovering well, as you’d expect from a team that has competed five times before. Good luck, SUSTech.

 

Taiyuan University of Science & Technology: This is the seventh competition for the team from Taiyuan. When we caught up with them, they were comfortable with their progress and felt that they had everything under control. They have a bit of a hurdle in the competition as they only have two nodes and four GPUs, which could be a little underpowered compared to the rest of the field.

 

Tsinghua University: This university has competed in a record 25 student cluster competitions world-wide, has won 13 Gold medals, plus six Silver medals and three Bronze. In other words, they know their way around a student cluster competition. However, this is a brand-new slate of students, which means you can throw the record book out the window. The team was originally planning to run four nodes and eight GPUs but ended up with a configuration of three nodes and six GPUs. In the video, we talk to the team leader about team preparation and their unique approach to workload management. Rather than strictly split up task and responsibilities between team members, this edition of Team Tsinghua has decided that everyone is going to work on everything – often at the same time.

 

University of Science & Technology of China: This team is representing the host institution, USTC, and is located in Hefei, China, on a beautiful campus. Team USTC has competed in nine previous events, taking home Silver and Bronze awards from the ISC 2014 and 2015 competitions. At the time of filming, the team is driving a cluster with four nodes and eight GPUs. The team believes that their biggest challenge will be to control the power draw of their cluster, which is certainly true.

 

Zhejiang University: At their sixth competition, Team Zhejiang radically departed from the rest of the field in their hardware choice:  a single node attached to a PCIe expansion box with eight GPUs. Purists will argue that this isn’t really a cluster, since it’s only a single node, but they’re here and competing, so let’s see what happens. While this config will scream on LINPACK, which is probably what the team is gunning for, it’s doubtful that it can adequately perform on the other, more CPU-centric, applications.  Zhejiang has successfully captured the LINPACK crown before with their “Suicide LINPACK” at ASC 2016 (they turned off all their fans, crossed their fingers, and ran a scorching fast HPL). With this system, they have to be the favorite to win HPL again this year.

 

That’s it for the in-stadium competition, but there’s also a virtual competition (with the same apps but utilizing AWS hardware) that features four non-mainland China teams, let’s meet them….

The Chinese University of Hong Kong: We did a quick virtual interview with this school. This is the fourth time the university had entered a team in the ASC competition, but the first time for these students.

 

Kasetsart University: The pride of Thailand, this is the fifth outing for the team from Kasestsart U. In the interview, we meet all of the team members and talk about their responsibilities in the competition. This is an entirely new team for Kasetsart, but they seem enthusiastic and ready for the fight. Using a cloud is also a new experience for the students, not to mention running HPC applications. So a lot of new experiences are ahead for the Kasetsart team.

 

National Tsing Hua University: NTHU has competed in a whopping 20 previous student cluster competitions, including the very first one at SC07. Over the years, the team has collected a lot of awards including four Gold medals, two Silver medals, two Bronze awards, and three Highest LINPACK titles. In our interview, the team says that they are ready for the upcoming challenges. In their minds, the most difficult application will be the YLLM (large language model) due to the size of the model. But this is an experienced team, with veterans from their ASC/ISC 2021 and championship SC2022 teams.

 

Universid ad EAFIT: This is the ninth student cluster competition appearance for the team from Colombia, albeit with new members. In addition to learning the applications, learning HPC, and learning how to navigate the cloud, the team also must contend with an 11-hour time zone difference. Yikes. In the interview, we discuss the competition, their experience in HPC, and why only two members of the team are named Santiago (they could have had more, I think). Since this is their first look at HPC, they feel they’ve had a slow start at getting familiar with the applications. But that’s a common story in student cluster competitions.

 

Now that we’ve met the teams, next up are some results from the titanic battles that took place, stay tuned….

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

ASC23: Application Results

June 2, 2023

The ASC23 organizers put together a slate of fiendishly difficult applications for the students this year. The apps were a mix of traditional HPC packages, like WRF-Hydro and FVCOM, plus machine learning centric programs Read more…

Q&A with Marco Pistoia, an HPCwire Person to Watch in 2023

June 2, 2023

HPCwire Person to Watch Marco Pistoia wears a lot of hats at JPMorgan Chase & Co.: managing director, distinguished engineer, head of global technology applied research and head of quantum computing. That work with J Read more…

HPC Career Notes: June 2023 Edition

June 1, 2023

In this monthly feature, we’ll keep you up-to-date on the latest career developments for individuals in the high-performance computing community. Whether it’s a promotion, new company hire, or even an accolade, we’ Read more…

Intersect360: HPC Market ‘Returning to Stable Growth’

June 1, 2023

The folks at Intersect360 Research released their latest report and market update just ahead of ISC 2023, which was held in Hamburg, Germany, last week. The headline: “We’re returning to stable growth,” per Addison Read more…

Lori Diachin to Lead the Exascale Computing Project as It Nears Final Milestones

May 31, 2023

The end goal is in sight for the multi-institutional Exascale Computing Project (ECP), which launched in 2016 with a mandate from the Department of Energy (DOE) and National Nuclear Security Administration (NNSA) to achi Read more…

AWS Solution Channel

Shutterstock 1493175377

Introducing GPU health checks in AWS ParallelCluster 3.6

GPU failures are relatively rare but when they do occur, they can have severe consequences for HPC and deep learning tasks. For example, they can disrupt long-running simulations and distributed training jobs. Read more…

 

Shutterstock 1415788655

New Thoughts on Leveraging Cloud for Advanced AI

Artificial intelligence (AI) is becoming critical to many operations within companies. As the use and sophistication of AI grow, there is a new focus on the infrastructure requirements to produce results fast and efficiently. Read more…

ASC23: LINPACK Results

May 30, 2023

With ISC23 now in the rearview mirror, let’s get back to the results from the ASC23 Student Cluster Competition. In our last articles, we looked at the competition and applications, plus introduced the teams, now it’ Read more…

ASC23: Application Results

June 2, 2023

The ASC23 organizers put together a slate of fiendishly difficult applications for the students this year. The apps were a mix of traditional HPC packages, like Read more…

Intersect360: HPC Market ‘Returning to Stable Growth’

June 1, 2023

The folks at Intersect360 Research released their latest report and market update just ahead of ISC 2023, which was held in Hamburg, Germany, last week. The hea Read more…

Lori Diachin to Lead the Exascale Computing Project as It Nears Final Milestones

May 31, 2023

The end goal is in sight for the multi-institutional Exascale Computing Project (ECP), which launched in 2016 with a mandate from the Department of Energy (DOE) Read more…

At ISC, Sustainable Computing Leaders Discuss HPC’s Energy Crossroads

May 30, 2023

In the wake of SC22 last year, HPCwire wrote that “the conference’s eyes had shifted to carbon emissions and energy intensity” rather than the historical Read more…

Nvidia Announces Four Supercomputers, with Two in Taiwan

May 29, 2023

At the Computex event in Taipei this week, Nvidia announced four new systems equipped with its Grace- and Hopper-generation hardware, including two in Taiwan. T Read more…

Nvidia to Offer a ‘1 Exaflops’ AI Supercomputer with 256 Grace Hopper Superchips

May 28, 2023

We in HPC sometimes roll our eyes at the term “AI supercomputer,” but a new system from Nvidia might live up to the moniker: the DGX GH200 AI supercomputer. Read more…

Closing ISC Keynote by Sterling and Suarez Looks Backward and Forward

May 25, 2023

ISC’s closing keynote this year was given jointly by a pair of distinguished HPC leaders, Thomas Sterling of Indiana University and Estela Suarez of Jülich S Read more…

The Grand Challenge of Simulating Nuclear Fusion: An Overview with UKAEA’s Rob Akers

May 25, 2023

As HPC and AI continue to rapidly advance, the alluring vision of nuclear fusion and its endless zero-carbon, low-radioactivity energy is the sparkle in many a Read more…

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

Leading Solution Providers

Contributors

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

ISC 2023 Booth Videos

Cornelis Networks @ ISC23
Dell Technologies @ ISC23
Intel @ ISC23
Lenovo @ ISC23
ISC23 Playlist
  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire