URISC@SC17 and the #LongestLastMile

By Elizabeth Leake, STEM-Trek Nonprofit

January 11, 2018

A multinational delegation recently attended the Understanding Risk in Shared CyberEcosystems workshop, or URISC@SC17, in Denver, Colorado. URISC participants and presenters from 11 countries, including eight African nations, 12 U.S. states, Canada, India and Nepal, also attended SC17, the annual international conference for high performance computing (HPC), networking, storage and analysis that drew nearly 13,000 attendees. Von Welch (Indiana University), who directs the Center for Trustworthy Scientific Cyberinfrastructure, provided expert oversight for the URISC program. Welch invited nine specialists who presented open-source tools and cybersecurity best practices.

URISC Presenter Nick Roy, Director of Technology and Strategy for Internet2’s InCommon Federation, explained eduGAIN and its benefits to the global research community. “From a local management standpoint, eduGAIN saves managers time and effort because home credentials provide authentication and access to resources, instrumentation and data that are physically located at institutions in 48 member countries that comprise an interfederated trust fabric,” said Roy. “It’s more secure, and takes less time to manage since researchers must only remember one user name and password,” he added.

1 eduGAIN member map. Key: dark-blue indicates eduGAIN membership, green are voting-only, and aqua indicates “candidate” sites.

While eduGAIN’s convenience and added security would be welcome in the many resource-constrained regions represented by URISC delegates, it was difficult for some to imagine that they could ever engage; there are many physical and financial barriers to entry.

For more than 50 years, HPC has supported tremendous advances in all areas of science. But densely-populated communities can more easily support subscription-based commodity networks and energy infrastructure that make it more affordable for urban universities to engage globally. Research centers based in sparsely-populated regions are extremely disadvantaged. There are fewer partners with which to cost-share connectivity, and copper thieves make it challenging to sustain infrastructure in the poorest regions. Their universities have a more difficult time recruiting and retaining skilled personnel, who must travel further for training. In some cases, local consumer prices are 70-80 percent lower, so hardware and software purchases are comparatively inflated; everything is shipped from developed countries, which further increases the cost.

But these regions reflect globally-significant human capacity, environmental factors, biodiversity, geology and minerals. Each site has a unique perspective of our universe, and less-populated areas offer the most detailed and unfettered vantage points. We can’t expect rural universities to pay for the pipe used by the rest of the world, however. The effort will require global cooperation, with broad public and private financial support. When researchers everywhere can access data generated by and stored at these sites, progress will be accelerated toward solutions to problems that impact global climate, environment, food and water security, public health, quality of life, and world peace.

Justifying #LongestLastMile engagement one case at a time…

The pan-European data network for research and education, GÉANT, with U.S. stakeholders, forged the pathway that originally made eduGAIN possible. It was conceived by the global High Energy Physics (HEP) community whose users required access to HEP instrumentation and data located in the U.S. (Laser Interferometer Gravitational-Wave Observatory, LIGO) and Europe (Large Hadron Collider at the European Organization for Nuclear Research, LHC-CERN).

The Office of CyberInfrastructure and Computational Biology at the National Institute of Allergy and Infectious Diseases (NIAID, part of the U.S. National Institutes of Health) is another such driver, and NIAID Chief Information Officer Michael Tartakovsky is eager to accommodate more global researchers who are fighting infectious diseases.

NIAID supports centers in Mali and Uganda that provide support and services for collaborations working on treatments and vaccines for malaria, Ebola, and tuberculosis (TB) via eduGAIN and GÉANT’s research and education federation (REFEDS R&S). Beyond Africa, NIH looks forward to providing access to research staff at Fudan University when the China Federation joins eduGAIN. They are also working with the Indian Federation and its National Institute for Research in TB. “By joining the global trust federation network, we can all work together to solve the most daunting global infectious disease challenges,” said Tartakovsky.

The computational biology community is working to solve the world’s direst grand challenges. South African Computational Biologist Nicola Mulder’s group from the University of Cape Town’s (UCT) Institute of Infectious Disease and Molecular Medicine is analyzing sequence data from African human genomes that are of critical importance to public health and food security research. Until the South African Centre for High Performance Computing (CHPC) introduced the Lengau supercomputer in 2016, UCT ran computations in the U.S. on Blue Waters at the National Center for Supercomputing Applications (NCSA). “We had access to NCSA computing facilities and then returned the processed data to South Africa; the processing and transfer took months to complete,” said Mulder.

Global energy demands will rely on a larger supply from alternative sources in the future, and Africa is expected to play a major role in energy production and innovation. “The need to power portable electronic devices and manage peaks and valleys associated with solar and wind energy will require more advanced battery storage solutions that will likely require minerals and rare earths that are abundant in sub-Saharan Africa,” said Principal Researcher Rapela Regina Maphanga (South African Council for Scientific and Industrial Research (CSIR), Modelling and Digital Science Division).

The global astrophysics and astronomy communities are watching sub-Saharan Africa with great anticipation. The Square Kilometer Array (SKA) is being built in the great Karoo region of South Africa and will be the world’s biggest radio telescope. With an expected 50-year lifespan, SKA is investing in regional infrastructure and human capital development, but SKA can’t do it alone; African infrastructure to serve the global research needs of the future will require a much larger investment.

In their SC17 keynote, Professor Phil Diamond (SKA Organization Director General) and Dr. Rosie Bolton (SKA Regional Centre Project Scientist) described the SKA project and its computational challenges. For the first phase of the project, which represents a fraction of what it will be in the future, the total processing power required in the SKA observatory’s Science Data Processors is about 250 PF (peak). Each SKA site is expected to generate up to 1 PB of data each day during full operations (from about 2026). SKA data will be globally-distributed to SKA “Regional Centres” which will provide researchers with access to data for analysis and processing. The design of this federated network is an interesting challenge since it will likely also support users from other observatories and even from other science disciplines as part of the HPC and networking infrastructures supported in each country or region.
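To put the 1 PB per day figure in networking terms, the short sketch below converts a daily data volume into the sustained link rate needed to carry it. It is only a back-of-the-envelope illustration: it assumes decimal units (1 PB = 10^15 bytes), ignores protocol overhead, compression and retransmission, and the function names and the 10 Gbps link are hypothetical rather than anything drawn from SKA documentation.

```python
SECONDS_PER_DAY = 24 * 60 * 60
BYTES_PER_PB = 10**15  # decimal petabyte (an assumption; not stated in the article)


def sustained_rate_gbps(petabytes_per_day: float) -> float:
    """Average rate, in gigabits per second, needed to move a daily volume."""
    bits_per_day = petabytes_per_day * BYTES_PER_PB * 8
    return bits_per_day / SECONDS_PER_DAY / 1e9


def transfer_days(petabytes: float, link_gbps: float) -> float:
    """Days needed to move a volume over a fully utilized link of the given rate."""
    bits = petabytes * BYTES_PER_PB * 8
    return bits / (link_gbps * 1e9) / SECONDS_PER_DAY


if __name__ == "__main__":
    # 1 PB per day is the per-site figure quoted for full SKA operations.
    print(f"{sustained_rate_gbps(1.0):.1f} Gbps sustained")
    # The 10 Gbps link is a hypothetical example, not a measured path.
    print(f"{transfer_days(1.0, 10.0):.2f} days per PB at 10 Gbps")
```

Under these assumptions, a site producing 1 PB per day would need roughly 92.6 Gbps of sustained throughput just to keep pace, and a hypothetical 10 Gbps path would take a little over nine days to move a single day's output.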

With SKA’s presence in South Africa, a larger astro research presence will begin to take root in the region that will demand access to the global treasure-trove of data currently generated by six telescopes supported by the U.S. National Science Foundation, and complementary instrumentation, such as the Murchison Widefield Array (MWA), a precursor to SKA, in Western Australia at the Murchison Radio-astronomy Observatory (MRO).

LIGO’s Identity and Access Management (IAM) Architect Scott Koranda (University of Wisconsin at Milwaukee which first piloted eduGAIN in 2014) said that MWA is establishing a new IAM infrastructure that is built on federated identity. Their services are published in the Australian Access Federation (AAF) and will soon be “pushed” into eduGAIN. “The eduGAIN component is important because MWA, like SKA, is a global project with scientists who live in and work from many countries,” said Koranda.

The important role NRENs play and their status in sub-Saharan Africa

African regional-serving universities benefit from fast and affordable bandwidth delivered via National Research and Education Networks, or NRENs, that engage with larger networks, such as the UbuntuNet Alliance in eastern and southern Africa, and WACREN in Western and Central Africa, to deliver more advanced service options. The major backbone then allies with Internet2 in the U.S. and GÉANT in Europe. Through this complex fabric of trust, it’s possible for NRENs to deliver eduGAIN service.

But, as was explained, developing an NREN from scratch is challenging for stakeholders in sparsely-populated, resource-constrained regions. In his December presentation to the Southern African Development Community (SADC) Cyberinfrastructure Forum that was co-located with the South African Centre for High Performance Computing’s (CHPC) National Meeting in Pretoria, SANReN’s Director Leon Staphorst cited a 2016 World Bank Report by Michael Foley titled, “The Role and Status of NRENs in Africa.” The document serves as an important guide for those who wish to develop, use or fund an NREN.2

Photo: Leon Staphorst (SANReN)

Staphorst shared a table of progress being made toward African NREN development. Among nations represented at URISC that participate in the African HPC Ecosystems and SKA Readiness Projects (see map and slide excerpt below), South Africa is the only country whose researchers use eduGAIN (through relationships with GÉANT, SANReN and SAFIRE). In South Africa’s case, the HEP community’s need to reach LIGO in the U.S. and the LHC in Geneva, Switzerland was a driver, with biomed demand a close second; specifically, access to a global TB research protocol required by scientists at the University of Cape Town and Stellenbosch University.

Next in the queue among HPC Ecosystems sites that are prospective eduGAIN members, given the operational status of their NRENs and subsequent engagement with UbuntuNet, are Ethiopia, Kenya and Zambia. It’s likely that Madagascar and Namibia will be next, followed by Botswana, Mauritius, and Mozambique.

3 HPC Ecosystems Project footprint

 

4 HPC Ecosystems Project sites/NREN Status

It can still require a considerable amount of time to move big data around the world, however. “African network traffic is currently routed via Europe before it travels to the U.S., and elsewhere,” said Julio Ibarra (Florida International University AVP for Technology Augmented Research). “Depending on the amount of data transferred among eduGAIN’s 48 member nations, the distance and number of Internet exchange sites along the way could cause significant delays,” he added.

Ibarra’s HPC On Common Ground @SC16 workshop presentation described a collaborative effort to facilitate “big data” transfers through the development of international software defined exchange points (SDX). The “AtlanticWave-SDX” is an NSF-funded project at Florida International University and the Georgia Institute of Technology, with support from Brazil’s NREN, Rede Nacional de Ensino e Pesquisa (RNP), and the Academic Network of Sao Paulo (ANSP). An SDX enables a domain scientist connected to an SDN network to use the network more intelligently; e.g., scheduling use when resources are available, or requesting a more favorable path.

In the future, Ibarra’s group hopes to explore the feasibility of establishing an SDX in West Africa, in collaboration with African NRENs, based on future availability of submarine cable spectrum for use by research and education communities between Western Africa and Brazil (scheduled 2018 and beyond, per Foley’s report).

5 Image from GÉANT website.

Success, speed and reliability require some magic in the middle…

Irrespective of regional networks and the IAM infrastructure deployed at each site, moving and sharing massive amounts of data around the world requires a certain amount of geopolitical cooperation, compatible middleware and universally-adopted toolkits. One such resource is Globus which can securely and, more importantly, reliably transfer data in many scenarios where network availability and quality is highly variable. “Globus has already been successfully used on H3ABioNet to move and share data among far-flung research groups in Africa, and is currently being evaluated for broader adoption by a number of institutions in South Africa,” said Globus Co-Founder Ian Foster (University of Chicago).

With supercomputers capable of processing trillions of calculations per second, it’s unreasonable that critically-important research processes still require days or even months to complete. In light of current and anticipated global grand challenges, an accelerated process of discovery is fundamentally important to future generations’ prosperity, health and social stability. Developing the e-Infrastructure and human capital that serve the #LongestLastMile will require a globally-collaborative endeavor and investment.

About URISC@SC17

URISC@SC17 was STEM-Trek Nonprofit’s third SC co-located workshop. Last year’s “HPC On Common Ground @SC16” program in Salt Lake City featured a food security theme. The SC17 program was led by Elizabeth Leake (STEM-Trek) and Von Welch (Indiana University), and was financially supported by U.S. National Science Foundation grants managed by Indiana University and Oklahoma State University, with STEM-Trek donations from Google, Corelight, SC17 General Chair Bernd Mohr (Jülich Supercomputing Centre) and SC17 Inclusivity Chair Toni Collis (U-Edinburgh).

Thank you!

STEM-Trek wishes to thank URISC collaborator Von Welch (Indiana University/CTSC), the planning committee from IU and CHPC South Africa, reviewers, financial and in-kind sponsors, and presenters—especially Nick Roy (InCommon) who inspired this article. We appreciate delegates who took time to apply for and attend our workshop, and all who covered the bases in their absence at home.


1. Map available on the eduGAIN site (accessed December 2017).

2. Foley, Michael. 2016. The Role and Status of National Research and Education Networks in Africa. SABER-ICT Technical Paper Series; World Bank, Washington, DC. © World Bank. https://openknowledge.worldbank.org/handle/10986/26258 License: CC BY 3.0 IGO.

3. HPC Ecosystems Project Footprint, image provided by the South African CHPC.

4. NREN Status in HPC Ecosystems Project sites (per Foley report via Dec. 2017 Staphorst CHPC presentation)

5. GÉANT member sites, accessed from the GÉANT site, December 2017.
