BirdFlow AI: How Supercomputers and AI Unveil the Secrets of Migratory Bird Patterns

May 15, 2023

May 15, 2023 — Migratory birds are critical for the health of human agriculture and the environment. But our knowledge of their movements is surprisingly little. Using “snapshots” based on Cornell University’s eBird database, scientists at the University of Massachusetts created BirdFlow, an artificial intelligence (AI) that accurately predicted migratory movements. Critical for the AI’s accuracy, the team first used PSC’s Bridges-2 system to remove observation biases from eBird and fill in gaps in the data.

Predictions made by the BirdFlow AI on how the American woodcock migrates in North America between eBird “snapshots.” Credit: Fuentes, M., Van Doren, B. M., Fink, D., & Sheldon, D. (2023).

Migratory birds play an important role in our environment — and our dinner plates. By eating an astounding 400 to 500 million tons of insects each year, songbirds help protect the crops that humans and our domestic animals eat. By eating mice and rats, hawks protect our stored grain as well as reduce the number of disease-carrying ticks and rodent-borne pathogens. And that’s not even counting birds’ importance as indicators of environmental health.

It may surprise you, then, to hear that our information on bird migration is … hazy. That’s why scientists at Cornell University launched eBird, an online project to collect data on bird sightings by amateur enthusiasts worldwide. The project has given researchers a tidal wave of data on where certain birds can be found at what times of year. But the database only gives snapshots, not actual bird movements.

“The idea of bringing these sorts of methods to the problem of bird migration is something that Dan [Sheldon] has had on his mind for some time … The quality of the data and the access to computation had both improved pretty drastically since he first attempted the project many years ago,” said Miguel Fuentes, a graduate student in Daniel Sheldon’s group at the University of Massachusetts (UMass) Amherst.

Fuentes teamed up with Benjamin Van Doren and Daniel Fink of the eBird team at the Cornell Lab of Ornithology to develop BirdFlow, an AI-based tool to fill in the gaps of eBird. They used PSC’s flagship, NSF-funded Bridges-2 supercomputer to handle the massive computation and data movement needed to prepare eBird data for the AI.

As long ago as 2021, eBird had collected a billion bird observations, drawn from 77,466,000 checklists submitted by 684,300 volunteers in 202 countries. The data present a series of snapshots that, for all the world, look like stills from a video. But this view could be deceptive. Just because a type of bird is seen in one place on one date and a distant place a month later doesn’t mean that the species traveled between those two places — let alone that it’s the same bird.

Fuentes wanted to create an AI that could make informed guesses as to what happened in between the snapshots. But he’d need to validate its predictions somehow. Normally, a machine learning AI would use a fraction of a given data set to “train” itself. It would then make a series of trial-and-error guesses, correcting itself until its predictions were accurate. Then scientists would test its accuracy with the larger, whole dataset.

But the eBird data don’t directly record bird movements needed to evaluate the predictions. Instead, Fuentes would use the entire eBird data to train the AI — a new innovation. To test it, he’d use a much more definitive, if smaller, data set collected by another group of scientists: tracking data collected from individual birds’ migrations. By testing against real migration data from tagged birds, he could be sure that the AI’s predictions were valid.

The raw eBird data was not enough, though. Bird watchers aren’t evenly distributed around the world — in particular, they’re more likely to live in affluent countries. Also, the quality of the observations vary, based on the abilities of each individual volunteer. Before Fuentes could use the eBird data to train his AI, such biases would need to be corrected.

Fortunately, the eBird team was keenly interested in making these corrections to the data, so that scientists all over the world could use it more accurately. Daniel Fink and Tom Auer of the eBird team led the way in the work to “clean up” the database. Fuentes could train and run his AI using the computing resources available to him at the UMass. But the Cornell scientists would need far more computing power for their data clean-up. For each of the 2,300 bird species they would analyze over the course of a year, they would need 2 to 8 gigabytes (GB) of computer memory and 3,000 to 4,000 CPU hours. In order to carry out the analysis in a reasonable amount of time, they’d need to run many species in parallel. Bridges-2’s “regular memory” nodes, offering 256 to 512 GB of memory apiece (enough to qualify as “large memory” on most supercomputers), gave them exactly this capability. By managing their use of memory carefully, they processed some 15,000 to 20,000 GB of data per month.

Early results are promising. BirdFlow on its own predicted migration patterns that matched the movements of the tagged birds well. Adding random movements, to simulate the uncertainty of individual birds’ courses, improved the match with the tracking data even more. The group reported an initial analysis of 11 species of North American birds in a paper in the journal Methods in Ecology and Evolution in January 2023.

In the cleaned-up eBird tool, and its application to BirdFlow, the scientists have created a group of resources that they would like to see other researchers use. You can see how eBird tracks observation snapshots here. The AI can suggest migration routes as well as their timing and connectivity in ways that biologists can then test in the field. The tool promises advances in many fields, including migration ecology, conservation, disease surveillance, aviation, and public outreach.


Source: Ken Chiacchia, PSC 

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Can Cerabyte Crack the $1-Per-Petabyte Barrier with Ceramic Storage?

July 20, 2024

A German startup named Cerabyte is hoping to solve the burgeoning market for secondary and archival data storage with a novel approach that uses lasers to etch bits onto glass with a ceramic coating. The “grey ceramic� Read more…

Weekly Wire Roundup: July 15-July 19, 2024

July 19, 2024

It's summertime (for most of us), and the HPC-related headlines aren't as plentiful as they once were. But not everything has to happen at high tide-- this week still had some waves! Idaho National Laboratory's Bitter Read more…

ARM, Fujitsu Targeting Open-source Software for Power Efficiency in 2-nm Chip

July 19, 2024

Fujitsu and ARM are relying on open-source software to bring power efficiency to an air-cooled supercomputing chip that will ship in 2027. Monaka chip, which will be made using the 2-nanometer process, is based on the Read more…

SCALEing the CUDA Castle

July 18, 2024

In a previous article, HPCwire has reported on a way in which AMD can get across the CUDA moat that protects the Nvidia CUDA castle (at least for PyTorch AI projects.). Other tools have joined the CUDA castle siege. AMD Read more…

Quantum Watchers – Terrific Interview with Caltech’s John Preskill by CERN

July 17, 2024

In case you missed it, there's a fascinating interview with John Preskill, the prominent Caltech physicist and pioneering quantum computing researcher that was recently posted by CERN’s department of experimental physi Read more…

Aurora AI-Driven Atmosphere Model is 5,000x Faster Than Traditional Systems

July 16, 2024

While the onset of human-driven climate change brings with it many horrors, the increase in the frequency and strength of storms poses an enormous threat to communities across the globe. As climate change is warming ocea Read more…

Can Cerabyte Crack the $1-Per-Petabyte Barrier with Ceramic Storage?

July 20, 2024

A German startup named Cerabyte is hoping to solve the burgeoning market for secondary and archival data storage with a novel approach that uses lasers to etch Read more…

SCALEing the CUDA Castle

July 18, 2024

In a previous article, HPCwire has reported on a way in which AMD can get across the CUDA moat that protects the Nvidia CUDA castle (at least for PyTorch AI pro Read more…

Aurora AI-Driven Atmosphere Model is 5,000x Faster Than Traditional Systems

July 16, 2024

While the onset of human-driven climate change brings with it many horrors, the increase in the frequency and strength of storms poses an enormous threat to com Read more…

Shutterstock 1886124835

Researchers Say Memory Bandwidth and NVLink Speeds in Hopper Not So Simple

July 15, 2024

Researchers measured the real-world bandwidth of Nvidia's Grace Hopper superchip, with the chip-to-chip interconnect results falling well short of theoretical c Read more…

Shutterstock 2203611339

NSF Issues Next Solicitation and More Detail on National Quantum Virtual Laboratory

July 10, 2024

After percolating for roughly a year, NSF has issued the next solicitation for the National Quantum Virtual Lab program — this one focused on design and imple Read more…

NCSA’s SEAS Team Keeps APACE of AlphaFold2

July 9, 2024

High-performance computing (HPC) can often be challenging for researchers to use because it requires expertise in working with large datasets, scaling the softw Read more…

Anders Jensen on Europe’s Plan for AI-optimized Supercomputers, Welcoming the UK, and More

July 8, 2024

The recent ISC24 conference in Hamburg showcased LUMI and other leadership-class supercomputers co-funded by the EuroHPC Joint Undertaking (JU), including three Read more…

Generative AI to Account for 1.5% of World’s Power Consumption by 2029

July 8, 2024

Generative AI will take on a larger chunk of the world's power consumption to keep up with the hefty hardware requirements to run applications. "AI chips repres Read more…

Atos Outlines Plans to Get Acquired, and a Path Forward

May 21, 2024

Atos – via its subsidiary Eviden – is the second major supercomputer maker outside of HPE, while others have largely dropped out. The lack of integrators and Atos' financial turmoil have the HPC market worried. If Atos goes under, HPE will be the only major option for building large-scale systems. Read more…

Everyone Except Nvidia Forms Ultra Accelerator Link (UALink) Consortium

May 30, 2024

Consider the GPU. An island of SIMD greatness that makes light work of matrix math. Originally designed to rapidly paint dots on a computer monitor, it was then Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Shutterstock_1687123447

Nvidia Economics: Make $5-$7 for Every $1 Spent on GPUs

June 30, 2024

Nvidia is saying that companies could make $5 to $7 for every $1 invested in GPUs over a four-year period. Customers are investing billions in new Nvidia hardwa Read more…

Nvidia Shipped 3.76 Million Data-center GPUs in 2023, According to Study

June 10, 2024

Nvidia had an explosive 2023 in data-center GPU shipments, which totaled roughly 3.76 million units, according to a study conducted by semiconductor analyst fir Read more…

AMD Clears Up Messy GPU Roadmap, Upgrades Chips Annually

June 3, 2024

In the world of AI, there's a desperate search for an alternative to Nvidia's GPUs, and AMD is stepping up to the plate. AMD detailed its updated GPU roadmap, w Read more…

Some Reasons Why Aurora Didn’t Take First Place in the Top500 List

May 15, 2024

The makers of the Aurora supercomputer, which is housed at the Argonne National Laboratory, gave some reasons why the system didn't make the top spot on the Top Read more…

Intel’s Next-gen Falcon Shores Coming Out in Late 2025 

April 30, 2024

It's a long wait for customers hanging on for Intel's next-generation GPU, Falcon Shores, which will be released in late 2025.  "Then we have a rich, a very Read more…

Leading Solution Providers

Contributors

Google Announces Sixth-generation AI Chip, a TPU Called Trillium

May 17, 2024

On Tuesday May 14th, Google announced its sixth-generation TPU (tensor processing unit) called Trillium.  The chip, essentially a TPU v6, is the company's l Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

IonQ Plots Path to Commercial (Quantum) Advantage

July 2, 2024

IonQ, the trapped ion quantum computing specialist, delivered a progress report last week firming up 2024/25 product goals and reviewing its technology roadmap. Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

The NASA Black Hole Plunge

May 7, 2024

We have all thought about it. No one has done it, but now, thanks to HPC, we see what it looks like. Hold on to your feet because NASA has released videos of wh Read more…

Q&A with Nvidia’s Chief of DGX Systems on the DGX-GB200 Rack-scale System

March 27, 2024

Pictures of Nvidia's new flagship mega-server, the DGX GB200, on the GTC show floor got favorable reactions on social media for the sheer amount of computing po Read more…

MLPerf Inference 4.0 Results Showcase GenAI; Nvidia Still Dominates

March 28, 2024

There were no startling surprises in the latest MLPerf Inference benchmark (4.0) results released yesterday. Two new workloads — Llama 2 and Stable Diffusion Read more…

NVLink: Faster Interconnects and Switches to Help Relieve Data Bottlenecks

March 25, 2024

Nvidia’s new Blackwell architecture may have stolen the show this week at the GPU Technology Conference in San Jose, California. But an emerging bottleneck at Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire