Nov. 15, 2021 — The Aristotle Cloud Federation project built a scalable multi-institutional cloud federation that provides data analysis building blocks to support researchers requiring flexible workflows and large data sets.
Cloud computing systems were deployed at Cornell University, the University at Buffalo, and the University of California, Santa Barbara, and allocations, accounting, and cloud metrics were developed and implemented.
“Strategic use cases from diverse disciplines demonstrated the potential of using federated clouds to share computing, data, and software resources between institutions and reduce time to science,” says Cornell University PI David Lifka. “A rich set of open-source software and cloud usage modalities were supported, including Virtual Machine snapshots of complex software systems, application containers, dynamically-sized and scheduled compute clusters in the cloud, and optimized frameworks for query-based exploration, modeling, and analysis.”
The Aristotle project team and use case scientists produced 147 journal publications, conference papers, and presentations and 28 new technologies and techniques. New technologies advanced the knowledge and practice of federated cloud computing, public cloud economics, cloud versus supercomputer performance, edge computing, systems for implementing Internet of Things (IoT) applications, automated application deployment, and application containerization and orchestration.
Aristotle research has the potential to impact wind turbine companies interested in harnessing the energy of the atmosphere and converting it into carbon-free electricity, policymakers regulating high-frequency trading, agencies making water resource management decisions, health practitioners treating immune and metabolic diseases, oceanographers restoring nearshore coastal ecosystems, and farmers increasing yield while protecting the environment with real-time soil, water, and atmospheric data.
“In some cases, implementations of this research are already underway,” notes University of California, Santa Barbara co-PI Rich Wolski. “For example, a low-power, multi-tier IoT deployment developed by the Aristotle team and the agricultural community will monitor the growth of citrus under protective screening to protect citrus from the Huanglongbing bacteria (citrus greening disease) that devastated Florida and is now threatening California.” The Aristotle cloud is the data hosting service.
The project also had an impact on education and workforce development. Aristotle hosted 19 National Science Foundation Research Experiences for Undergraduates (REU) students, several of whom went on to achieve PhDs. Graduate and PhD students were heavily engaged in the project which afforded them the opportunity to learn many highly sought workplace skills, including cloud computing, machine learning (ML), data analysis, and data visualization. In addition, Aristotle team expertise in using, maintaining, and scaling campus clouds for academic use led to collaborations that automated and scaled a Math and Statistics Calculus curriculum, Computer Science courses in cloud computing, edge computing, and IoT, and Engineering and Applied Science courses in Programming and Data Science. Aristotle was cost-effective because instructors chose instance types based on class size, and after the semester was over, deleted the instance. Aristotle how-to guides and training videos are publicly available on the Aristotle portal.
Aristotle technologies and cloud computing resources can provide additional value to the CI and research community by supporting new science use cases such as gathering and analyzing remote data from the seabed and atmosphere to study kelp forest dynamics, and analyzing data from ground-based observation networks to study atmospheric cloud formation. The Aristotle team also has the skills and experience to implement emerging technologies such as end-to-end systems for practical and accessible IoT, and federated software development environments for ML and AI.
“In addition, plans are underway to integrate a regional Jetstream2 cloud with Red Cloud at Cornell to further explore federated cloud computing and make OpenStack enhancements that will benefit the wider community,” says Cornell Center for Advanced Computing deputy director Rich Knepper.
Source: Cornell University Center for Advanced Computing (CAC)