SCIENCE & ENGINEERING NEWS
San Diego, CA — The Internet, so powerful for sharing text, images, sounds and videos, is now weakest at doing that for which it was originally designed – exchanging raw data between researchers. Last week, the National Center for Data Mining at the University of Illinois at Chicago launched the first version of an infrastructure called Data Space Transfer Protocol, or DSTP, for creating the next generation web of data.
The current Web provides an infrastructure for working with distributed multimedia documents, which are exchanged using the familiar “http,” or Hypertext Transfer Protocol. But although a massive amount of data is available online, it is stored in so many different formats that it has become difficult, if not impossible, to analyze and use in research, says Georg Reinhart, visiting research scientist in mathematics, statistics, and computer science at UIC.
“Doctors, for example, often need to share information and data, but each doctor stores and uploads data in his or her own format,” Reinhart said.
“Astronomers, physicists and other researchers often face the same problem.”
According to Reinhart, who developed DSTP with Emory Creel, a colleague at the National Center for Data Mining, the new transfer protocol will unify the way data is stored online. Downloading data from different sites via high-speed networks and analyzing the data in real-time will become possible for the first time.
“DSTP will standardize the way data is shared, the same way HTTP revolutionized the way documents are shared,” Reinhart said. “Researchers will be able to search, analyze and mine databases simultaneously, even if the databases contain different types of data.” Reinhart predicts DSTP will motivate more researchers to post data globally and lead to “an avalanche” of new and existing data accessible and useful to a wider audience.
Six sites around the world, including the United Kingdom and Australia, are participating this week in a demonstration of distributed data transfer using DSTP at the NASA/NREN Gigabit Networking Workshop in Moffet Field, Calif.
DSTP is available free to the public. The software, along with a sample demonstration, is available at http://www.dataspaceweb.net . The National Center for Data Mining is also committed to creating the necessary infrastructure and software to support DSTP for distributed data mining.
With 25,000 students, the University of Illinois at Chicago is the largest and most diverse university in the Chicago area. UIC is home to the largest medical school in the United States and is one of only 88 national Research I universities. Located just west of Chicago’s Loop, UIC is a vital part of the educational, technological, and cultural fabric of the area. For more information, visit http://www.uic.edu .