HPCwire

Leading HPC
Solution Providers


























HPCwire >> Off the Wire

'Semantic Supercomputing' Mines Patent Databases


WALTHAM, Mass., May 20 -- Vienna-based Matrixware Information Services is using Interactive Supercomputing, Inc.'s (ISC) Star-P software to tackle the ever-growing challenge of finding patent information hidden in the world's vast patent databases and libraries.

The Austrian company employs a team of computer engineers, mathematicians, linguists and patent specialists to help companies mine patent repositories for intellectual property information. It combines natural language processing (NLP) algorithms with what it calls semantic supercomputing to retrieve relevant patent information faster, easier and at less cost.

Patents and intellectual property play an increasingly important role as intangible assets of industrial corporations. Some 60 million patents have been awarded around the world, and the yearly number of new filings is on the rise. Over 250,000 companies worldwide depend on patent data. Consequently, professional management of patents and precise retrieval of patent information are essential business processes for industries around the globe.

To solve this problem, Matrixware employs multi-core high performance computers from SGI and Star-P's interactive parallel computing capabilities to develop and run its NLP algorithms on the enormous, terabyte-scale patent data sets. Star-P enables Matrixware's team to continuously code and refine NLP algorithms on their desktops using MATLAB, a popular mathematical tool, and then run them instantly and interactively on parallel computers with little to no modification. Star-P eliminates the need to re-program applications in C, Fortran or MPI in order to run on parallel systems, resulting in huge productivity gains.

Matrixware's Alexandria System is the central storage for the raw data as well as for the enriched data. Data access is modeled along the well established Library Science methods and embedded into a workflow system. The Alexandria server also provides the user with exact and constantly updated document counts in the collections he is retrieving from.

Recursively generating metadata from data and metadata from metadata, the various refinement processes let the information store grow and allow the user community to actively "Cultivate the Corpus." As a development and front end framework, Matrixware created an extensible software infrastructure, the "Leonardo" Ecosystem. Within this framework, technologists can simultaneously create and refine new tools and use the community channel to communicate with their end-users. The benefit for the end-users on the other side is a closer match between the tools for their actual information needs and existing workflows.

"Matrixware processes patent data by its meaning in context to turn it into valuable information for our clients. Our purpose is to boost their productivity and open up new opportunities for them using intellectual property information," says Francisco Webber, Matrixware's managing director. "But while our scientists are experts in information retrieval, they are not parallel programming experts. Star-P enables them to tap the power of parallel HPCs to refine and run their natural language processing applications as well as to improve the data quality of our patent databases."

"Despite the massive growth of patent information over the last several decades, researchers still search the way they did 30 years ago," says David Gibson, ISC's vice president of business development. "Matrixware's NLP technology and semantic supercomputing breakthroughs are turning patent information retrieval into a huge competitive advantage for companies whose success hinges on intellectual property discovery and protection."

About Matrixware Information Services

The Vienna, Austria-based company Matrixware Information Services GmbH was founded in 2005 and is active in the field of patent retrieval, a subarea of information retrieval (IR). IR falls under the heading of information science and informatics and uses computer-assisted methods to conduct content-based information searches.

About Interactive Supercomputing

Interactive Supercomputing (ISC) launched in 2004 to commercialize Star-P, an interactive parallel computing platform. With automatic parallelization and interactive execution of existing desktop simulation applications, Star-P merges two previously distinct environments -- desktop computers and high performance servers -- into one. Based in Waltham, Mass., the privately held company markets Star-P for a range of biomedical, financial, and government laboratory research applications.

-----

Source: Interactive Supercomputing


Article Tools

  • Print This Article

Share & Save Options

Discussion

There are 0 discussion items posted.  

Sponsored Links



Feature Articles

The Linux HPC Empire Strikes Back

While the Microsoft juggernaut has been touting the joys of its new Windows HPC Server 2008, the Linux HPC contingent has been somewhat less vocal of late. But now Red Hat has come up with its version of an integrated cluster solution.
Read More...

Nexsan Looks to Scare Up HPC Customers With Storage Beast

Even though the cost of servers still dominates the datacenter budget, storage is actually on a steeper growth curve. HPC storage, in particular, is being singled out as high-growth opportunity. Vendors are scrambling to keep up.
Read More...

The Week in Review

Google datacenters most energy efficient; Cluster Resources to demo Moab Hybrid Cluster; Red Hat Linux releases HPC distro. John West recaps those stories and more in our weekly wrap-up.
Read More...

Top Headlines

Lustre to Battle Corruption

Oct 07 | GCN.com | Sun Microsystems has been busy building a lot more intelligence into Lustre, a file system used for large-scale cluster computing. Read more...

Oracle and HP's Database Machine Predicated on Voltaire

Oct 06 | The Register | Does the HP Oracle Database Machine represent InfiniBand's big chance to break out its HPC niche? Read more...

3D Imaging Spreads to Fashion and Beyond

Oct 06 | BusinessWeek | A body scan can save a lot of time in the fitting room, and fields from medicine to architecture are adopting 3D computing applications. Read more...

Structural Engineers and Computer Scientists Hope to Integrate Disciplines to 'Revolutionize Building Construction'

Oct 03 | UCSD News | Despite the evolution of computer science over the past 30 years, structural engineering -- hindered by a reluctance to adapt to digital innovations -- has remained relatively unchanged as a discipline. Read more...

Credit Crisis Spreads a Pall Over Silicon Valley

Oct 02 | New York Times | Silcon Valley is starting to feel the effects of the credit crunch. Read more...

Featured Whitepapers

Panasas® Tiered Parity™ Architecture

Sep 04 | | Disk drives are approximately 250 times denser today than a decade ago. This is good news for users who are creating, manipulating and storing more data than ever before. It gives them an opportunity to derive more value from their stored data and lowers the capital acquisition and operating expense associated with that data.

Multimedia

Video White Paper: Architecting a Better Network Storage Solution

BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.

High Performance on Wall Street

Newsletters

Stay informed! Subscribe to HPCWire email Newsletters.

Get updates and insights on the High Productivity Computing industry delivered driectly to your inbox.





HPC Job Bank

Featured Events

LCI Workshop
SIFMA
HP-CAST
2008 Virtualization Conference & Expo
Symposium 2009