The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
May 20, 2008
WALTHAM, Mass., May 20 -- Vienna-based Matrixware Information Services is using Interactive Supercomputing, Inc.'s (ISC) Star-P software to tackle the ever-growing challenge of finding patent information hidden in the world's vast patent databases and libraries.
The Austrian company employs a team of computer engineers, mathematicians, linguists and patent specialists to help companies mine patent repositories for intellectual property information. It combines natural language processing (NLP) algorithms with what it calls semantic supercomputing to retrieve relevant patent information faster, easier and at less cost.
Patents and intellectual property play an increasingly important role as intangible assets of industrial corporations. Some 60 million patents have been awarded around the world, and the yearly number of new filings is on the rise. Over 250,000 companies worldwide depend on patent data. Consequently, professional management of patents and precise retrieval of patent information are essential business processes for industries around the globe.
To solve this problem, Matrixware employs multi-core high performance computers from SGI and Star-P's interactive parallel computing capabilities to develop and run its NLP algorithms on the enormous, terabyte-scale patent data sets. Star-P enables Matrixware's team to continuously code and refine NLP algorithms on their desktops using MATLAB, a popular mathematical tool, and then run them instantly and interactively on parallel computers with little to no modification. Star-P eliminates the need to re-program applications in C, Fortran or MPI in order to run on parallel systems, resulting in huge productivity gains.
Matrixware's Alexandria System is the central storage for the raw data as well as for the enriched data. Data access is modeled along the well established Library Science methods and embedded into a workflow system. The Alexandria server also provides the user with exact and constantly updated document counts in the collections he is retrieving from.
Recursively generating metadata from data and metadata from metadata, the various refinement processes let the information store grow and allow the user community to actively "Cultivate the Corpus." As a development and front end framework, Matrixware created an extensible software infrastructure, the "Leonardo" Ecosystem. Within this framework, technologists can simultaneously create and refine new tools and use the community channel to communicate with their end-users. The benefit for the end-users on the other side is a closer match between the tools for their actual information needs and existing workflows.
"Matrixware processes patent data by its meaning in context to turn it into valuable information for our clients. Our purpose is to boost their productivity and open up new opportunities for them using intellectual property information," says Francisco Webber, Matrixware's managing director. "But while our scientists are experts in information retrieval, they are not parallel programming experts. Star-P enables them to tap the power of parallel HPCs to refine and run their natural language processing applications as well as to improve the data quality of our patent databases."
"Despite the massive growth of patent information over the last several decades, researchers still search the way they did 30 years ago," says David Gibson, ISC's vice president of business development. "Matrixware's NLP technology and semantic supercomputing breakthroughs are turning patent information retrieval into a huge competitive advantage for companies whose success hinges on intellectual property discovery and protection."
About Matrixware Information Services
The Vienna, Austria-based company Matrixware Information Services GmbH was founded in 2005 and is active in the field of patent retrieval, a subarea of information retrieval (IR). IR falls under the heading of information science and informatics and uses computer-assisted methods to conduct content-based information searches.
About Interactive Supercomputing
Interactive Supercomputing (ISC) launched in 2004 to commercialize Star-P, an interactive parallel computing platform. With automatic parallelization and interactive execution of existing desktop simulation applications, Star-P merges two previously distinct environments -- desktop computers and high performance servers -- into one. Based in Waltham, Mass., the privately held company markets Star-P for a range of biomedical, financial, and government laboratory research applications.
-----
Source: Interactive Supercomputing
While the Microsoft juggernaut has been touting the joys of its new Windows HPC Server 2008, the Linux HPC contingent has been somewhat less vocal of late. But now Red Hat has come up with its version of an integrated cluster solution.
Read More...
Even though the cost of servers still dominates the datacenter budget, storage is actually on a steeper growth curve. HPC storage, in particular, is being singled out as high-growth opportunity. Vendors are scrambling to keep up.
Read More...
Google datacenters most energy efficient; Cluster Resources to demo Moab Hybrid Cluster; Red Hat Linux releases HPC distro. John West recaps those stories and more in our weekly wrap-up.
Read More...
Oct 07 | GCN.com | Sun Microsystems has been busy building a lot more intelligence into Lustre, a file system used for large-scale cluster computing. Read more...
Oct 06 | The Register | Does the HP Oracle Database Machine represent InfiniBand's big chance to break out its HPC niche? Read more...
Oct 06 | BusinessWeek | A body scan can save a lot of time in the fitting room, and fields from medicine to architecture are adopting 3D computing applications. Read more...
Oct 03 | UCSD News | Despite the evolution of computer science over the past 30 years, structural engineering -- hindered by a reluctance to adapt to digital innovations -- has remained relatively unchanged as a discipline. Read more...
Oct 02 | New York Times | Silcon Valley is starting to feel the effects of the credit crunch. Read more...
Sep 04 | | Disk drives are approximately 250 times denser today than a decade ago. This is good news for users who are creating, manipulating and storing more data than ever before. It gives them an opportunity to derive more value from their stored data and lowers the capital acquisition and operating expense associated with that data.
BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.
Get updates and insights on the High Productivity Computing industry delivered driectly to your inbox.