HPCwire

Leading HPC
Solution Providers




















HPCwire >> Features

High Performance Messaging for Financial Services

-- an alternative model for HPC?


Page:  1  of  3
1 | 2 | 3   All  »  

The financial services industry thrives on data: millions upon millions of individual buy and sell orders that make up our complex global financial markets. The data has to be accurate, and speed of delivery is a competitive advantage in a business where delivery improvements measured in microseconds can make or break companies.

The financial services industry is a long-time consumer of large-scale HPC, and what's going on in their compute infrastructure has always been relevant to users on the technical side of HPC. As I was talking to Wombat Financial Services about their new Wombat Data Fabric platform, I caught a glimpse of developing ideas that could influence the future of high performance technical computing (HPTC) as we struggle to deal with systems in excess of 100,000 cores.

Wombat was founded in 1997, and acquired by NYSE/Euronext in March of 2008. The company provides software that allows financial firms access to high speed market data in large volumes very quickly. The volume of financial market data quadrupled in 2007. This growth is continuing in 2008, making the ability to provide rapid access to a growing volume of time sensitive data a key differentiator for competitors in this market.

Wombat's latest product -- Wombat Data Fabric(WDF) -- is a middleware solution based on a Remote Direct Memory Access (RDMA) model over an InfiniBand substrate. WDF plugs in underneath Wombat's widely adopted Middleware Agnostic Messaging API (MAMA) that higher level applications are written to, preventing the need for users to port applications as they move from UDP and TCP to next generation fabric-based messaging.

InfiniBand has several advantages over Ethernet-based data delivery networks in financial services. Because of the volume of relatively short messages that have to be exchanged on financial information networks, demands on the I/O subsystem represent a large part of the capacity challenge in these systems. The context switches and data copies that are needed as data moves from the source system to its destination system using TCP over Ethernet introduce latencies and computational loads that can quickly overwhelm the system, causing delays and leading to missed trading opportunities.

A common messaging architecture in this industry is publish/subscribe, or pub/sub. Pub/sub is a loosely-coupled, flexible messaging paradigm where a set of data sources (the publishers) broadcast information on specific topics, and a set of data consumers (subscribers) receive information on topics of interest to them. The subscribers and publishers needn't know about one another, and individual subscribers and publishers may come and go as they please without interrupting the operation of the rest of the participants in the data exchange.

As Ken Barnes, VP of Business and Planning at Wombat, explains, "If you go into the datacenter behind any major trading operation (be it Merrill's equity trading desk, or a stock exchange, or a hedge fund, for example) what you'll find is a row of cabinets housing hundreds of interconnected nodes. You might have a hundred of them publishing normalized data received from dozens of financial markets they participate in (stock exchanges, derivative exchanges, foreign exchange markets, etc). And you'll have a hundred(s) of other servers consuming that data to run their trading algorithms and analytical applications."

InfiniBand allows Wombat's middleware to facilitate the transfer of data between publishers and subscribers using RDMA, bypassing context switches and data copies in the producing and consuming applications. This bypass reduces load on the computers involved in the exchange and also reduces the time it takes for information to reach its intended application on a particular machine.

Another benefit to this approach shows up when messages are dropped by subscribers. Typically when a subscriber becomes overloaded, or network service is interrupted, messages of interest to the subscriber are dropped. When the subscriber recovers, it has to ask the publisher for a data recap so it can catch up to the current state. This can be problematic for large recaps that can trigger subscriber overloads, dropping recapped messages, requiring additional recaps, and so on. This kind of "recap storm" causes congestion in the network, and can overload the publisher, thus causing problems for other subscribers not involved in the original problem.

Wombat's Data Fabric exploits RDMA to allow subscribers to get the data they need on-demand by reading it directly from the publisher's memory. A subscriber that dropped messages can consume the missed messages -- at a rate it determines -- directly from the publisher with no risk of being overwhelmed by the response and no knock on effect to other subscribers.

Page:  1  of  3
1 | 2 | 3   All  »  

Article Tools

  • Print This Page
  • Bookmark This Article

Share Options

(Digg, Technorati, more)


Subscribe

Discussion

There are 0 discussion items posted.  

Sponsored Links

New Paper: Parallel Computing Without Parallel Programming
Learn how domain experts can run VHLL programs like MATLAB® on a variety of high-performance platforms without low-level reprogramming and how to work with the largest datasets and complex algorithms without sacrificing ease of use or reducing productivity.



Top Headlines

3D Seismic Data: Taking a Smarter Approach to Interpretation

Jul 09 | Engineer Live | The demand for computational tools to underpin the 3D seismic interpretation process has never been more apparent. Read more...

Engineering Unemployment Soared in 2Q to 8.6%

Jul 08 | EE Times | Unemployment for U.S. engineers has reached record levels, according to government figures. Read more...

Gartner Adjusts 2009 IT Spend Downward Again

Jul 08 | Network World | Global spending for 2009 projected to drop 6 percent, for a total of $3.2 trillion. Read more...

Concurrent and Parallel Are Not The Same

Jul 08 | Linux Magazine | Portability or efficiency? Neither is guaranteed when writing explicit parallel code. Read more...

800 TFLOP Real-Time Ray Tracing GPU Unveiled, Not for Gamers

Jul 07 | Ars Technica | Japanese company builds custom ASIC to accelerate real-time ray traced rendering for the auto industry. Read more...

Featured Whitepapers

Parallel Computing Without Parallel Programming

Jul 10 | | Engineers, scientists, and other domain experts depend on the productivity enabled by very high-level language (VHLL) tools like MATLAB® and Python. However, as datasets grow larger and programs get more sophisticated, ordinary desktop computers can no longer keep up. The paper explores how to run VHLL programs on high-performance platforms without low-level reprogramming. Work with large datasets and complex algorithms without sacrificing ease of use or reducing productivity.

Building High Performance Computing in a Green and Modular Solution Building Block

Apr 14 | | Many HPC IT departments are feeling the rising pressure to deliver more capacity computing and performance while trying to reduce the total cost of ownership. This white paper discusses how an environmentally-friendly and open-standards HPC building block based computing system using flexible interconnect options helps address capacity computing needs.

Multimedia

Webcast: Dell Expands HPC Access and Adoption with Intel Cluster Ready Program


Source: Addison Snell, GM/VP, Tabor Research; sponsored by Dell

Many organizations that could benefit from the use of HPC clusters find that it is complicated to get the systems up and running because of limited IT resources or the complexities of the clusters themselves. Learn how the Intel Cluster Ready program, for which Dell was an original partner, seeks to address this challenge for entry level and mid-range HPC users.

Video White Paper: Architecting a Better Network Storage Solution

BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.

Webcast: HPC Development Solutions: Sun Studio & Sun HPC ClusterTools


Sun Studio Compilers and Tools and Sun HPC ClusterTools allow you to create high performance parallel applications for OpenSolaris, Solaris and Linux. Sun Studio Express 11/08 includes MPI performance analysis capabilities and full OpenMP 3.0 compiler support. Learn about all this and the latest in Sun HPC ClusterTools 8.1.

Special Feature: ISC'09

Newsletters

Stay informed! Subscribe to HPCwire email Newsletters.






HPC Job Bank


Featured Events

WORLDCOMP 2009
Data Mining Courses