HPCwire

Leading HPC
Solution Providers





















HPCwire >> Blogs

Blog: From the Editor

From the Editor | Main Blog Index

NEC Does Some Vector Addition


Last Thursday, NEC announced its sixth generation vector supercomputer, the SX-9, which the company is touting as the "worlds fastest vector supercomputer." The company says the new machine will be twice as energy-efficient as the SX-8R generation. The SX-9 is based on a new 100 gigaflop vector processor, sixteen of which are placed in a node. In addition to the new vector processor, the SX-9 supports up to one terabyte of shared memory per node and an internode interconnect of up to 128 GB/second. At its maximum configuration of 512 nodes, the SX-9 would deliver a peak vector performance of 839 teraflops.

Before I go any further, I should point out that to the best of my knowledge, no such machine is being built -- or ever will be. According to Thomas Schoenemeyer, HPC Presales Manager, NEC GmbH, nothing near the size of an 839 teraflop system is in the pipeline. NEC has orders for two systems in Europe. One is headed to the German Weather Service (DWD); the other to Meteo France. The German system, which will deliver 39 teraflops, and, coincidentally, costs 39 million euros (72 million dollars), is scheduled to be fully operational in 2010. The Meteo France system is also expected to be a sub-100 teraflop machine. This week, the company also announced an order from Japan's Tohoku University for a 26 teraflop system. NEC plans to ship bigger SX-9 systems down the road, but they don't expect to be challenging petaflop supercomputers in the foreseeable future.

"We are not going to be on the top of the TOP500 list with this system," admits Schoenemeyer. "Our focus is the productivity of the customer."

An 839 teraflop SX-9 would probably cost in the neighborhood of a billion dollars. So despite what you might have read elsewhere, the top systems from Cray and IBM are unlikely to be challenged by a maxed out SX-9 machine anytime soon. The last NEC machine to achieve TOP500 notoriety was the 36 teraflop Earth Simulator, a SX-6 generation system that was ranked the most powerful machine in the world from 2002 to 2004, before IBM Blue Gene/L overtook it.

Like its forebearers, the SX-9 is targeted to weather forecasting service facilities, climate research centers, and other government science centers. NEC has sold over 1000 SX systems over the past two decades -- the vast majority in Japan and Europe, although there are some outliers in Australia, South Africa, and Brazil. There are virtually none in North America.

The way NEC is happily churning out vector supercomputers, one might get the impression that weather and climate modeling is a growth industry. While global warming is certainly a big topic these days, such research is unlikely to propel SX-9 production into the double-digit growth rates enjoyed by the overall HPC market.

But unlike in North America, Japan and Europe have a decent-sized installed base of vector machines and the vast majority of them are NEC supers. Although most of the 1000-plus NEC vector machines sold over the last two decades have been retired, a lot of Japanese and European Earth science centers still run on SX systems. NEC is hoping many of these organizations will upgrade to the SX-9 at some point and keep the legacy going.

SX-8 applications are upwardly compatible with SX-9 (binary compatible), so the software upgrade path should be painless. NEC maintains its own compiler for the vector processors, as well as a Super-UX Unix OS to enable applications to fully utilize the large flat memory architecture and powerful processors. Both OpenMP and MPI  parallelism are supported. It's this kind of end-to-end support that has allowed NEC to maintain, and even grow, its customer base for more than two decades.

In the recent past, Cray has had some success with its X1 and X1E vector machines (Warsaw University, Spain's National Institute of Meteorology, Korea Meteorological Administration). But today the company is penetrating the European market with its Opteron-based XT4 systems. Cray's future strategy for its vector computing offerings will become more apparent next week.

Dedicated vector machines used to be all the rage in supercomputing, starting with the first commercial system in 1974, the CDC STAR-100. Cray soon followed with the Cray-1 in 1976. Later, NEC, Fujitsu and Hitachi each developed their own architectures. But vector supercomputing is a tough sell these days. The market share of these types of machines has been declining for some time, replaced by more general-purpose systems -- both tightly coupled supercomputers and computer clusters -- based on superscalar CPUs.

While HPC applications that make heavy use of a lot of matrix arithmetic, like computational fluid dynamic (CFD) codes, are well-suited to vector processors, in practice, multicore superscalar chips have proved to be a better overall technology. This is mainly because as HPC applications evolve, they become more complex, employing a greater variety of algorithms to get their job done. This complexity manifests itself in diverse computing requirements; some parts of the code require high levels of single-threaded performance, other parts require a lot of threads, and still others benefit from lots of data parallelism. Systems based on scalar processors tend to be very good at the first two, and pretty good at the third one. Vector-based machines are really only good at data parallelism (and actually only a subset of that). Even weather modeling applications, the vector machine's raison d'etre, require scalar processing for optimal performance.

More commodity-based vector processing solutions already exist and more are on the way. Short-vector SIMD on CPUs, like PowerPC AltiVec and x86 SSE, is a step in the direction of integrated vector capabilities. Mixing vector and scalar engines on the same dies, as has been done with the Cell BE processor, is another approach to making vector processing more mainstream. And as I wrote last week, coprocessor accelerators, like GPUs, FPGAs, and SIMD ASICs (ClearSpeed), are providing similar capabilities at a much more attractive price.

In the end, economics will choose how vector computing gets done. But the purveyors of proprietary solutions are on the wrong side of history. General-purpose commodity computing is not just here to stay, it's here to dominate.

-----

As always, comments about HPCwire are welcomed and encouraged. Write to me, Michael Feldman, at editor@hpcwire.com.

Posted by Michael Feldman - November 2 @ 12:00AM

(Digg, Technorati, more)

Discussion

There are 0 discussion items posted.  

Sponsored Links

White Paper: HPC in a Green and Modular Solution Building Block
Learn how the Appro GreenBlade™ System helps consolidate server, storage, network, power and simplified management capabilities in a single package while providing the performance-density, energy-efficiency and best ROI for your business.

Michael Feldman

Michael Feldman is the editor of HPCwire.

More Michael Feldman



Recent Comments

Feature Articles

Book Review: Petascale Computing: Algorithms and Applications

Petascale Computing: Algorithms and Applications, edited by David A. Bader, is the first book in CRC's Computational Science Series, edited by Horst Simon. Although the book is a collection of papers, Bader has done an excellent job of creating a compilation that holds together and covers a broad topic very well.
Read More...

The Week in Review

Cilk++ used in parallelization of the FP-tree algorithm for pattern mining; Istanbul benchmark results posted; and the latest on the NVIDIA Tesla shortage. John West recaps those stories and more in our weekly wrap-up.
Read More...

A Trio of HPC Offerings Unveiled at ISC

Last week's International Supercomputing Conference (ISC'09) was a convenient excuse for vendors to announce a raft of new products, but three, in particular, stood out.
Read More...

Top Headlines

Cloudy With a Chance of HPC

Jul 01 | GenomeWeb Daily News | The popularity of cloud computing in the life sciences community was on full display at April's Bio-IT World conference. Read more...

HPC From the Beach

Jul 01 | Linux Magazine | How can getting to the ocean help with HPC computing? Read more...

DARPA Investigates Extreme Supercomputing

Jun 29 | GCN.com | Agency issues RFI for "Ubiquitous High Performance Computing" systems. Read more...

Supercomputers Go From Biggest to Cheapest

Jun 29 | Computerworld | The bottom of the TOP500 reveals the coming revolution in truly accessible high-end computing. Read more...

CPUs Gear Up For -- and Some Avoid -- Hot Chips

Jun 18 | EE Times | Parallel software also takes spotlight at Stanford confab. Read more...

Featured Whitepapers

Building High Performance Computing in a Green and Modular Solution Building Block

Apr 14 | | Many HPC IT departments are feeling the rising pressure to deliver more capacity computing and performance while trying to reduce the total cost of ownership. This white paper discusses how an environmentally-friendly and open-standards HPC building block based computing system using flexible interconnect options helps address capacity computing needs.

Multimedia

Webcast: Dell Expands HPC Access and Adoption with Intel Cluster Ready Program


Source: Addison Snell, GM/VP, Tabor Research; sponsored by Dell

Many organizations that could benefit from the use of HPC clusters find that it is complicated to get the systems up and running because of limited IT resources or the complexities of the clusters themselves. Learn how the Intel Cluster Ready program, for which Dell was an original partner, seeks to address this challenge for entry level and mid-range HPC users.

Video White Paper: Architecting a Better Network Storage Solution

BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.

Webcast: HPC Development Solutions: Sun Studio & Sun HPC ClusterTools


Sun Studio Compilers and Tools and Sun HPC ClusterTools allow you to create high performance parallel applications for OpenSolaris, Solaris and Linux. Sun Studio Express 11/08 includes MPI performance analysis capabilities and full OpenMP 3.0 compiler support. Learn about all this and the latest in Sun HPC ClusterTools 8.1.

Blogs by Topics

Blogs by Author

HPC Blogroll



Featured Events


WORLDCOMP 2009
Data Mining Courses