The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
From the Editor | Main Blog Index
November 02, 2007
Last Thursday, NEC announced its sixth generation vector supercomputer, the SX-9, which the company is touting as the "worlds fastest vector supercomputer." The company says the new machine will be twice as energy-efficient as the SX-8R generation. The SX-9 is based on a new 100 gigaflop vector processor, sixteen of which are placed in a node. In addition to the new vector processor, the SX-9 supports up to one terabyte of shared memory per node and an internode interconnect of up to 128 GB/second. At its maximum configuration of 512 nodes, the SX-9 would deliver a peak vector performance of 839 teraflops.
Before I go any further, I should point out that to the best of my knowledge, no such machine is being built -- or ever will be. According to Thomas Schoenemeyer, HPC Presales Manager, NEC GmbH, nothing near the size of an 839 teraflop system is in the pipeline. NEC has orders for two systems in Europe. One is headed to the German Weather Service (DWD); the other to Meteo France. The German system, which will deliver 39 teraflops, and, coincidentally, costs 39 million euros (72 million dollars), is scheduled to be fully operational in 2010. The Meteo France system is also expected to be a sub-100 teraflop machine. This week, the company also announced an order from Japan's Tohoku University for a 26 teraflop system. NEC plans to ship bigger SX-9 systems down the road, but they don't expect to be challenging petaflop supercomputers in the foreseeable future.
"We are not going to be on the top of the TOP500 list with this system," admits Schoenemeyer. "Our focus is the productivity of the customer."
An 839 teraflop SX-9 would probably cost in the neighborhood of a billion dollars. So despite what you might have read elsewhere, the top systems from Cray and IBM are unlikely to be challenged by a maxed out SX-9 machine anytime soon. The last NEC machine to achieve TOP500 notoriety was the 36 teraflop Earth Simulator, a SX-6 generation system that was ranked the most powerful machine in the world from 2002 to 2004, before IBM Blue Gene/L overtook it.
Like its forebearers, the SX-9 is targeted to weather forecasting service facilities, climate research centers, and other government science centers. NEC has sold over 1000 SX systems over the past two decades -- the vast majority in Japan and Europe, although there are some outliers in Australia, South Africa, and Brazil. There are virtually none in North America.
The way NEC is happily churning out vector supercomputers, one might get the impression that weather and climate modeling is a growth industry. While global warming is certainly a big topic these days, such research is unlikely to propel SX-9 production into the double-digit growth rates enjoyed by the overall HPC market.
But unlike in North America, Japan and Europe have a decent-sized installed base of vector machines and the vast majority of them are NEC supers. Although most of the 1000-plus NEC vector machines sold over the last two decades have been retired, a lot of Japanese and European Earth science centers still run on SX systems. NEC is hoping many of these organizations will upgrade to the SX-9 at some point and keep the legacy going.
SX-8 applications are upwardly compatible with SX-9 (binary compatible), so the software upgrade path should be painless. NEC maintains its own compiler for the vector processors, as well as a Super-UX Unix OS to enable applications to fully utilize the large flat memory architecture and powerful processors. Both OpenMP and MPI parallelism are supported. It's this kind of end-to-end support that has allowed NEC to maintain, and even grow, its customer base for more than two decades.
In the recent past, Cray has had some success with its X1 and X1E vector machines (Warsaw University, Spain's National Institute of Meteorology, Korea Meteorological Administration). But today the company is penetrating the European market with its Opteron-based XT4 systems. Cray's future strategy for its vector computing offerings will become more apparent next week.
Dedicated vector machines used to be all the rage in supercomputing, starting with the first commercial system in 1974, the CDC STAR-100. Cray soon followed with the Cray-1 in 1976. Later, NEC, Fujitsu and Hitachi each developed their own architectures. But vector supercomputing is a tough sell these days. The market share of these types of machines has been declining for some time, replaced by more general-purpose systems -- both tightly coupled supercomputers and computer clusters -- based on superscalar CPUs.
While HPC applications that make heavy use of a lot of matrix arithmetic, like computational fluid dynamic (CFD) codes, are well-suited to vector processors, in practice, multicore superscalar chips have proved to be a better overall technology. This is mainly because as HPC applications evolve, they become more complex, employing a greater variety of algorithms to get their job done. This complexity manifests itself in diverse computing requirements; some parts of the code require high levels of single-threaded performance, other parts require a lot of threads, and still others benefit from lots of data parallelism. Systems based on scalar processors tend to be very good at the first two, and pretty good at the third one. Vector-based machines are really only good at data parallelism (and actually only a subset of that). Even weather modeling applications, the vector machine's raison d'etre, require scalar processing for optimal performance.
More commodity-based vector processing solutions already exist and more are on the way. Short-vector SIMD on CPUs, like PowerPC AltiVec and x86 SSE, is a step in the direction of integrated vector capabilities. Mixing vector and scalar engines on the same dies, as has been done with the Cell BE processor, is another approach to making vector processing more mainstream. And as I wrote last week, coprocessor accelerators, like GPUs, FPGAs, and SIMD ASICs (ClearSpeed), are providing similar capabilities at a much more attractive price.
In the end, economics will choose how vector computing gets done. But the purveyors of proprietary solutions are on the wrong side of history. General-purpose commodity computing is not just here to stay, it's here to dominate.
-----
As always, comments about HPCwire are welcomed and encouraged. Write to me, Michael Feldman, at editor@hpcwire.com.
Posted by Michael Feldman - November 2 @ 12:00AM
(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Michael Feldman is the editor of HPCwire.
More Michael Feldman
Compairson to Core i7-980X by rsingle
HPC? not so much by ewahl
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Re: Podcast: Cray Awarded Defense Deal; SGI Makes Storage Buy; IBM Invents New Algorithm by Nastyanna
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
a third vendor for lossless low latency 10GbE fabric by lee.fisher@hp.com
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
563 out of 1206 by vvolkov
Little Iron by gadunk
At least it's not "cloud" by KevinButerbaugh
Native QPI Interface? by commike
Mmmmmm by hellcats
New transistorized IC chip scales. by symmecon
Itanium at IDF by Alan M. Feldstein
Communication time by jnapper
"The financial meltdown and computing" by donpellegrino
Human Models by mdgabriel
High-End SPARC Chip for Scientific Applications by Alan M. Feldstein
RapidMind by Mr LolO
Rapidmind by dminor
Longer run times by JohnWest
re: Algo trading Angst by jshore
Results of Testing by in_the_crease
C-DAC announces plans for a petaflop system; IBM researchers are working on vertical integration techniques to extend Moore's Law another 15 years. We recap those stories and more in our weekly wrapup.
Read More...
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
Mar 19 | OfficialWire | New super to support intelligence work Down Under. Read more...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html