July 31, 2008
Come gather round people wherever you roam
And admit that the waters around you have grown
And accept it that soon you'll be drenched to the bone
If your time to you is worth saving
Then you'd better start swimming or you'll sink like a stone
For the times, they are a changing
-- Bob Dylan
These are interesting times for the microprocessor industry. At the same time the multicore revolution is happening, we're also seeing the rise of data parallel architectures. Yes, vector computing is back, but this time, it's not just for nerds.
In a recent Linux Magazine article on processor trends, Doug Eadline wrote that mainstream computing is splitting into two architectural paths: general-purpose multicore CPUs and data parallel engines -- what Eadline calls parallel/predictable computing units. The latter include GPUs, the Cell processor, and Intel's future Larrabee processors. To that list we could also add FPGAs and custom ASICs like the ClearSpeed devices.
General-purpose computing is great for software like word processors and operating systems, where the nature of the task is unpredictable from one moment to the next and data-intensive operations are largely absent. This type of code is strewn with "if-then-else" statements to handle fine-grained complexity. Predictable computing, on the other hand, is well-suited to multimedia apps and most types of HPC, where high levels of data parallelism can be exploited. If your code contains a lot of "for" statements that process big chunks of tables or arrays, you could probably benefit from data parallelism.
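To make the contrast concrete, here is a minimal sketch in Python. The function names and the arithmetic are hypothetical illustrations, not taken from Eadline's article. The first function is branch-heavy control code; the second is a "for" loop in which every iteration is independent of the others, which is what makes it a candidate for a data parallel engine. The third shows that the loop can be cut into chunks and run in any order without changing the result.

```python
# Hypothetical illustration; none of these names come from the article.

def handle_event(event):
    """Control-heavy code: the path taken depends on unpredictable input."""
    if event == "keypress":
        return "insert character"
    elif event == "save":
        return "write file"
    else:
        return "ignore"


def scale_and_add(a, x, y):
    """Data parallel loop: iteration i touches only x[i] and y[i]."""
    out = [0.0] * len(x)
    for i in range(len(x)):
        out[i] = a * x[i] + y[i]
    return out


def scale_and_add_chunked(a, x, y, chunks):
    """Same computation, split into independent chunks.

    The chunks are processed in reverse order on purpose, to show that
    ordering does not matter when iterations are independent.
    """
    n = len(x)
    size = max(1, (n + chunks - 1) // chunks)
    out = [0.0] * n
    for start in reversed(range(0, n, size)):
        end = min(start + size, n)
        out[start:end] = scale_and_add(a, x[start:end], y[start:end])
    return out
```

Because no chunk depends on another, each one could in principle be handed to a separate processing element. The branchy `handle_event` offers no such opportunity.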
The reason CPUs have dominated the computing landscape for so long is that all applications need some sort of program control, and any data-heavy for-loops could always be implemented serially. Today though, a high-end computer game wouldn't be practical without a GPU or game processor. And as visual and audio media become commonplace on the Internet and in mobile devices, clients and servers will need to be equipped with chips that can process large arrays of data in real time. Data parallelism will become a requirement practically everywhere.
The same goes for high performance computing. For example, with GPU-equipped systems, we're seeing HPC codes like seismic analysis or molecular dynamics accelerated by up to two orders of magnitude compared to CPU-only systems. The extra computing power is opening up HPC applications to a much larger audience. At the high end, the Cell-based Roadrunner has put the petaflop supercomputer on the map, and NVIDIA GPU-accelerated supers are on the drawing board.
The rise of multimedia applications and the growth of HPC mean that data parallel processors are aimed at some of the hottest markets. True, it will be multimedia that drives volume, but HPC will help to pull these processors up the performance curve, as it has done with the Cell processor. Every chip vendor is aware of this. The processor realignment explains why AMD bought ATI, why NVIDIA is expanding its lineup for the mobile and HPC markets, why Intel is making a foray into high-end visual computing with Larrabee, and why IBM is quickly constructing an ecosystem around the Cell processor.
As TG Daily's Theo Valich pointed out, it appears that for the first time GPUs will be manufactured on a smaller process node than CPUs. According to him, both NVIDIA and AMD will use Taiwan Semiconductor Manufacturing Company fabs to start churning out GPU silicon on the 40nm process node in early 2009. Intel CPUs are currently at 45nm, and their move to 32nm is unlikely to happen until the second half of 2009. The five-nanometer edge for GPUs would be mostly symbolic, but as Valich notes, AMD and NVIDIA will probably make a big deal about it.
So where is this leading? Eadline believes the optimal platform for highly parallel (predictable) applications will turn out to be a single general-purpose core hooked up to some number of parallel processing engines. The Cell processor, with a PowerPC core surrounded by eight SPEs, is the current example. Larrabee will likely be a more tightly integrated version of this, with a wide SIMD unit integrated into each core -- more like a vector-enhanced manycore CPU. AMD and NVIDIA are dabbling with CPU-GPU integrated chips, but the first generation is aimed at the low end (mobile clients). There are no public plans to integrate a CPU core into AMD's FireStream or NVIDIA's Tesla HPC platforms.
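The "one control core plus several parallel engines" arrangement can be sketched in a few lines of Python. This is only a structural analogy under stated assumptions: the names `control_core` and `engine_kernel` are hypothetical, the eight workers simply mirror the eight SPEs mentioned above, and Python threads stand in for hardware engines without delivering real parallel speedup.

```python
from concurrent.futures import ThreadPoolExecutor

NUM_ENGINES = 8  # assumption: mirrors the eight SPEs of the Cell processor


def engine_kernel(chunk):
    """Work done by one parallel engine: a simple sum of squares."""
    return sum(v * v for v in chunk)


def control_core(data, num_engines=NUM_ENGINES):
    """General-purpose core: partition the data, dispatch it, combine results."""
    size = max(1, (len(data) + num_engines - 1) // num_engines)
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ThreadPoolExecutor(max_workers=num_engines) as pool:
        # Each chunk is processed independently by one "engine".
        partials = list(pool.map(engine_kernel, chunks))
    # The control core performs the final serial reduction.
    return sum(partials)
```

The design point is the division of labor: the control logic (partitioning, dispatch, final reduction) stays on the general-purpose core, while the predictable inner loop runs on the engines.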
The discrete CPU will be around for a while, though. There is plenty of non-technical software that just needs a handful of cores -- or even just one -- to run at peak efficiency. Vanilla desktop systems and virtualized enterprise servers, equipped with multicore CPUs, will handle these apps just fine. It's the cutting-edge applications that will require these new massively parallel architectures.
In August, a number of conferences will feature some of the latest goings-on in the data parallel realm. SIGGRAPH, the HOT CHIPS symposium, the Intel Developer Forum, and NVIDIA's NVISION 08 conference will have a lot to say about the new processor landscape and how it's being shaped by emerging applications. I'll be following the events over the next few weeks and giving you my take on the developments.
Posted by Michael Feldman - July 30, 2008 @ 9:00 PM, Pacific Daylight Time
Michael Feldman is the editor of HPCwire.