Visit additional Tabor Communication Publications
September 09, 2010
It seems like every time a chip vendor talks about its latest netbook processor, there are a flurry of articles about how such chips could be worked into a server. The impetus for this line of thinking is the power crisis in the datacenter. Processors targeted for netbooks and handheld consumer devices are ultra low-power and usually have a better performance per watt metric than your traditional server chippery. Not only that, these power-saving CPUs sell for just a fraction of the price of a typical server processor.
The most recent example of this line of thinking was precipitated by AMD's unveiling of its upcoming Bobcat microarchitecture last month at Hot Chips. Bobcat is the company's future core design destined for the netbook and notebook market. It wasn't long before articles like this from HotHardware.com showed up, suggesting that the new core design might be a great fit for ultra low-power servers and microblades.
The idea is that these power-sipping CPUs are especially efficient at scaled-out computing, where individual core performance is less important than the aggregate performance of the entire system. The idea, of course, is to offer the equivalent computational throughput for much less power than a conventional Opteron- or Xeon-based server. On paper that's true. Bobcat, for example, is advertised to offer a sub one-watt core with about 90 percent of the performance of a mainstream notebook chip. Certainly one would expect Bobcat-based CPUs to offer much better performance-per-watt numbers than their larger Opteron brethren.
In some cases, this creative thinking has gone somewhat further, that is, into actual product roadmaps. Earlier this summer, startup SeaMicro announced it was going to use an Intel 1.6 GHz Atom processor to power a new breed of low-power server. The SM10000 stuffs 512 Atom processors into a single box, while being able to run off-the-shelf applications and operating systems. SeaMicro's claim is that it can deliver comparable performance to a conventional x86 server, but use just a fourth of the power and space.
Meanwhile startup Smooth-Stone is looking to use ARM processors as the basis for another kind of low-power server. Mostly associated with cell phones and other mobile devices, the latest ARM chips will include support for both OS virtualization and the ability to address up to a terabyte of memory.
Given that there has been little experience with this type of computing, the application set for these ultra low-power servers is still a bit fuzzy. It appears that Dell and SeaMicro are aiming their offerings at cloud hosting, Web farms and other light-load applications. The practical consideration here is that single thread performance is not all that good for these under-powered chips, especially compared to a Xeon or Opteron processor. But applications that can be divvied up efficiently across many processors into independent lightweight tasks are perfect for this kind of computing.
On the other hand, where single-thread application performance is the bottleneck, execution times will suffer. Sure, power is expensive, but time is even more so. That makes most compute-bound workloads, including the vast majority of HPC apps, unsuitable for these lowly chips, with the possible exception of embarrassingly parallel codes.
That might be the end of it if it weren't for this GPGPU phenomenon. In this case, the CPU is used to drive the GPU, where the most compute-intensive piece of the application is executed. If enough of the app can be offloaded to the graphics accelerator, the CPU need not be all that muscular. Thus a power-sipping CPU might be the perfect companion to the power-hogging GPU.
In practice, though, I don't think we're quite there yet. From what I've gathered, the profile of many GPU-ported codes is such that they still rely on speedy CPUs for at least a portion of the application. It would be interesting for GPGPU developers to track execution cycles on the two processors, and determine how big a CPU is really required for a given code. It might even give some enterprising vendor an idea about how to build a better balanced GPGPU server.
Posted by Michael Feldman - September 09, 2010 @ 5:56 PM, Pacific Daylight Time
Michael Feldman is the editor of HPCwire.
No Recent Blog Comments
Contributing commentator, Andrew Jones, offers a break in the news cycle with an assessment of what the national "size matters" contest means for the U.S. and other nations...
Today at the International Supercomputing Conference in Leipzing, Germany, Jack Dongarra presented on a proposed benchmark that could carry a bit more weight than its older Linpack companion. The high performance conjugate gradient (HPCG) concept takes into account new architectures for new applications, while shedding the floating point....
Not content to let the Tianhe-2 announcement ride alone, Intel rolled out a series of announcements around its Knights Corner and Xeon Phi products--all of which are aimed at adding some options and variety for a wider base of potential users across the HPC spectrum. Today at the International Supercomputing Conference, the company's Raj....
Jun 18, 2013 |
The world's largest supercomputers, like Tianhe-2, are great at traditional, compute-intensive HPC workloads, such as simulating atomic decay or modeling tornados. But data-intensive applications--such as mining big data sets for connections--is a different sort of workload, and runs best on a different sort of computer.
Jun 18, 2013 |
Researchers are finding innovative uses for Gordon, the 285 teraflop supercomputer housed at the San Diego Supercomputer Center (SDSC) that has a unique Flash-based storage system. Since going online, researchers have put the incredibly fast I/O to use on a wide variety of workloads, ranging from chemistry to political science.
Jun 17, 2013 |
The advent of low-power mobile processors and cloud delivery models is changing the economics of computing. But just as an economy car is good at different things than a full size truck, an HPC workload still has certain computing demands that neither the fastest smartphone nor the most elastic cloud cluster can fulfill.
Jun 14, 2013 |
For all the progress we've made in IT over the last 50 years, there's one area of life that has steadfastly eluded the grasp of computers: understanding human language. Now, researchers at the Texas Advanced Computing Center (TACC) are utilizing a Hadoop cluster on its Longhorn supercomputer to move the state of the art of language processing a little bit further.
Jun 13, 2013 |
Titan, the Cray XK7 at the Oak Ridge National Lab that debuted last fall as the fastest supercomputer in the world with 17.59 petaflops of sustained computing power, will rely on its previous LINPACK test for the upcoming edition of the Top 500 list.
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
Join HPCwire Editor Nicole Hemsoth and Dr. David Bader from Georgia Tech as they take center stage on opening night at Atlanta's first Big Data Kick Off Week, filmed in front of a live audience. Nicole and David look at the evolution of HPC, today's big data challenges, discuss real world solutions, and reveal their predictions. Exactly what does the future holds for HPC?
Join our webinar to learn how IT managers can migrate to a more resilient, flexible and scalable solution that grows with the data center. Mellanox VMS is future-proof, efficient and brings significant CAPEX and OPEX savings. The VMS is available today.