From the Editor | Main Blog Index
February 02, 2007
Last Friday Intel demonstrated x86 processors with twice the transistor density of its current designs. Using 45nm process technology and new transistor materials, the company said it has achieved a significant breakthrough in transistor design and that this technology will be incorporated into production microprocessors in the second half of 2007. Although IBM also announced its plans for 45nm processors last Friday, Intel will be the first chipmaker to deliver the new technology into the marketplace.
The technology that makes the move to 45nm possible involves new materials that dramatically reduces transistor leakage -- an increasingly vexing problem with each new processor shrink. The new 45nm processors will use a hafnium-based high-k insulator in the gate dielectric and metal gate electrodes to reduce leakage significantly. The new materials replace silicon dioxide as the gate dialectric and polysilicon as the gate electrode, two compounds which have been used in transistors for decades. The new combo will enable Moore's Law to continue for at least a couple more processor generations before even more advanced technology is required.
It seems fitting that Intel is the chipmaker leading the charge for Moore's Law. In 1965, Gordon Moore, the company's co-founder, made the empirical observation that the number of transistors on an integrated circuit would double every two years (since updated to every 12-18 months). It was a self-fulfilling prophecy. He has not only lived to see the Law upheld for over forty years, but also the replacement of the underlying transistor technology, which had remained unchanged for almost as long.
According to Moore, "the implementation of high-k and metal materials marks the biggest change in transistor technology since the introduction of polysilicon gate MOS transistors in the late 1960s."
Intel says that the new technology can be used to increase the performance or decrease the power requirements on the new chips -- something that had been increasingly hard to balance as transistor leakage started to become more serious in the last couple of processor shrinks. The engineers tell us the new technology provides a 30 percent reduction in transistor switching power and more than a 10x reduction in gate leakage. With the revamped transistor materials, Intel can be much more aggressive with dialing the clock up for higher performance or dialing it down to increase energy efficiency.
Using similar transistor materials to Intel, IBM has also been designing 45nm circuitry for some time, but it won't be begin production of processors based on that technology until 2008. This puts it at least six months behind Intel (probably more), which means IBM and its partners -- AMD, Toshiba, Sony and others -- will be playing catchup once again in the process technology race. AMD in particular stands to suffer, since its x86 processors go head-to-head against Intel's offerings. Since AMD has to fight the transistor technology battle through its IBM proxy IBM, it doesn't have as much control as Intel does when it comes to coordinating the process technology with its processor design.
Nevertheless, AMD is looking to make up for some lost ground in 2007. When the quad-core Opteron 'Barcelona' chip (65nm) is released this summer, AMD engineers claim they will have a 40 percent performance advantage over the comparable Intel 'Clovertown' processor.
But that advantage might be short-lived. Conceivably, Intel could roll out a 45nm quad-core processor this year. That chip would likely to be more powerful than the 65nm Clovertown or Barcelona, and perhaps more energy efficient as well. The 45nm process size gives Intel some options that compensate for their older front-side-bus and off-chip memory controller. With smaller transistors, extra die space can be used to add more cache (less memory accesses) or more CPU intelligence to increase performance.
Intel claims it is now a full year ahead of its competitors in process technology. While AMD hardware still retains an advantage in scaled up systems, it can't afford to fall too far behind Intel in the fundamentals. The typical cycle has been for Intel to claim the advantage when it moves to a new process technology and then lose it when AMD's processors reach the same level. If AMD falls more than a process generation behind, it won't be able to play leap-frog anymore.
HPC vendors that rely on AMD for their clusters or supercomputers may begin to fret if the processor technology gets too far behind the curve. Cray, in particular, has a long-term arrangement with AMD to use Opterons in its next generation supercomputers through the end of the decade. Other HPC OEMs that offer clustered servers with either Xeons or Opterons have more flexibility, but the dual-vendor arrangement has been valuable in and of itself to help keep the x86 market competitive.
AMD can take some comfort in the fact that transistor technology advancements do not drive the industry the way they did 10 years ago, when CPU speed was everything. Today, memory bandwidth has become such a limiting factor in relation to CPU performance that processor architecture and system architecture is at least as important. This is especially true in the scaled up multi-processor machines that inhabit the datacenters of high performance computing users. And it is here that AMD, with its HyperTransport bus technology and integrated memory controller, still claims the advantage over its rival.
Also, by using silicon-on-insulator (SOI) and strained silicon technologies for its chips, AMD is able to compensate for its larger transistor sizes. SOI is more expensive to manufacture, but offers higher performance than conventional silicon. It's one of the reasons why AMD processors are so competitive with their Intel counterparts once they reach the same transistor geometries.
Intel has committed to maintain a two-year technology cadence -- their so-called 'tick-tock' model -- that encompasses a processor shrink (tick) followed by a new microarchitecture design (tock). It can be a challenge to line up this model with the demands of the various x86 markets. A Centrino-based laptop might be replaced every two years, while a Xeon-based supercomputer is more likely to have at least a five or ten year life. Superimposed on this is the fact that the laptop user may have very little use for more cores, extra cache or better performance if existing consumer software can't take advantage of it. On the other hand, the supercomputer user is much more likely to use all of these capabilities.
Eventually the marketplace will decide the value of processor technology advancements. Both Intel and AMD have to navigate the diverse markets they serve with the x86, but ultimately they are counting on the OEMs and their customers to join them on the technology treadmill.
-----
As always, comments about HPCwire are welcomed and encouraged. Write to me, Michael Feldman, at editor@hpcwire.com.
Posted by Michael Feldman - February 01, 2007 @ 9:00 PM, Pacific Standard Time
![]()
Michael Feldman is the editor of HPCwire.
No Recent Blog Comments
Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
Read more...
Although Horst Simon was named Deputy Director of Lawrence Berkeley National Laboratory, he maintains his strong ties to the scientific computing community as an editor of the TOP500 list and as an invited speaker at conferences.
Read more...
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...
May 09, 2013 |
The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.