AMD : The Integration Revolution?

By Nebojsa Novakovic

May 7, 2012

By Nebojsa Novakovic

The potential–and challenges of multi-core processing

In recent years, microprocessor designers began hitting the limitations of the single-core architecture. So they made the shift to power-efficient multi-core designs. Now they’re running up against the limitations of this format, as programming multi-core processors becomes increasingly complex. One path forward is to synergistically combine the potential of both CPU and GPU units. But integrating two very high powered processing units with somewhat differing performance and bandwidth requirements can pose interesting challenges to the overall system architecture.

AMD is proposing to solve these issues with a combination of hardware solutions and system-level programming tools, ushering in a new era of what has become known as heterogeneous computing.

AMD Fig. 1 

The promise of Heterogeneous System Architecture (HSA)

While it has been recognized for some time that GPUs can be used to do parallel processing, the programmer’s task has been difficult if not extraordinary. That’s where AMD’s Heterogeneous System Architecture (HSA) comes in. HSA enables a new way to program applications using the GPU that can make it easier for mainstream programmers. AMD’s HSA is a full solution approach, enabling mainstream programmers to write parallel processing code as easily for the GPU as the CPU. And in some cases, the code may be able to execute on either the CPU or GPU, based on the system’s resources.

One way HSA can help solve the problem is by providing a unified address space for the CPU and GPU. With HSA, GPUs support the same page tables x86 CPUs use for mapping program memory pages to physical memory. Now GPUs can use a much larger memory map and, more importantly, a pointer is usually the same for code running on the CPU and code running on the GPU. The latter allows one copy of data to exist in memory and both the CPU and GPU can act upon it. The programmer doesn’t have to manage two or more copies of the same data. This design also helps improve performance because it is no longer necessary to make copies and keep them synchronized.

AMD is proposing an open platform architecture for HSA with published specifications. HSA will have a virtual ISA known as HSAIL (HSA Intermediate Language), a memory model, and a system specification. AMD is working with hardware, operating system, tools, and application companies to form an HSA foundation to guide the architectural development into the future.

Among the programmers I talk to, there is a great deal of excitement in being able to obtain that huge untapped performance potential from GPUs via HSA in an easy and transparent way, something that was difficult, if not impossible, until now

Combining the CPU and GPU: Bringing the APU to life

A hardware merger of the CPU and GPU, AMD’s Accelerated Processing Units (APUs) provide streamlined hardware-level integration of these two processing units. By the time of the AMD Fusion12 Developer Summit (June 11-14, 2012), AMD will have introduced “Trinity,” a second generation APU in market, along with the two first generation products AMD A-Series and AMD C and E-Series APUs formerly codenamed “Llano,” and “Brazos.”

The “Brazos”-based AMD C and E-Series APU combines an ultra-low power dual core CPU with an entry-level AMD Radeon™ GPU. Variants of these APUs are targeted at tablets, fanless notebooks, entry-level notebooks, and entry-level desktops. The “Llano”-based AMD A-Series APU combines two or four “Husky” CPU cores with a mid-range, AMD Radeon™ HD 6500-series discrete-class GPU. Husky cores are the next generation of the cores from the popular AMD Phenom™ processor series of AMD CPUs. The “Trinity”-based AMD A-Series variants target mainstream notebooks and mainstream desktops with good CPU performance and industry leading integrated graphics and video capabilities.

All three AMD APU families benefit from the greatly increased speed of communications between the CPU and GPU. Both the bandwidth and the latency for response are improved. All support DDR3 memory, DirectX 11 graphics, and have dedicated hardware for video playback. AMD C and E-Series has a single memory channel while AMD A-Series for both Llano and Trinity have dual memory channel support. To fully handle the memory bandwidth requirements of the larger GPU units in the A-Series of Llano and Trinity, DDR3 memory speeds up to 1600 and 1866 are supported, respectively.

The APU impact on system design: board, memory, graphics, and form factor

For a mainstream PC solution, the APU enables you to fit a quad core DirectX 11 3D gaming system with all the familiar features into a form-factor smaller than a Mini-ITX. Even if you add PCIe® expansion slots, there is still plenty of space for a Mini-ITX format solution; even a Pico-ITX form factor should be possible.

This opens up a design choice: One option is to stick with the default spec. The result will be smaller than most current TV set top boxes with near zero noise. The other choice is to stick with the Mini-ITX format, but provide performance enhancements through the use of a better cooling solution (to enable overclocking), adding faster memory options, and adding AMD Radeon™ Dual Graphics.

There were numerous online website reviews of the first generation APU desktop platforms discussing the various memory choices. Based on the current product performance and return on the memory speed investment, DDR3-1866 CL9 DIMMs are the best memory choice providing an outstanding performance per dollar. For the second generation APU, a DDR3-2133 CL10 or better memory should strike a good balance.

In summary, the AMD APU’s integration, balanced performance, expandability, and power usage enables new ultra-compact form factors for complete systems. The power consumption savings that can come from running a near teraflop of performance with less than 100W power can enable much “greener” high-end machines–up to the supercomputer range.

The APU impact in tablet and netbook space

The lowest power version of the AMD E-Series APU is suitable for an x86 tablet, which can run Windows 8 and avoid the dependence on application stores and such centralized resources. The added CPU and graphics horsepower of the APU enables new, productive form factors for tablets. How about a 11” or 12” inch full HD+ tablet with 16:10, 1920×1200 or even 3:2, 1920×1280 screen? Not only are these far more productive than movie screen 16:9 displays, but oriented in portrait mode they can emulate a printed page. Future ultrathin versions of quad-core AMD APUs in newer processes are designed to enable ultrahigh resolution 3D tablets that can also substitute for a proper PC.

Taking the APU further

A powerful GPU closely tied to the CPU not only benefits 3D graphics applications, but also applications with intense parallel computation. Examples include ultrafast large spreadsheet calculation, database manipulation, and media creation. On the larger scale, an APU-powered petaflop machine spread could enable more affordable supercomputing and large data analysis for many more users. An APU-based supercomputer could achieve its floating-point performance rating at one-third the power of the usual purely x86 CPU-based one, a massive advantage when combined with the right software.

Looking forward

The AMD APU and HSA approaches are revolutionary to programmers and users. AMD architecture changes not just the PC processor architecture, but the system design. That may not be obvious when looking at the first generation APUs. However, as AMD further develops APUs the benefit is bound to become obvious.

The HSA programming model will open new worlds of opportunities for the programmers, challenging them to harness the new performance potential. The more intricate interdependencies and benefits from that integration also will require system builders and designers to put more thought into maximizing their systems’ competitiveness. Over time, the benefits of the APU should spread top to bottom, from the supercomputer to the smartphone.

About the author: Based in Singapore, Nebojsa Novakovic is a strategic advisor to VR-Zone.com, Asia Pacific editor for TheInquirer.net, and frequently writes on high-end computing, system architecture, processors, 3-D graphics and related subjects.

Ready to learn more? AMD’s Fusion12 Developer Summit unites the industry’s experts in the world of heterogeneous computing. Held June 11-14, 2012, in Bellevue, Washington, the event provides deep, actionable content across ten tracks, covering heterogeneous computing as it relates to multimedia, graphics, cloud computing, security, big data, and more. Whether you’re responsible for planning or development, you’ll find the tools, knowledge, and resources you need to take advantage of this new era of computing. Learn more at amd.com/afds.

 

 

This paper is sponsored by AMD
© 2012 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Other names are for identification purposes only and may be trademarks of their respective owners.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

SC23: The Ethics of Supercomputing

November 29, 2023

Why should HPC practitioners care about ethics? And, what are our ethics in HPC? These questions were central to a lively discussion at the SC23 Birds-of-a-Feather (BoF) session: With Great Power Comes Great Responsib Read more…

Grace Hopper’s Big Debut in AWS Cloud While Gravaton4 Launches

November 29, 2023

Editors Note: Additional Coverage of the AWS-Nvidia 65 Exaflop ‘Ultra-Cluster’ and Gravitron4 can be found on our sister site Datanami. Amazon Web Services will soon be home to a new Nvidia-built supercomputer tha Read more…

Give a Little (on Tuesday), Get a Lot

November 28, 2023

HPC is built on open source. While building HPC systems with "open plumbing" has enormous advantages, there can also be some challenges. As illustrated in the classic XKCD comic, the entire dependency tree of many usefu Read more…

re:Invent 2023: AWS Talks a Little Quantum, Showcases Error Correction Progress

November 28, 2023

Quantum computing held sway in the last few minutes of AWS senior vice president Peter DeSantis’ keynote yesterday at the AWS re:Invent 2023 conference, being held in Las Vegas this week. While scarce on details, DeSan Read more…

Analyst Panel Says Take the Quantum Computing Plunge Now…

November 27, 2023

Should you start exploring quantum computing? Yes, said a panel of analysts convened at Tabor Communications HPC and AI on Wall Street conference earlier this year. Without doubt, the quantum computing landscape remai Read more…

AWS Solution Channel

Deploying AI/ML at the Edge with Omniflow’s Sustainable Smart Lamppost, NVIDIA, and AWS

Imagine a world where a lamppost does more than just illuminate streets; it actively contributes to a smarter, safer, and more sustainable community. Using Amazon Web Services (AWS) and NVIDIA technologies, Omniflow is turning this vision into a reality. Read more…

QCT Solution Channel

QCT and Intel Codeveloped QCT DevCloud Program to Jumpstart HPC and AI Development

Organizations and developers face a variety of issues in developing and testing HPC and AI applications. Challenges they face can range from simply having access to a wide variety of hardware, frameworks, and toolkits to time spent on installation, development, testing, and troubleshooting which can lead to increases in cost. Read more…

SC23 HPC Student Cluster Smackdown

November 21, 2023

Since 2007, the Student Cluster Competition (SCC) has provided an international multi-day contest for the best and brightest university HPC teams. This year, the in-person event was held at SC23 in Denver from November 1 Read more…

SC23: The Ethics of Supercomputing

November 29, 2023

Why should HPC practitioners care about ethics? And, what are our ethics in HPC? These questions were central to a lively discussion at the SC23 Birds-of-a-Fe Read more…

Grace Hopper’s Big Debut in AWS Cloud While Gravaton4 Launches

November 29, 2023

Editors Note: Additional Coverage of the AWS-Nvidia 65 Exaflop ‘Ultra-Cluster’ and Gravitron4 can be found on our sister site Datanami. Amazon Web Servic Read more…

Analyst Panel Says Take the Quantum Computing Plunge Now…

November 27, 2023

Should you start exploring quantum computing? Yes, said a panel of analysts convened at Tabor Communications HPC and AI on Wall Street conference earlier this y Read more…

SCREAM wins Gordon Bell Climate Prize at SC23

November 21, 2023

The first Gordon Bell Prize for Climate Modeling was presented at SC23 in Denver. The award went to a team led by Sandia National Laboratories that had develope Read more…

SC23 BOF: Inclusivity Progress and Challenges

November 21, 2023

New to SC23 was a series of talks on Inclusivity topics. Sponsored by the Inclusivity Committee and open to all conference attendees, these 90-minute birds-of-a Read more…

Supercomputing 2023: Odds and Ends from the Show

November 20, 2023

This year's fantastic Supercomputing 2023 was back in full form. Attendees seemed to be glad that the show was back in Denver, which was a preferred destination Read more…

Material Simulation with Quantum Accuracy Wins 2023 ACM Gordon Bell Prize

November 20, 2023

Accurately calculating interactions among electrons has been a significant obstacle to reliable material exploration and design through computer modeling. Recen Read more…

Shutterstock 1086444218

HPC Hardware Contracts: Backlash as Security Ignored in Performance Pursuit

November 16, 2023

The security of supercomputers has been grossly ignored in the pursuit of horsepower. Still, there is a growing realization that security is needed to prevent b Read more…

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

Leading Solution Providers

Contributors

SC23 Booth Videos

AMD @ SC23
AWS @ SC23
Altair @ SC23
CoolIT @ SC23
Cornelis Networks @ SC23
CoreHive @ SC23
DDC @ SC23
HPE @ SC23 with Justin Hotard
HPE @ SC23 with Trish Damkroger
Intel @ SC23
Intelligent Light @ SC23
Lenovo @ SC23
Penguin Solutions @ SC23
QCT Intel @ SC23
Tyan AMD @ SC23
Tyan Intel @ SC23
HPCwire LIVE from SC23 Playlist

CORNELL I-WAY DEMONSTRATION PITS PARASITE AGAINST VICTIM

October 6, 1995

Ithaca, NY --Visitors to this year's Supercomputing '95 (SC'95) conference will witness a life-and-death struggle between parasite and victim, using virtual Read more…

SGI POWERS VIRTUAL OPERATING ROOM USED IN SURGEON TRAINING

October 6, 1995

Surgery simulations to date have largely been created through the development of dedicated applications requiring considerable programming and computer graphi Read more…

U.S. Will Relax Export Restrictions on Supercomputers

October 6, 1995

New York, NY -- U.S. President Bill Clinton has announced that he will definitely relax restrictions on exports of high-performance computers, giving a boost Read more…

Dutch HPC Center Will Have 20 GFlop, 76-Node SP2 Online by 1996

October 6, 1995

Amsterdam, the Netherlands -- SARA, (Stichting Academisch Rekencentrum Amsterdam), Academic Computing Services of Amsterdam recently announced that it has pur Read more…

Cray Delivers J916 Compact Supercomputer to Solvay Chemical

October 6, 1995

Eagan, Minn. -- Cray Research Inc. has delivered a Cray J916 low-cost compact supercomputer and Cray's UniChem client/server computational chemistry software Read more…

NEC Laboratory Reviews First Year of Cooperative Projects

October 6, 1995

Sankt Augustin, Germany -- NEC C&C (Computers and Communication) Research Laboratory at the GMD Technopark has wrapped up its first year of operation. Read more…

Sun and Sybase Say SQL Server 11 Benchmarks at 4544.60 tpmC

October 6, 1995

Mountain View, Calif. -- Sun Microsystems, Inc. and Sybase, Inc. recently announced the first benchmark results for SQL Server 11. The result represents a n Read more…

New Study Says Parallel Processing Market Will Reach $14B in 1999

October 6, 1995

Mountain View, Calif. -- A study by the Palo Alto Management Group (PAMG) indicates the market for parallel processing systems will increase at more than 4 Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire