Bits and Bytes From IDF

By Michael Feldman

April 20, 2007

The semi-annual Intel Developer Forum (IDF) took place in Beijing, China this week, a venue chosen to signify the importance of China to Intel. Not only is the Chinese IT market one of the most rapidly growing in the world, but Intel also has very large business commitments there. Currently, the company employs over 6000 employees in the country and has substantial investments in R&D and microprocessor assembly and testing. Intel recently announced they're building a $2.5 billion 300mm wafer fab in Dalian, bringing the company's total stake in China to 3.8 billion. So yes, China is important to them.

The IDF event gives Intel the opportunity to keep its customers and partners happy and titillate industry watchers. For those of us with limited travel budgets, Intel supplied an analyst/press phone briefing of the Beijing proceedings, hosted by Sean Maloney, Intel's chief sales and marketing officer. Although no big announcements were forthcoming at IDF, the company managed to pique my interest in a few areas.

Since Intel is between its “tick-tock” product cycle, having released most of the Core 65nm “tock” offerings, a lot of the talk at IDF was looking at the upcoming “tick” products — the Penryn processor family. With initial product releases scheduled for the latter half of this year, the Penryn processors will be implemented on the companies recently announced 45nm High-K plus metal gate silicon process technology. And just in case anyone has forgotten about the importance of this technology in the past few months, Maloney reminded us:

“We believe we've achieved a significant breakthrough in transistor technology by developing the High-K metal gate transistors … the biggest breakthrough in transistor technology in the last 40 years.”

The shrink to 45nm will give the Penryn products twice the transistor budget of the 65nm products and support higher clock speeds, lower power, or some combination of the two. It looks like Intel will be pushing the clock whenever possible. The company provided some preliminary benchmarks with a pre-production 3.3 GHz processor with a 1600 MHz front side bus. For HPC and workstation systems running bandwidth-intensive applications they achieved a 45 percent performance increase compared to the current quad-core Intel Xeon X5355 processor. Not bad, but there was no mention of the benchmark being used.

The new cores will also support 47 new SSE4 instructions, which are designed to accelerate applications involving digital media processing, memory-intensive workloads, HPC workloads and text searching, to name a few. According to Intel, SSE4 is their largest ISA extension in terms of scope impact since SSE2. Jon Stokes details some of the new capabilities of the SSE4 technology in an article published in Ars Technica.

The new 45nm “tock” offerings, represented by the Nehalem processor family, is slated to start production in 2008. These chips will contain up to 8 cores. Simultaneous multi-threading, an evolution of Intel's Hyper-threading technology, will also be included in the Nehalem cores. At least some of the these processors will include a new system interconnect (presumably Intel's forthcoming Common System Interconnect) as well as on-chip memory controllers, an architecture that mimics AMD's Opteron processor.

Speaking of the Common System Interconnect (CSI), on Tuesday at the Gelato ICE conference in San Jose, Intel released a few details about how this technology will fit into their Itanium roadmap. According to Jim Fister in Intel's Digital Enterprise group, Tukwilla, the four-core Itanium processor slated for 2008, will be the first chip to incorporate CSI, replacing the front-side bus that has been the mainstay of Intel processors. EETimes has posted a nice article from Rick Merrit about this topic.

Back to Beijing. In another nod to AMD design, Intel outlined its QuickAssist technology, which appears to be an evolution of their Geneseo initiative. QuickAssist will work analogously to AMD's Torrenza technology, allowing third-party coprocessors to link to Intel processors. In Intel's version, their interconnect platform is based on PCI Express, rather than HyperTransport. QuickAssist is designed to be used for specialized accelerators that target math-intensive, graphics, or embedded content workloads.

Maybe the biggest non-announcement announcement of the IDF event was the admission of the Intel's Larrabee technology, something that had been rumored for months now. But during Intel's press briefing, Maloney refuse to give any details on the technology, dodging every question on the topic. Here's the minimalist quote from the press release:

“Intel has begun planning products based on a highly parallel, IA-based programmable architecture codenamed 'Larrabee.' It will be easily programmable using many existing software tools, and designed to scale to trillions of floating point operations per second (Teraflops) of performance. The Larrabee architecture will include enhancements to accelerate applications such as scientific computing, recognition, mining, synthesis, visualization, financial analytics and health applications.”

The only new factoid here is that the silicon would be compatible with current Intel products. Reports from IDF about Pat Gelsinger's comments on Larrabee revealed a few more tidbits. Gelsinger, the senior vice president and co-general manager of Intel's Digital Enterprise Group, said Larrabee products would support varying numbers of processing cores, depending on the targeted application, and will be based on the x86 ISA.

In the past, industry analysts had speculated that Larrabee referred to some kind of high-end GPU silicon that would go head-to-head with AMD and NVIDIA graphics processors that are aimed at GPGPU (general purpose processing with GPUs) workloads. But Gelsinger dismissed the GPGPU model, stating that graphics processors are not general purpose.

“[T]his will end the GPGPU debate,” wrote Gelsinger in a blog entry posted on Tuesday. “Solving the programmability of any such highly parallel machines is the key problem and IA programmability is the solution.”

In any case, the Larrabee technology may end up looking similar to the GPGPU platforms envisioned by AMD and NVIDIA, namely, processors that incorporate elements of CPUs with elements of GPUs. Reading between the lines, it looks like Intel could be thinking of a multicore streaming SIMD architecture based on their native SSE instruction set — think multiple SSE units. The SIMD cores could be isolated on their own die or mixed with non-SSE x86 silicon in some fashion. However it's implemented, Larrabee looks like the basis of a hardware platform designed to run RMS (Recognition, Mining, and Synthesis) applications, Intel's vision of next-generation workloads for 2010 and beyond.

—–

As always, comments about HPCwire are welcomed and encouraged. Write to me, Michael Feldman, at [email protected].

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Cray, Fujitsu Both Bringing Fujitsu A64FX-based Supercomputers to Market in 2020

November 12, 2019

The number of top-tier HPC systems makers has shrunk due to a steady march of M&A activity, but there is increased diversity and choice of processing components with Intel Xeon, AMD Epyc, IBM Power, and Arm server ch Read more…

By Tiffany Trader

SIA Recognizes Robert Dennard with 2019 Noyce Award

November 12, 2019

If you don’t know what Dennard Scaling is, the chances are strong you don’t labor in electronics. Robert Dennard, longtime IBM researcher, inventor of the DRAM and the fellow for whom Dennard Scaling was named, is th Read more…

By John Russell

Leveraging Exaflops Performance to Remediate Nuclear Waste

November 12, 2019

Nuclear waste storage sites are a subject of intense controversy and debate; nobody wants the radioactive remnants in their backyard. Now, a collaboration between Berkeley Lab, Pacific Northwest National University (PNNL Read more…

By Oliver Peckham

Using HPC and Machine Learning to Predict Traffic Congestion

November 12, 2019

Traffic congestion is a never-ending logic puzzle, dictated by commute patterns, but also by more stochastic accidents and similar disruptions. Traffic engineers struggle to model the traffic flow that occurs after accid Read more…

By Oliver Peckham

Mira Supercomputer Enables Cancer Research Breakthrough

November 11, 2019

Dynamic partial-wave spectroscopic (PWS) microscopy allows researchers to observe intracellular structures as small as 20 nanometers – smaller than those visible by optical microscopes – in three dimensions at a mill Read more…

By Staff report

AWS Solution Channel

Making High Performance Computing Affordable and Accessible for Small and Medium Businesses with HPC on AWS

High performance computing (HPC) brings a powerful set of tools to a broad range of industries, helping to drive innovation and boost revenue in finance, genomics, oil and gas extraction, and other fields. Read more…

IBM Accelerated Insights

IBM Adds Support for Ion Trap Quantum Technology to Qiskit

November 11, 2019

After years of percolating in the shadow of quantum computing research based on superconducting semiconductors – think IBM, Rigetti, Google, and D-Wave (quantum annealing) – ion trap technology is edging into the QC Read more…

By John Russell

Cray, Fujitsu Both Bringing Fujitsu A64FX-based Supercomputers to Market in 2020

November 12, 2019

The number of top-tier HPC systems makers has shrunk due to a steady march of M&A activity, but there is increased diversity and choice of processing compon Read more…

By Tiffany Trader

IBM Adds Support for Ion Trap Quantum Technology to Qiskit

November 11, 2019

After years of percolating in the shadow of quantum computing research based on superconducting semiconductors – think IBM, Rigetti, Google, and D-Wave (quant Read more…

By John Russell

Tackling HPC’s Memory and I/O Bottlenecks with On-Node, Non-Volatile RAM

November 8, 2019

On-node, non-volatile memory (NVRAM) is a game-changing technology that can remove many I/O and memory bottlenecks and provide a key enabler for exascale. Th Read more…

By Jan Rowell

MLPerf Releases First Inference Benchmark Results; Nvidia Touts its Showing

November 6, 2019

MLPerf.org, the young AI-benchmarking consortium, today issued the first round of results for its inference test suite. Among organizations with submissions wer Read more…

By John Russell

Azure Cloud First with AMD Epyc Rome Processors

November 6, 2019

At Ignite 2019 this week, Microsoft's Azure cloud team and AMD announced an expansion of their partnership that began in 2017 when Azure debuted Epyc-backed ins Read more…

By Tiffany Trader

Nvidia Launches Credit Card-Sized 21 TOPS Jetson System for Edge Devices

November 6, 2019

Nvidia has launched a new addition to its Jetson product line: a credit card-sized (70x45mm) form factor delivering up to 21 trillion operations/second (TOPS) o Read more…

By Doug Black

In Memoriam: Steve Tuecke, Globus Co-founder

November 4, 2019

HPCwire is deeply saddened to report that Steve Tuecke, longtime scientist at Argonne National Lab and University of Chicago, has passed away at age 52. Tuecke Read more…

By Tiffany Trader

Spending Spree: Hyperscalers Bought $57B of IT in 2018, $10B+ by Google – But Is Cloud on Horizon?

October 31, 2019

Hyperscalers are the masters of the IT universe, gravitational centers of increasing pull in the emerging age of data-driven compute and AI.  In the high-stake Read more…

By Doug Black

Supercomputer-Powered AI Tackles a Key Fusion Energy Challenge

August 7, 2019

Fusion energy is the Holy Grail of the energy world: low-radioactivity, low-waste, zero-carbon, high-output nuclear power that can run on hydrogen or lithium. T Read more…

By Oliver Peckham

Using AI to Solve One of the Most Prevailing Problems in CFD

October 17, 2019

How can artificial intelligence (AI) and high-performance computing (HPC) solve mesh generation, one of the most commonly referenced problems in computational engineering? A new study has set out to answer this question and create an industry-first AI-mesh application... Read more…

By James Sharpe

Cray Wins NNSA-Livermore ‘El Capitan’ Exascale Contract

August 13, 2019

Cray has won the bid to build the first exascale supercomputer for the National Nuclear Security Administration (NNSA) and Lawrence Livermore National Laborator Read more…

By Tiffany Trader

DARPA Looks to Propel Parallelism

September 4, 2019

As Moore’s law runs out of steam, new programming approaches are being pursued with the goal of greater hardware performance with less coding. The Defense Advanced Projects Research Agency is launching a new programming effort aimed at leveraging the benefits of massive distributed parallelism with less sweat. Read more…

By George Leopold

AMD Launches Epyc Rome, First 7nm CPU

August 8, 2019

From a gala event at the Palace of Fine Arts in San Francisco yesterday (Aug. 7), AMD launched its second-generation Epyc Rome x86 chips, based on its 7nm proce Read more…

By Tiffany Trader

D-Wave’s Path to 5000 Qubits; Google’s Quantum Supremacy Claim

September 24, 2019

On the heels of IBM’s quantum news last week come two more quantum items. D-Wave Systems today announced the name of its forthcoming 5000-qubit system, Advantage (yes the name choice isn’t serendipity), at its user conference being held this week in Newport, RI. Read more…

By John Russell

Ayar Labs to Demo Photonics Chiplet in FPGA Package at Hot Chips

August 19, 2019

Silicon startup Ayar Labs continues to gain momentum with its DARPA-backed optical chiplet technology that puts advanced electronics and optics on the same chip Read more…

By Tiffany Trader

Crystal Ball Gazing: IBM’s Vision for the Future of Computing

October 14, 2019

Dario Gil, IBM’s relatively new director of research, painted a intriguing portrait of the future of computing along with a rough idea of how IBM thinks we’ Read more…

By John Russell

Leading Solution Providers

ISC 2019 Virtual Booth Video Tour

CRAY
CRAY
DDN
DDN
DELL EMC
DELL EMC
GOOGLE
GOOGLE
ONE STOP SYSTEMS
ONE STOP SYSTEMS
PANASAS
PANASAS
VERNE GLOBAL
VERNE GLOBAL

Intel Confirms Retreat on Omni-Path

August 1, 2019

Intel Corp.’s plans to make a big splash in the network fabric market for linking HPC and other workloads has apparently belly-flopped. The chipmaker confirmed to us the outlines of an earlier report by the website CRN that it has jettisoned plans for a second-generation version of its Omni-Path interconnect... Read more…

By Staff report

Kubernetes, Containers and HPC

September 19, 2019

Software containers and Kubernetes are important tools for building, deploying, running and managing modern enterprise applications at scale and delivering enterprise software faster and more reliably to the end user — while using resources more efficiently and reducing costs. Read more…

By Daniel Gruber, Burak Yenier and Wolfgang Gentzsch, UberCloud

Dell Ramps Up HPC Testing of AMD Rome Processors

October 21, 2019

Dell Technologies is wading deeper into the AMD-based systems market with a growing evaluation program for the latest Epyc (Rome) microprocessors from AMD. In a Read more…

By John Russell

Intel Debuts Pohoiki Beach, Its 8M Neuron Neuromorphic Development System

July 17, 2019

Neuromorphic computing has received less fanfare of late than quantum computing whose mystery has captured public attention and which seems to have generated mo Read more…

By John Russell

Rise of NIH’s Biowulf Mirrors the Rise of Computational Biology

July 29, 2019

The story of NIH’s supercomputer Biowulf is fascinating, important, and in many ways representative of the transformation of life sciences and biomedical res Read more…

By John Russell

Xilinx vs. Intel: FPGA Market Leaders Launch Server Accelerator Cards

August 6, 2019

The two FPGA market leaders, Intel and Xilinx, both announced new accelerator cards this week designed to handle specialized, compute-intensive workloads and un Read more…

By Doug Black

When Dense Matrix Representations Beat Sparse

September 9, 2019

In our world filled with unintended consequences, it turns out that saving memory space to help deal with GPU limitations, knowing it introduces performance pen Read more…

By James Reinders

With the Help of HPC, Astronomers Prepare to Deflect a Real Asteroid

September 26, 2019

For years, NASA has been running simulations of asteroid impacts to understand the risks (and likelihoods) of asteroids colliding with Earth. Now, NASA and the European Space Agency (ESA) are preparing for the next, crucial step in planetary defense against asteroid impacts: physically deflecting a real asteroid. Read more…

By Oliver Peckham

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This