Adapteva Unveils 64-Core Chip
Chipmaker Adapteva is sampling its 4th-generation multicore processor, known as Epiphany-IV. The 64-core chip delivers a peak performance of 100 gigaflops and draws just two watts of power, yielding a stunning 50 gigaflops/watt. The engineering samples were manufactured by GLOBALFOUNDRIES on its latest 28nm process technology.
Based in LEXINGTON, Massachusetts, Adapteva is in the business of developing ultra-efficient floating point accelerators. Andreas Olofsson, a former chip engineer at Texas Instruments and Analog Devices, founded the company in 2008, and gathered $2.5 million from various VCs and private investors. With that shoestring budget, he managed to produce four generations of the Epiphany architecture, including two actual chips. The technology is initially aimed at the mobile and embedded market, but Olofsson also has designs on penetrating the supercomputing space.
Epiphany is essentially a stripped down general-purpose RISC CPU that throws out almost everything but the number-crunching silicon. But since it doesn’t incorporate features needed by operating systems, like memory management, it relies on a host processor to feed it application kernels in the same manner as a GPGPU. The current implementation supports single precision floating point only, but plans are already in the works for a double precision implementation.
The general layout of Epiphany is a 2D mesh of simple cores, which talk to each other via a high-speed interconnect. In that sense, it looks more like Intel’s manycore Xeon Phi than a graphics processor, but without the x86 ISA baggage (but also without the benefit of the x86 ecosystem).
The latest Epiphany chip, which was spec’d out last fall, runs at a relatively slow 800MHz. But thanks to its highly parallel design and simplified cores, its 50 gigaflops/watt energy efficiency is among the best in the business. NVIDIA’s new K10 GPU computing card can hit about 20 single precision gigaflops/watt, but that also includes 8GB of GDDR5 memory and a few other on-board components, so it’s not an apples-to-apples comparison. Regardless, a 100 gigaflop chip drawing a couple of watts is a significant achievement.
The downside of the design is that it uses Adapteva’s own proprietary ISA, so there are no ready-made software tools that developers can tap into. “Everybody is very impressed by the numbers,” Olofsson told HPCwire. “They just haven’t quite been convinced they can program this thing.”
That has now changed. In conjunction with the 28nm samples, Adapteva has also released its own OpenCL compiler wrapped in their new software developer kit (SDK). The compiler is an adaptation of Brown Deer Technology’s OpenCL implementation developed for ARM and x86 platforms. Brown Deer provides tools and support for high performance computing applications and is especially focused on acceleration technologies based on GPUs and FPGAs. The Adapteva implementation means developers can now use standard OpenCL source to program the Epiphany processor.
Olofsson says they chose OpenCL because it’s a recognized open standard that is being used for heterogeneous computing platforms in all the segments Adapteva is interested in. In particular, it’s getting some traction on heterogeneous platforms in the embedded space, where GPUs are increasingly being targeted to general-purpose computing. “The way we are pitching [Epiphany] is that OpenCL GPGPUs may not be good at everything, because of their architectural limitations,” say Olofsson. “So why not put another accelerator next to it that is also OpenCL-programmable.”
Adapteva is putting the SDK through its paces using existing OpenCL codes like 2D Fast Fourier Transform (FFT) and multi-body physics algorithms that were downloaded off the Internet. The company is currently using an x86-based board for these test runs, but since OpenCL has bindings for C/C++, essentially any commodity CPU is fair game as the host driver. Adapteva’s SDK is currently in beta form and is being released to the company’s early access partners.
As far as getting the Epiphany chips onto useful platforms, that’s still a work in progress. At least some of the engineering samples of the 28nm chip will go to Bittware, an early customer of Adapteva’s. Bittware used the early 16-core, 32-gigaflop version of Epiphany on its custom PCIe boards. Those products are aimed at military and industrial application for things like embedded signal processing. Because of the need to minimize power usage in embedded computing, Epiphany is a good fit for this application domain. At least one more vendor has signed up to develop Epiphany-based PCIe boards, but that company is not ready to go public just yet.
Adapteva’s market aspirations extend beyond the military-industrial complex though. Olofsson believes Epiphany is ideal for mobile computing, and eventually HPC. With regard to the former, Adapteva is planning to use the new chip to demonstrate face detection, an application aimed at devices like smartphones and tablets. Face detection and recognition rely on very compute-intensive algorithms, which is fine if you’ve got a server or two to spare, but it’s beyond the number-crunching capabilities of most mobile-grade CPUs and GPUs today.
Other flop-hungry applications that could find a home on in this market include augmented overlays, gesture recognition, real-time speech recognition, realistic game physics, and computational photography. Like mobile-based face detection/recognition, all of these require lots of computational performance operating within very restricted power envelopes.
For high performance computing, the path is a little more complex. For starters, someone has to build a Epiphany-based PCIe card suitable for HPC servers, and then an OEM has to be enticed to support that board. To deliver a reasonable amount of computation for a server — say, a teraflop or so — you would need multiple Epiphany chips glued to a card, which would necessitate a PCIe expansion setup of some sort. Not an impossibility, but probably not a job for a do-it-yourselfer.
More fundamentally though, the architecture has to add support for double precision floating point to be taken seriously for HPC (although applications like seismic modeling, image and audio processing, and video analysis are fine with single precision).
In any case, double precision is already on Adapteva’s roadmap. “We’ll definitely have something next year,” says Olofsson.
Beyond that, the company has plans on the drawing board to scale this architecture up to the teraflop/watt realm. Following a Moore’s Law trajectory, that would mean that by 2018 a 7nm Epiphany processor could house 1,000 cores and deliver a whopping two teraflops. Since such a chip would draw the same two watts as the current 100 gigaflops version, it could easily provide the foundation for an exascale supercomputer. Or a killer tablet.
Feeds by Topic
- Developer Tools
Feeds by Industry
August 22, 2014
- IBM to Add Power Systems Linux Services to Innovation and Client Centers
- University of Houston Oil and Gas Workshop Issues Call for Participation
August 21, 2014
- Former Carnegie Mellon Computer Science Dean Now Assisting in White House Policy Office
- University of Delaware to Deploy Second HPC Cluster
- DESY and IBM Develop Big Data Architecture for Science
- Acceleware Releases Results for the Second Quarter of Fiscal 2014
- Terascala Enables Tech-X to Develop Scientific Applications for Big Data
August 20, 2014
- OLCF’s GPU Hackathon to Begin October 27
- Inspur Selects Micron as Leading Memory Provider for Server Portfolio
- 2014 IEEE International Conference on Cluster Computing Issues Call for Participation
August 19, 2014
- Imation Now Offering 6TB Drives with Nexsan E-Series Family
- ACME Project to Unleash Supercomputing on Climate Change
- DDN Reports Results for First Half of 2014
- Clemson Named CUDA Teaching Center by NVIDIA
August 18, 2014
- Speakers Announced for EnterpriseHPC’14 Summit
- Blue Waters Project to Offer Graduate Visualization Course in Spring 2015
- Fujitsu Announces New Tier III Datacenters in North America
- UT Students Perform Mock Run for SC14 Student Cluster Competition
- CFIUS Approves Sale of IBM x86 Business to Lenovo
- SGI Opens New German Subsidiary
Most Read Features
- Adapting Algorithms to Modern Hybrid Architectures
- Fujitsu Targets 100 Petaflops Supercomputing
- Exascale Resilience Turns a Corner
- NERSC Launches Exascale Readiness Program with Intel, Cray
- Big Data Needs Big Funding: NSF’s Jahanian Makes The Case
- Academic HPC and Cloud Services Start to Blur
- China’s Supercomputing Strategy Called Out
- Computer Science Education Act Hits Critical Milestone
- Charming Exascale Power Problems
- Bolstering Extreme Scale Computational Biology
- More Features…
Most Read Short Takes
- Jellyfish Use Novel Search Strategy
- Is This the Exascale Breakthrough We’ve Been Waiting For?
- Brainy IBM Chip Packs One Million Neuron Punch
- Parallel Programming with OpenMP
- DOE to Fund Exascale Resilience Research
- Deconstructing Moore’s Law’s Limits
- Building Parallel Code with Hybrid Fortran
- NVIDIA Boasts ‘Compelling HPC Solution’
- Parallel Computing Trends
- Earthquake Simulation Scales to 8.6 Petaflops
- More Short Takes…
Most Read Off The Wire
- Dongarra and Team Start New Supercomputing Journal
- Top Supercomputers of India List Released
- CMU Scientists Design Way to Use Multiple Programming Languages Within Same Program
- Micron Appoints Stephen Pawlowski as VP of Advanced Computing Solutions
- RIKEN Researchers Run 10,240 Parallel Simulations of Global Weather on K Computer
- Bright Computing Raises $14.5M to Accelerate HPC, Hadoop, and OpenStack Business
- Cray Reports Second Quarter 2014 Financial Results
- AMD FirePro S9150 Server Card Unveiled
- IBM Expands HPC Capabilities in the Cloud
- AMD Opteron 64-Bit ARM-Based Developer Kits Now Available
- More Off The Wire…
- EnterpriseHPC event Read more…
- Read more…
- Read more…
- Read more…
- Read more…
- Read more…
- 8/15/14 | NetApp | Find out how new hybrid cloud architectures enable you to take advantage of public cloud resources from Amazon, Microsoft, and others while Read more…
- 7/17/14 | Acceleware | Learn about advanced software engineering techniques that leverage the power of today’s parallel hardware to optimize applications for maximum Read more…
- Read more…
- Read more…
August 20 - 22San Diego CA United States
September 22New York NY United States
October 8 - 9Royal Victoria Dock London United Kingdom
October 23 - 24Shanghai China
November 4 - 6Santa Clara CA United States