TSUBAME Prototype System Balances Benchmark Leadership

By Nicole Hemsoth

July 9, 2014

When it comes to large-scale supercomputing installations, Asia is the continent to watch carefully over the next few years. Already host to the top system in the world, China’s Tianhe-2, Japan and others have ambitions to take over the top ten list of systems.

In the most recent rankings, Japan is home to 30 systems out of the Top 500 worldwide supercomputer share, up from just 18 systems in 2010. In addition to the #4 K Computer system at RIKEN, another noteworthy machine, the #14-ranked TSUBAME 2.5 (or TSUBAME KFC, named for the Kepler Fluid Cooling component, which ironically means it’s submerged in oil) is garnering attention. Part of why this 76,032-core system is one to watch is because it’s setting the stage for a new machine with far higher performance—and some novel integrations of unique storage, network, middleware and other technology.

TSUBAMEKFCsmallThis month at ISC, Satoshi Matsuoka from the Tokyo Institute of Technology presented an overview of progress with the TSUBAME KFC machine, which is the precursor and prototype for the next-generation system of the same name that the team will roll out sometime in 2016. When it emerges in 2016, TSUBAME 3 is expected to hit the 25-30 petaflop performance mark while balancing some new technologies cooked into the middleware, storage, and network. In addition to the “Kepler Fluid Cooling” which was at the heart of its top efficiency rankings on the Green 500 this summer, where it was the top system.

“Assuming that TSUBAME-KFC’s energy efficiency could be scaled linearly to an exaflop supercomputing system, one that can perform one trillion floating-point operations per second, such a system would consume on the order of 225 megawatts (MW),“ said Wu Feng of the Green 500. “Although this 225-megawatt power envelope is still quite far from DARPA’s optimistic target of a 67-megawatt power envelope, it is an order of magnitude better than the initial projection of a nearly 3000-megawatt power envelope from 2007 when the first official Green500 list was launched.”

But the Green 500 and prototype system’s placement in the Top 500 are just part of a larger story–one that Masuoka doesn’t want the community to overlook. It’s about handling the next generation of data-intensive applications, which is an area full of lessons from outside of supercomputing.

The focus of TSUBAME-3 (and leading into 4 in the 2020-20222 timeframe) will be on balancing efficiency, data-readiness, and of course performance or, as Matsuoka described in his talk, a convergence of supercomputing with extreme big data. We are all aware of the bubble big data has presented in HPC, but Mastuoka says it’s critical to design systems that integrate lessons learned from hyperscale cloud datacenters as well as what appears ahead for eventual exascale-class systems.

TSUBAME4

The current KFC machine ranked #12 on the Graph 500, and #6 on the Green Graph 500, which looks at the energy efficiency of solving “big data” graph problems. This is where the real future focus of the system in its 2016 incarnation will be, says Matsuoka. As he explained, at the beginning of the Graph 500 list, the expectation among some was that the list would look far different than the Top 500 with a number of cloud vendors submitting their distributed machines for the rankings. However, the list looks quite similar to the Top 500, with the same machines at the top of the list that are in the Top 500 and to a lesser extent, the Green 500. The hope is to balance top results across these categories with an eye on real-world applications, not just benchmark toppling.

These early predictions stood to reason since ostensibly, the big clouds were tackling “big data” jobs The common estimate is that a giant web services company like Amazon has around 500,000 nodes with around 6 million cores spread throughout its network. That makes for a massive distributed machine, but the core counts of these cheaper servers are often far lower than ultra-dense supercomputers. For instance, Tianhe-2 has 3 million cores spread across 18,000 nodes. Matsuooka says this point isn’t a surprising one—large datacenters are common, but they tend to be very sparse; they don’t require the networking and density of supercomputers—and therefore don’t have the same capability.

The goal of the next incarnation of TSUBAME in 2016 will be reducing the size of the system while supplying the needed bandwidth and compute horsepower in a much smaller amount of space. Cheap SSDs, ultra-dense system design, and leveraging new uses of burst buffer technology to offload critical processing tasks are key to the approach with both the coing TSUBAME 3 and the future 4 machine.

More text here

TSUBAMEKFCMatsuoka says that TSUBAME 3 will feature larger capacity SSDs that will give the Tokyo Tech team local bandwidth of about 50 TB of capacity, or 50 GB/s bandwidth in a single rack, suppose 40 racks, several terabytes per second of aggregate bandwidth. They’re working with DDN now to further this future.

The Top 500 list in 2016 is set to be an interesting one, particularly in November, with the addition of this next-generation machine and several others we’ve heard word of. While not all the major machines set to come online by Linpack time will be running the famous benchmark since it doesn’t adequately reflect their goals, Japan is expected to take advantage of all three major benchmarking opportunities–Top 500, Green 500, and Graph 500–to show the balanced system they’re seeking…one that’s ultra-efficient, big data capable, and of course, high performing in a top 10-class way.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

IBM Launches Commercial Quantum Network with Samsung, ORNL

December 14, 2017

In the race to commercialize quantum computing, IBM is one of several companies leading the pack. Today, IBM announced it had signed JPMorgan Chase, Daimler AG, Samsung and a number of other corporations to its IBM Q Net Read more…

By Tiffany Trader

TACC Researchers Test AI Traffic Monitoring Tool in Austin

December 13, 2017

Traffic jams and mishaps are often painful and sometimes dangerous facts of life. At this week’s IEEE International Conference on Big Data being held in Boston, researchers from TACC and colleagues will present a new Read more…

By HPCwire Staff

AMD Wins Another: Baidu to Deploy EPYC on Single Socket Servers

December 13, 2017

When AMD introduced its EPYC chip line in June, the company said a portion of the line was specifically designed to re-invigorate a single socket segment in what has become an overwhelmingly two-socket landscape in the d Read more…

By John Russell

HPE Extreme Performance Solutions

Explore the Origins of Space with COSMOS and Memory-Driven Computing

From the formation of black holes to the origins of space, data is the key to unlocking the secrets of the early universe. Read more…

Microsoft Wants to Speed Quantum Development

December 12, 2017

Quantum computing continues to make headlines in what remains of 2017 as several tech giants jockey to establish a pole position in the race toward commercialization of quantum. This week, Microsoft took the next step in Read more…

By Tiffany Trader

IBM Launches Commercial Quantum Network with Samsung, ORNL

December 14, 2017

In the race to commercialize quantum computing, IBM is one of several companies leading the pack. Today, IBM announced it had signed JPMorgan Chase, Daimler AG, Read more…

By Tiffany Trader

AMD Wins Another: Baidu to Deploy EPYC on Single Socket Servers

December 13, 2017

When AMD introduced its EPYC chip line in June, the company said a portion of the line was specifically designed to re-invigorate a single socket segment in wha Read more…

By John Russell

Microsoft Wants to Speed Quantum Development

December 12, 2017

Quantum computing continues to make headlines in what remains of 2017 as several tech giants jockey to establish a pole position in the race toward commercializ Read more…

By Tiffany Trader

HPC Iron, Soft, Data, People – It Takes an Ecosystem!

December 11, 2017

Cutting edge advanced computing hardware (aka big iron) does not stand by itself. These computers are the pinnacle of a myriad of technologies that must be care Read more…

By Alex R. Larzelere

IBM Begins Power9 Rollout with Backing from DOE, Google

December 6, 2017

After over a year of buildup, IBM is unveiling its first Power9 system based on the same architecture as the Department of Energy CORAL supercomputers, Summit a Read more…

By Tiffany Trader

Microsoft Spins Cycle Computing into Core Azure Product

December 5, 2017

Last August, cloud giant Microsoft acquired HPC cloud orchestration pioneer Cycle Computing. Since then the focus has been on integrating Cycle’s organization Read more…

By John Russell

GlobalFoundries, Ayar Labs Team Up to Commercialize Optical I/O

December 4, 2017

GlobalFoundries (GF) and Ayar Labs, a startup focused on using light, instead of electricity, to transfer data between chips, today announced they've entered in Read more…

By Tiffany Trader

HPE In-Memory Platform Comes to COSMOS

November 30, 2017

Hewlett Packard Enterprise is on a mission to accelerate space research. In August, it sent the first commercial-off-the-shelf HPC system into space for testing Read more…

By Tiffany Trader

US Coalesces Plans for First Exascale Supercomputer: Aurora in 2021

September 27, 2017

At the Advanced Scientific Computing Advisory Committee (ASCAC) meeting, in Arlington, Va., yesterday (Sept. 26), it was revealed that the "Aurora" supercompute Read more…

By Tiffany Trader

NERSC Scales Scientific Deep Learning to 15 Petaflops

August 28, 2017

A collaborative effort between Intel, NERSC and Stanford has delivered the first 15-petaflops deep learning software running on HPC platforms and is, according Read more…

By Rob Farber

Oracle Layoffs Reportedly Hit SPARC and Solaris Hard

September 7, 2017

Oracle’s latest layoffs have many wondering if this is the end of the line for the SPARC processor and Solaris OS development. As reported by multiple sources Read more…

By John Russell

AMD Showcases Growing Portfolio of EPYC and Radeon-based Systems at SC17

November 13, 2017

AMD’s charge back into HPC and the datacenter is on full display at SC17. Having launched the EPYC processor line in June along with its MI25 GPU the focus he Read more…

By John Russell

Nvidia Responds to Google TPU Benchmarking

April 10, 2017

Nvidia highlights strengths of its newest GPU silicon in response to Google's report on the performance and energy advantages of its custom tensor processor. Read more…

By Tiffany Trader

Japan Unveils Quantum Neural Network

November 22, 2017

The U.S. and China are leading the race toward productive quantum computing, but it's early enough that ultimate leadership is still something of an open questi Read more…

By Tiffany Trader

GlobalFoundries Puts Wind in AMD’s Sails with 12nm FinFET

September 24, 2017

From its annual tech conference last week (Sept. 20), where GlobalFoundries welcomed more than 600 semiconductor professionals (reaching the Santa Clara venue Read more…

By Tiffany Trader

Amazon Debuts New AMD-based GPU Instances for Graphics Acceleration

September 12, 2017

Last week Amazon Web Services (AWS) streaming service, AppStream 2.0, introduced a new GPU instance called Graphics Design intended to accelerate graphics. The Read more…

By John Russell

Leading Solution Providers

IBM Begins Power9 Rollout with Backing from DOE, Google

December 6, 2017

After over a year of buildup, IBM is unveiling its first Power9 system based on the same architecture as the Department of Energy CORAL supercomputers, Summit a Read more…

By Tiffany Trader

Perspective: What Really Happened at SC17?

November 22, 2017

SC is over. Now comes the myriad of follow-ups. Inboxes are filled with templated emails from vendors and other exhibitors hoping to win a place in the post-SC thinking of booth visitors. Attendees of tutorials, workshops and other technical sessions will be inundated with requests for feedback. Read more…

By Andrew Jones

EU Funds 20 Million Euro ARM+FPGA Exascale Project

September 7, 2017

At the Barcelona Supercomputer Centre on Wednesday (Sept. 6), 16 partners gathered to launch the EuroEXA project, which invests €20 million over three-and-a-half years into exascale-focused research and development. Led by the Horizon 2020 program, EuroEXA picks up the banner of a triad of partner projects — ExaNeSt, EcoScale and ExaNoDe — building on their work... Read more…

By Tiffany Trader

Delays, Smoke, Records & Markets – A Candid Conversation with Cray CEO Peter Ungaro

October 5, 2017

Earlier this month, Tom Tabor, publisher of HPCwire and I had a very personal conversation with Cray CEO Peter Ungaro. Cray has been on something of a Cinderell Read more…

By Tiffany Trader & Tom Tabor

Tensors Come of Age: Why the AI Revolution Will Help HPC

November 13, 2017

Thirty years ago, parallel computing was coming of age. A bitter battle began between stalwart vector computing supporters and advocates of various approaches to parallel computing. IBM skeptic Alan Karp, reacting to announcements of nCUBE’s 1024-microprocessor system and Thinking Machines’ 65,536-element array, made a public $100 wager that no one could get a parallel speedup of over 200 on real HPC workloads. Read more…

By John Gustafson & Lenore Mullin

Flipping the Flops and Reading the Top500 Tea Leaves

November 13, 2017

The 50th edition of the Top500 list, the biannual publication of the world’s fastest supercomputers based on public Linpack benchmarking results, was released Read more…

By Tiffany Trader

Intel Launches Software Tools to Ease FPGA Programming

September 5, 2017

Field Programmable Gate Arrays (FPGAs) have a reputation for being difficult to program, requiring expertise in specialty languages, like Verilog or VHDL. Easin Read more…

By Tiffany Trader

HPC Chips – A Veritable Smorgasbord?

October 10, 2017

For the first time since AMD's ill-fated launch of Bulldozer the answer to the question, 'Which CPU will be in my next HPC system?' doesn't have to be 'Whichever variety of Intel Xeon E5 they are selling when we procure'. Read more…

By Dairsie Latimer

  • arrow
  • Click Here for More Headlines
  • arrow
Share This