This year at SC13 we counted over 100 HPC-specific announcements that hit the wires over the course of the week, many of which were from vendors, organizations and users that we were within a short walk across the show floor in Denver.
We wanted to point to some of the key newsmakers during the show, but before we delve deeper into some of their news items, there were some stories that deemed “best in show” during the course of the event. Our selections for “can’t miss” news items for SC13 include:
- Micron Exposes Double Life of Memory with Automata Processor
- Jack Dongarra Sheds Further Light on New HPC Benchmark
- RSC Squeezes a Petaflop in a Rack
- NVIDIA Intros K40 GPU Accelerators
- Intel Reveals New Details About Knight’s Landing
- IBM and NVIDIA Join Hands to Bring GPUs to POWER
- University of Texas at Austin Steals Second Student Cluster Competition
To get the full news firehose from last week, scroll down to the bottom of our special SC13 coverage page for a peek at top selected items.
Let’s take a more focused look at some of the week’s top newsmakers, beginning with the one company that seems to be everywhere (and they were hard to miss with their neon green scarves)…
NVIDIA
Before we jump directly into some of NVIDIA’s specific news, there are a few figures from the Top 500 BoF session that we’d like to share to highlight a few points.
Rather hard trends to ignore–movements that are echoed by both Intersect360 Research and IDC, which noted during their breakfast event at SC13 that “way back” in 2011, only 28.2% of the sites they surveyed had adopted coprocessors or accelerators whereas their 2013 figures showed that a remarkable 76.9% of sites had swept up acceleration. While they note that a great deal of this is still in experimental phase, it’s nonetheless significant as it could mark the shape of the Top 500 in lists to come–and with some recent news from NVIDIA that we’ll get to in a moment–more enterprise datacenters.
General trends aside, there was plenty of specific news around NVIDIA during SC13–not to mention quite a bit of action at their booth, which hosted ongoing sessions and learning/engagement activities. News-wise the GPU giant had a strong showing with the official unveiling and detailing of its K40 accelerators, which provide a boost in both processing capacity and memory over their K20X.
While you can read more about the release in this in-depth feature from last week, suffice to say, a number of system vendors climbed on board with support, including Cray, AMAX, Supermicro, Boston Limited, Exxac and others. As we reported, the upgrade can mean significant performance improvements via the activation of more cores on the GPU and also through a new GPU Boost mode that lets the CUDA cores overclock.
Outside of this uptick in GPU capability, NVIDIA had other news that could be a boon to its future expansion in further environments. First, they announced that CUDA 6.0 will be available in 2014, which they said can push an 8x improvement to applications. The update includes some long-awaited features, most notably unified memory. CUDA 6.0 also now enables new drop-in libraries and “multi-GPU scaling” which lets re-designed BLAS and FFT GPU libraries scale their performance automatically across up to 8 GPUs per node, “delivering over nine teraflops of double precision performance per node, and supporting larger workloads than ever before (up to 512GB).” This feature can be used with their new BLAS drop-in library.
In another effort to extend the reach of GPU computing, this time into the enterprise datacenter, NVIDIA and IBM announced a partnership to collaborate on GPU-accelerated versions of several of IBM’s enterprise applications on Power systems. The companies noted that this is “the first time that GPU accelerator technology will move beyond the realm of supercomputing and into the heart of enterprise-scale datacenters.”
“This partnership will bring supercomputer performance to the corporate data center, expanding the use of GPU accelerators well beyond the traditional supercomputing and technical computing markets,” said Ian Buck, vice president of Accelerated Computing at NVIDIA. “It will also provide existing supercomputing and high performance computing customers with new choices and technologies to build powerful, energy-efficient systems that drive innovation and scientific discovery.”
Aside from these items and the notable momentum on the Top500, NVIDIA’s GPUs topped the list for green supercomputers and overall efficiency. All ten of the top placeholders on the Green500 (some excellent info on that list and its evolution from one of its founders, Kirk Cameron here) were powered by GPUs. Further, the Top500 BoF shared the following to echo the efficiency sentiment…
Cray
As we move over to Cray, it’s worth pointing to their own involvement in NVIDIA’s news last week around the coming K40. The company was among the first to announce that it would support the updated GPU across its entire supercomputer line.
Given their history in HPC, it’s easy to make the claim that Cray is hard to ignore at SC, but this has really been their year to shine in some new ways. While their financials may not reflect knockout, wild growth, they’re working to innovate ahead of the curve, expanding into markets outside of scientific computing. In addition to the focus on their YarcData division, which is dedicated to delivering system and software solutions targeted at big data use cases, the company took another step in the enterprise direction with a new framework designed to allow Hadoop some easier hooks into their XC30 line of supers.
Cray was also behind the only new addition to this year’s top ten on the Top500—the 6.27 petaflop Piz Daint system at the Swiss National Supercomputing Centre—an XC30 powered by a Xeon E5/NVIDIA K20x duo and the Aries interconnect. Cray claimed a total of 48 entries on the Top500 this year, 18 of which were in the top 100 (and 2 in the top ten—Titan and now Piz Daint). Last November (SC12) they had a total of 55 systems, in 2011 they had 40, and in 2009 they had a 25 system share. In the years prior to 2009, they tended to hover around the 20 system share (give or take a few) regularly.
If the trend holds, they’ll be grabbing an ever-growing slice of Top500 pie, especially with the addition of some new systems and updates they’ve talked about that will be ripe for ISC’s list. For now, here is the vendor system share from this year:
There were a few end user stories about new and build-out implementations at the show as well, including news that Cray was awarded a contract to expand its XC30 environment at the University of Stuttgart’s “Hornet” site. When the expansion is complete, Hornet will deliver peak performance of more than seven petaflops (quadrillion mathematical calculations per second) and 2.3 petabytes of additional Cray file system storage for Lustre (CLFS). Additionally, the Center for Computational Sciences (CCS) at the University of Tsukuba in Japan has pushed an NVIDIA and Intel-powered Cray CS300 cluster into production. The new system has been combined with the University’s current Cray super, and is providing researchers and scientists with 1.1 petaflops at their disposal.
On the programming front, Cray also announced another sweep across parts of its line with the latest release of the Cray Compiler Environment (CCE), which is available now on the CS300 machines. According to Cray, this move with the CCE “provides customers with a proven, familiar and HPC-optimized compiler for highly parallel environments.”
We’ve definitely been watching Cray this year—and so have you, according to our Reader’s Choice award votes. The company was handed ten separate HPCwire awards this year– three Readers’ Choice Awards and seven Editors’ Choice Awards. These together mark the most awards Cray has won in a single year. This also marks the tenth consecutive year Cray was selected for HPCwire awards.
“We are truly honored by the scope of industry recognition for Cray this year,” said Peter Ungaro, president and CEO of Cray. “Credit is due to the hundreds of Cray employees who have committed their talents and energy to helping our customers solve the world’s most difficult computing challenges. As a company, we are intently focused on being a global leader in supercomputing, and the HPCwire awards are an acknowledgement of the results of working closely with our partners and our customers.”
Data Direct Networks
There seemed to be quite a bit of activity around the Data Direct Networks booth this year at SC as they held demos and showed off some their shiniest appliances against the din of the many other storage vendors that packed the floor.
At SC13 DDN rolled out two new models of its Storage Fusion Architecture(SFA) technology – the SFA12KX and the SFA12KXE. The company showcased the SFA12KX at SC13 and discussed the advanced processor technology and optimized OS, which they say to delivers up to 48GB/s and 1.4 Million IOPS from a single appliance.
The SFA12KXE leverages DDNTM In-Storage Processing technology (see image below) to back their own EXAScaler and GRIDScaler parallel file systems, as well as customer applications running natively within the storage array. The SFA12KXE is set to deliver up to 23GB/s of file system performance and eliminates external servers and storage networking to bring it into the “converged” camp that’s been garnering a great deal of attention.
According to Jean-Luc Chatelain, Executive Vice President of Strategy and Technology at DDN, “DDN’s SFA12KX appliances are the foundation of many of the world’s most demanding and data-intensive environments, each requiring massive performance and scale without high cost. With features such as application-aware Flash caching, Real-Time I/O and in-storage processing, our SFX12KX appliances will perform at up to 48GB/s.”
DDN gave SC13 attendees something else to talk about last week beyond their SFA news via their annual HPC Trends survey, which found that, perhaps unsurprisingly, storage I/O performance is a top priority for those designing HPC systems for “big data” workloads. The survey, which hit a cross-section of 60 of their end users, found that 68% of those polled agree that data and data storage has become the most strategic part of the HPC datacenter. As DDN noted, “Moreover, by a margin of two to one respondents, the survey also reveals that today’s storage technologies will need to undergo massive change to hit exascale proportions.” They also found that 78% “agree that hybrid storage is the evolutionary next step for HPC storage, combining both the performance and cost efficiency benefits required of storage at exascale.”
Data Direct Networks was another company that raked in a number of awards from HPCwire this year. DDN pulled in six separate awards for its work in manufacturing, government and industry collaborations, and financial services among other areas. The company also won Best HPC Storage Technology in conjunction with TACC as well as a Reader’s choice for Best Use of HPC in Oil and Gas for its role in BP’s new Center for High Performance Computing in Houston.
IBM
Outside of one major partnership news item, IBM only had a couple of light announcements at SC13, but before we dig into those specifics, just wanted to point to a trend that we’d noted before. Take a look below at the slide from this year’s Top 500 announcement that lists the top ten systems. Notice a trend, given that we’re talking now about Big Blue? (note: yellow highlight from BoF emphasis on the one new system–not related).
While this isn’t necessarily a surprise, it’s also worth noting a few other ways that IBM stole the listings last week at SC13. Take a look for instance at the following two charts, which show both the Green 500 supercomputer list and the Graph 500, which pits systems against one another based on their performance on the graph problem benchmark.
First, the Graph500
And now the Green500
As we showed earlier, there is some disparity between the big players in the Top 500 and their smaller counterparts. While HP has an overall Top500 system share of 39% to IBM’s 33% (but with a performance share of 31.6% to HP’s 15.5%), it’s certainly worth pointing out that IBM is taking the cake in ways that few vendors can in the three big areas that matter—performance on LINPACK, performance on big data graph problems, and overall efficiency.
Back in the section on NVIDIA we highlighted a Top500 slide that pointed to the most efficient architectures and while IBM isn’t anywhere to be found there, NVIDIA and IBM’s partnership (again, highlighted earlier in the article) could turn that graphic around by the time next November rolls around with the POWER and GPU combination.
Outside of that announcement, IBM had a relatively quiet show but if there was one new item that slipped under the radar (at least in many of the conversations this attendee was around) this was certainly one. We should also point out that the Gordon Bell prize this year was awarded to researchers tapping Sequoia, providing IBM with another notch in its SC belt.
Other Strong Showings
We wanted to highlight a few other vendors that we watched at SC13 who had noteworthy and/or voluminous news. These include Mellanox, HP (which still holds the systems share on the Top 500) and Bull. Some of their key news items are linked below.
Mellanox
Mellanox InfiniBand Adoption Grows Across TOP500 Systems
NVIDIA RDMA Technology Supported on Mellanox InfiniBand Adapters
Mellanox InfiniBand Adapter Performance Revealed
Quantum Selects Mellanox as InfiniBand Partner
Mellanox Ranked 251 on Deloitte’s Technology Fast 500
Purdue Supercomputer Utilizes Mellanox InfiniBand Adapters
HP
TACC to Deploy Maverick for Visualization and Data Analysis
HP Servers Support TOP500 Supercomputers
Bull
University of Reims Supercomputer Ranked 5th on Green500 List
Bull Receives Honors in HPCwire Readers’ and Editor’s Choice Awards