June 18, 2012
World's fastest single-precision accelerator, Tesla K10 GPU accelerates widely-used defense, seismic, life sciences, and video applications
HAMBURG, Germany, June 18 -- ISC'12 - NVIDIA Tesla K10 GPUs offer performance breakthroughs on popular high performance computing (HPC) applications -- ranging from seismic processing to life sciences to video processing -- according to new benchmarks NVIDIA released today.
Based on the new NVIDIA Kepler computing architecture, the Tesla K10 GPU delivers the industry's highest single precision performance (4.58 teraflops) and highest memory bandwidth (320 GB/sec) in a single accelerator. This is 12 times higher single precision flops and 6.4 times higher memory bandwidth than the latest-generation Intel Sandy Bridge CPUs(1) .
The Tesla K10 GPU outperforms CPUs and previous-generation GPUs across the board on the most popular, compute-intensive applications for four key market segments, including:
-- Defense: video analytics, video stabilization, orthorectification, computer vision
-- Life and material sciences: molecular dynamics
-- Oil and gas: seismic processing, reverse time migration
-- Media and entertainment: video editing, video rendering/transcoding, ray tracing
"A distinct advantage of the Tesla K10 GPUs is that it excels in two key areas that have a dramatic impact on overall application performance: floating point operation and memory bandwidth," said Sumit Gupta, senior director of Tesla business at NVIDIA. "Together, these enable the K10 GPU to deliver substantial out-of-the-box performance increases for the top science, engineering and commercial applications with little or no effort on the part of the developer."
New Performance Records on AMBER and LAMMPS On AMBER, a leading biomolecular simulation software application, four Tesla K10 GPUs achieved world record performance, delivering far superior results than what was available on multiple racks of servers just a few years ago.(2)
The Tesla system achieved performance of 76 nanoseconds of computer simulation time in a day for a 23,558 atom molecule, outstripping the previous record set with four Tesla M2090s last year, providing supercomputing performance to thousands of individual researchers to fuel further innovation in such areas as new drug discovery and more effective materials.
"In biomolecular science, adding a few more nanoseconds of simulation time can make a world of difference in the ability of researchers to study and better understand the behavior of complex biological systems," said Ross Walker, assistant research professor, San Diego Supercomputing Center. "It still blows my mind that a single Tesla K10 outperforms some of the largest CPU clusters. The benefit it offers researchers is tremendous, enabling them to accelerate the search for new and better treatments for a host of diseases and disorders."
The Tesla K10 GPU also delivers the highest performance on LAMMPS, another application widely used by the life sciences research community. Running the LAMMPS Lennard Jones Liquid Benchmark, a single Tesla K10 GPU outperforms a Tesla M2090 GPU by 80 percent, delivering the equivalent performance of a cluster with 64 x86 CPUs.(3)
Accelerating the Search for Energy NVIDIA Tesla GPUs continue to deliver the highest performance on reverse time migration (RTM) applications for seismic processing in the oil and gas exploration industry, and for image processing in the computer vision industry. Petrobras, the national oil and gas company of Brazil, achieved an 1.8x speed up on its RTM application on the Tesla K10 GPU, as compared to a Tesla M2090 GPU within the same power envelope.
NVIDIA Tesla K10 GPUs are available from leading OEMs, including Appro Supercomputer Solutions, Dell, HP, IBM, SGI and Supermicro, as well as through NVIDIA distribution partners. More information about the Tesla K10 is available on the NVIDIA Tesla website.
About NVIDIA Tesla GPUs NVIDIA Tesla GPUs are massively parallel accelerators based on the NVIDIA CUDA(R) parallel computing platform. Tesla GPUs are designed from the ground up for power-efficient, high performance computing, computational science and supercomputing, delivering dramatically higher application acceleration for a range of scientific and commercial applications than a CPU-only approach.
To learn more about CUDA or download the latest version, visit the CUDA website. More NVIDIA news, company and product information, videos, images and other information is available at the NVIDIA newsroom. You can also follow us on Twitter (@NVIDIATesla).
About NVIDIA NVIDIA NVDA +2.16% awakened the world to computer graphics when it invented the GPU in 1999. Today, its processors power a broad range of products from smartphones to supercomputers. NVIDIA's mobile processors are used in cell phones, tablets and auto infotainment systems. PC gamers rely on GPUs to enjoy spectacularly immersive worlds. Professionals use them to create 3D graphics and visual effects in movies and to design everything from golf clubs to jumbo jets. And researchers utilize GPUs to advance the frontiers of science with high performance computing.The company has more than 5,000 patents issued, allowed or filed, including ones covering ideas essential to modern computing. For more information, see www.nvidia.com.
(1) Compared to Intel Xeon Processor E5-2690
(2) Server system is 4 nodes, each node configuration with a single Tesla K10 GPUs, Dual Intel Xeon X5670, 72 GB DDR3 memory. For large cluster performance benchmark: http://ambermd.org/amber10_bench_files/jac_nve_kraken_ranger_large.png
(3) http://lammps.sandia.gov/bench/lj_xt5.html
-----
Source: NVIDIA
In a recent solicitation, the NSF laid out needs for furthering its scientific and engineering infrastructure with new tools to go beyond top performance, Having already delivered systems like Stampede and Blue Waters, they're turning an eye to solving data-intensive challenges. We spoke with the agency's Irene Qualters and Barry Schneider about..
Read more...
Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
Read more...
May 22, 2013 |
At some point in the not-too-distant future, building powerful, miniature computing systems will be considered a hobby for high schoolers, just as robotics or even Lego-building are today. That could be made possible through recent advancements made with the Raspberry Pi computers.
Read more...
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...
May 09, 2013 |
The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.