February 06, 2013
SANTA CLARA, Calif., Feb. 5 – NVIDIA today announced a new report describing dramatic performance increases for math libraries accelerated using NVIDIA CUDA, the world's most pervasive parallel computing platform and programming model for accelerating scientific and engineering applications on NVIDIA GPUs.
Available as a free download on the NVIDIA Developer Zone website, the report details the ground-breaking application performance increases enabled by CUDA 5 GPU-accelerated math libraries running on the new NVIDIA Tesla K20 GPU accelerators. These include cuFFT, cuBLAS, cuSPARSE, cuRAND, the NVIDIA Performance Primitives (NPP) library, Thrust and a C99-compatible math library, all of which are available for free in the latest CUDA 5 Toolkit release.
Providing application developers with the easiest and fastest way to add GPU acceleration to their code, the CUDA libraries deliver dramatic performance advantages compared to Intel's Math Kernel Library (MKL) routines running on the latest Intel CPUs. Performance highlights from the report include:
NVIDIA is hosting a webinar on Wednesday, Feb. 6, from 10 – 11 a.m. PT to provide more information about CUDA 5 math library performance. Register for the webinar on the GTC Express website.
Hands-on Labs @ GTC
NVIDIA will also provide developers with a unique opportunity to experience the dramatic performance increases delivered by GPU-accelerated libraries in a series of hands-on labs at the fourth-annual GPU Technology Conference (GTC).
At GTC 2013, which will be held in San Jose, Calif., from March 18-21, developers will have the opportunity to expand their programming skills with CUDA C/C++ and OpenACC. They can also experience the performance advantages of the new CUDA 5 GPU-accelerated libraries using their own laptops to virtually access powerful GPU-based workstations hosted by Amazon in the cloud. To register for the conference, visit the GTC 2013 registration page. Once registered, attendees can log in to reserve a seat for the hands-on labs.
The CUDA 5 platform makes the development of GPU-accelerated applications faster and easier than ever, including support for dynamic parallelism, GPU-callable libraries, NVIDIA GPUDirect technology support for RDMA (remote direct memory access), and the NVIDIA Nsight Eclipse Edition integrated development environment (IDE). To learn more about CUDA or download the latest version, visit the CUDA website.
About CUDA
CUDA is a parallel computing platform and programming model developed by NVIDIA. It enables dramatic increases in computing performance by harnessing the power of GPUs. With more than 1.6 million downloads, supporting more than 180 leading engineering, scientific and commercial applications, the CUDA programming model is the most popular way for developers to take advantage of GPU-accelerated computing.
About NVIDIA
NVIDIA awakened the world to computer graphics when it invented the GPU in 1999. Today, its processors power a broad range of products from smartphones to supercomputers. NVIDIA's mobile processors are used in cell phones, tablets and auto infotainment systems. PC gamers rely on GPUs to enjoy spectacularly immersive worlds. Professionals use them to create 3D graphics and visual effects in movies and to design everything from golf clubs to jumbo jets. And researchers utilize GPUs to advance the frontiers of science with high performance computing. The company has more than 5,000 patents issued, allowed or filed, including ones covering ideas essential to modern computing. For more information, see www.nvidia.com.
-----
Source: NVIDIA Corp.
Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
Read more...
Although Horst Simon was named Deputy Director of Lawrence Berkeley National Laboratory, he maintains his strong ties to the scientific computing community as an editor of the TOP500 list and as an invited speaker at conferences.
Read more...
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...
May 09, 2013 |
The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.