November 17, 2009
Rich standards-based tools and libraries develop around CUDA architecture
PORTLAND, Ore., Nov. 17 -- SC09 -- NVIDIA and its ecosystem partners will deliver, over the next few months, the industry's broadest set of software releases to developers using GPU computing in their work.
These updates feature major releases across a broad spectrum of GPU Computing development languages, tools and libraries. Included are updates from NVIDIA for its CUDA C compiler, with additional support for C++ and its upcoming GPU codenamed "Fermi." NVIDIA is also releasing its R195 driver that includes new extensions to its OpenCL 1.0 conformant driver and toolkit, and a beta release of the NVIDIA code name Nexus, the industry's first development environment for massively parallel computing, which is integrated into Microsoft Visual Studio.
Alongside NVIDIA's own updates, several partner releases from industry leaders in software tools are available now, including The Portland Group's CUDA Fortran solution, Allinea's Distributed Debugging Tool (DDT) and the TotalView debugger.
"The only effective way to scale performance in demanding applications is to move to a parallel computing model," said Sanford Russell, general manager of GPU Computing software at NVIDIA. "The NVIDIA CUDA architecture facilitates this critical transition with its broad industry support and network of software consultants and training resources for massively parallel computing."
Updates to NVIDIA and its partners' parallel computing development tools include the following:
CUDA Toolkit 3.0 Beta: With the CUDA Toolkit 3.0 Beta, developers can start developing applications today for the NVIDIA Fermi architecture. This beta release includes features such as ECC reporting, Dual DMA Engine, Concurrent Kernel Execution and NVIDIA Fermi HW debugging support in cuda-gdb. Performance profiling is included for both CUDA Visual Profiler and the OpenCL Visual Profiler. Also included is support for a new unified interoperability API for Direct3D and OpenGL including Direct3D 11.
OpenCL 1.0 Extensions: NVIDIA is the only vendor supporting OpenCL features beyond the minimum conformance level. New extensions released by NVIDIA include support for double precision, OpenGL interoperability and the new OpenCL Installable Client Device (ICD). These new features supplement existing NVIDIA-only support for 2D image, 32-bit atomics and byte addressable stores.
NVIDIA "Nexus," the codename for the industry's first development environment for massively parallel GPU applications, integrated into Microsoft Visual Studio IDE: Comprised of a Debugger, Performance Analyzer and Graphics Inspector, this beta release gives GPU Computing developers an immediate boost in productivity through common and easy to use tools.
The Portland Group (PGI) -- CUDA Fortran: Production release of the world's first Fortran compiler compatible with the NVIDIA CUDA-enabled GPUs. CUDA Fortran will accelerate the adoption of GPU Computing in areas where applications are written in Fortran, such as ocean modeling, weather forecasting, environmental modeling, seismic analysis, bioinformatics and other areas.
Professional HPC Debugging Solutions from Allinea and TotalView were also launched this week. These tools provide CUDA GPU features that complement existing capabilities for parallel debugging using MPI, OpenMP and pthreads on the Linux platform. It enables developers to debug applications that are running on hybrid clusters of x86-64 CPUs and Tesla GPU-based servers.
Numerical Analysis Packages: Significant advances in the use of CUDA-enabled GPUs have also been made in prominent numerical analysis and mathematical modeling packages such as MATLAB from Mathworks, Mathematica from Wolfram Research and LabVIEW from National Instruments.
CUDA Libraries: In addition, developers can take advantage of a rich set of CUDA-accelerated libraries available from NVIDIA and its partners including BLAS, FFT, LAPACK (EM Photonics CULA), MAGMA (ICL at the UTK), NVIDIA Performance Primitives (NPP), CUDA Vision Workbench (CVWB) and video and image processing libraries.
To accelerate the momentum of CUDA optimized tools and applications, customers can take advantage of worldwide training and consultancy services offered by a growing number of CUDA Consultants, such as Acceleware Corp., ANEO, CAPS, Elegant Mathematics, EM Photonics, Fixstars, GASS Ltd., HPC Project, Infosys, SagivTech, Stone Ridge Technology and Tech-X Corp.
The breadth of languages, APIs, libraries and other tools that are now supported by NVIDIA graphics processing units (GPUs) based on the CUDA parallel processing architecture represent the industry's most flexible and pervasive set of tools available for parallel computing today.
About NVIDIA
NVIDIA (Nasdaq: NVDA) awakened the world to the power of computer graphics when it invented the graphics processing unit (GPU) in 1999. Since then, it has consistently set new standards in visual computing with breathtaking, interactive graphics available on devices ranging from portable media players to notebooks to workstations. NVIDIA's expertise in programmable GPUs has led to breakthroughs in parallel processing which make supercomputing inexpensive and widely accessible. Fortune magazine has ranked NVIDIA #1 in innovation in the semiconductor industry for two years in a row. For more information, see www.nvidia.com.
-----
Source: NVIDIA Corp.
Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
Read more...
Although Horst Simon was named Deputy Director of Lawrence Berkeley National Laboratory, he maintains his strong ties to the scientific computing community as an editor of the TOP500 list and as an invited speaker at conferences.
Read more...
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...
May 09, 2013 |
The Japanese government has revealed its plans to best its previous K Computer efforts with what they hope will be the first exascale system...
Read more...
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.