The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
July 14, 2006
In May, Sun Microsystems, Inc. announced that the Tokyo Institute of Technology (Tokyo Tech), one of the world's premier science and technology universities, installed a 38-teraflops supercomputer, based on Sun Fire Server technology. Tokyo Tech's supercomputer, called TSUBAME, represents Sun's largest high performance computing win to date. The system also represents the largest supercomputer outside the United States. It includes 10,480 AMD Opteron processor cores, more than 21 terabytes of memory and 1.1 petabytes of hard disk storage.
IDC's Earl Joseph recently completed an exclusive interview for HPCwire with Professor Satoshi Matsuoka, of the Tokyo Tech's Global Scientific Information and Computing Center. Professor Matsuoka talks about the significance of the new TSUBAME system and describes some of the cutting-edge applications that will be running on it.
HPCwire: What's the status of the Sun system?
Matsuoka: The Sun system was installed in March 2006. When the Sun boxes arrived in mass, the infrastructure was already in place, so the installation went quickly, one of the fastest installations ever of such a large system. On April 3, we were able to start the service on a limited number of nodes. We did preliminary Linpack runs on the whole system in early May. In early June, about 5,000 processors, or half of the system, was opened up for public use. We were reserving the other 5,000 processors for benchmarking and special allocations.
The user workload has been growing steadily and is already at 70 percent. We're now running hundreds of jobs on the system. We're receiving help desk mail from all of the university departments we serve. Many users are telling us they love the new system. Some users are starting to do massive runs. One of these, for example, is looking at scenarios with avian flu and the impact of potential social policy decisions. This is a parameter study.
HPCwire: How difficult was the installation?
Matsuoka: Based on our prior experiences bringing up large computers, the glitches we've faced with this Sun system have been quite minor. We're currently stress-testing the machine around the clock. There have been no major complaints. It's just been an amazingly quick ramp-up to stable operation. After just a few weeks, failures were minor and very sporadic.
HPCwire: Were there any surprises?
Matsuoka: There are always a couple. Staying with fat nodes turned out to be a good strategic move. With a small node system of this size, we would have had to manage 10,000 thin nodes. With fat nodes, the number is only 655, which is not all that different from the scale we have been used to. The other surprise is how much attention we are getting because of this system.
HPCwire: I've been hearing a lot lately about the issue of power consumption on large-scale HPC systems. How has your new system affected power consumption at your center?
Page: 1 of 4(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Mar 16 | Bio-IT World | Biotech firm builds genetic models from patient data. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html