The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
From the Editor | Main Blog Index
June 04, 2009
Hardly a week goes by now where some big IT company isn't announcing a new cloud computing platform. Jumping into clouds seems metaphorically questionable, but a lot of IT firms see large-scale utility computing as the next big thing in computing, and they don't want to be left out. Most recently hopping onto the bandwagon are Verizon, Computer Sciences Corp., and Sun Microsystems -- its second foray into on-demand computing.
Those three companies add to a growing list of cloud providers, including Google, Microsoft, IBM, HP, AT&T, and dozens of smaller players. But HPC users seem to be gravitating toward the 800-pound gorilla in the room -- Amazon and its Elastic Compute Cloud (EC2) offering. Even though EC2 has only been around for three years, it represents the oldest and most established general-purpose cloud computing platform.
In particular, EC2 looks like it's becoming the platform of choice for biotech companies. Our February report on startup Pathwork Diagnostics is an example of a small company using EC2 to offload a cancer tissue analytics application. They cited Amazon's $0.10/CPU-hour cost as the main attraction for outsourcing some of their work. Larger biotech companies are using EC2 as well. An article last week in Chemical & Engineering News by Rick Mullin described how a handful of big pharmaceutical firms are tapping into clouds. Pfizer, Eli Lilly & Co., Johnson & Johnson and Genentech are all looking to offload some of their bioinformatics work onto the cloud. From Mullin's article:
Although Lilly has a sizable installed base of computers, the company's IT infrastructure is operating at full capacity, says Andrew Kaczorek, senior systems analyst for discovery IT. "Because we have hundreds of different users, what we see is spiky utilization," Kaczorek says. "The result is that for days at a time our clusters are at 100% of capacity. This means there are actually scientists who have work to be done that is literally sitting in a queue." Although exact cost savings are difficult to calculate, they are clearly significant, according to Powers and Kaczorek, as are the time savings.
For example, the company was able to rent CPU cycles on EC2 to run a bioinformatics sequencing code on a 64-node EC2 cluster. For a 20 minute run, the cost to Eli Lilly was $6.40. That's hard to beat when compared to the price of maintaining those additional 64 compute nodes on a permanent basis.
For bioscience businesses, the cloud story is especially compelling. Unlike other traditional HPC users like government labs, financial services firms, and oil & gas companies, life sciences came relatively late to the information technology game, so computing know-how and infrastructure at these companies tend to be spread rather thinly (at least relative to, say, a DOE lab). But today biotech companies are fully immersed in and dependent upon information technology, especially high performance computing. Mullin continues:
[T]he rapid creation of life sciences data keeps pointing to the use of cloud computing, and this is especially true in the area of genomics research. Advances in nanoscale and microfluidic chemistry now allow DNA to be monitored on tiny beads by photographic sensors that, according to Chris Dagdigian, principal consultant for the BioTeam, generate TIFF images in collections of up to 800 gigabytes. "This creates a massive data-capture and handling problem," he says. "We are now in an era where instruments that are showing up in very small wet laboratories are capable of producing a terabyte or more of data in a day."
It's conceivable that the drug companies will bypass the large-scale datacenter build-out that occurred in other HPC verticals and move directly to an on-demand computing model. As such, it may serve as a model for how other HPC users, especially smaller organizations and new users with little high-end computing expertise, can get cloud-enabled.
The early experiences by these drug firms also point to how security concerns are holding back more widespread use of cloud computing. In this case, their main concern is protecting their intellectual property and patents, but almost all HPC users (not to mention just everyday enterprise users) have security issues of one sort or another. It's worth noting here that Verizon's new cloud platform offers added security, primary because their cloud runs over their own private network. But they also offer additional security in the form of identity and access management, host intrusion detection, application vulnerability assessment, network application assessment and professional security services. It's not too hard to imagine that computing in the cloud can be made at least as secure as it is behind a local firewall.
For the HPC crowd, the longer term concern is performance. For a good synopsis of this topic take a look at Douglas Eadline's recent article in Linux Magazine and the EC2 benchmarking paper (PDF) he references. The main argument put forth is that running applications directly on top of purpose-built HPC machines is always going to be more efficient than running applications through a bunch of cloud layers on general-purpose platforms. My take on this is that focusing on performance and computing efficiency ignores the more useful (but more slippery) concept of productivity. I've yet to see a research paper look at HPC in the cloud from this perspective.
There are some early attempts to marry cloud computing services with traditional HPC infrastructure. Darkstrand, Nimbus Services, R Systems, Univa UD, and a handful of other companies are on the leading edge of HPC-as-a-service that bring real supercomputers into the mix. Wolfram Research is also developing its own HPC Cloud Service in partnership with Nimbus and R Systems. Whether HPC will be able to carve out its own niche in cloud computing is an open question, but a deeper discussion of this will have to wait until another day.
Posted by Michael Feldman - June 4 @ 3:39PM
(Digg, Technorati, more)
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Michael Feldman is the editor of HPCwire.
More Michael Feldman
Re: Multicore Watershed by Nastyanna
HPC? not so much by ewahl
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Cray Corrals Big Defense Deal by watchesuk
We think by watchesuk
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Lena by Nastyanna
Lena by Nastyanna
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Re: AMD Confirms 12-Core Opteron Production by Nastyanna
Re: Cray Corrals Big Defense Deal by Nastyanna
Re: Podcast: Cray Awarded Defense Deal; SGI Makes Storage Buy; IBM Invents New Algorithm by Nastyanna
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
a third vendor for lossless low latency 10GbE fabric by lee.fisher@hp.com
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
563 out of 1206 by vvolkov
Little Iron by gadunk
At least it's not "cloud" by KevinButerbaugh
Native QPI Interface? by commike
Mmmmmm by hellcats
New transistorized IC chip scales. by symmecon
Itanium at IDF by Alan M. Feldstein
Communication time by jnapper
"The financial meltdown and computing" by donpellegrino
Human Models by mdgabriel
High-End SPARC Chip for Scientific Applications by Alan M. Feldstein
RapidMind by Mr LolO
Rapidmind by dminor
Longer run times by JohnWest
re: Algo trading Angst by jshore
Results of Testing by in_the_crease
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
The ACM Turing Award goes to the creator of the modern personal computer; and Voltaire announces a mid-range InfiniBand switch and new technology that accelerates distributed applications. We recap those stories and more in our weekly wrapup.
Read More...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Mar 16 | Bio-IT World | Biotech firm builds genetic models from patient data. Read more...
Mar 15 | The Register | EMC's grand vision for unified global storage. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html