What I Learned at NGDC: Technology is Ready, Users are Not

By Derrick Harris

August 13, 2007

If I had to boil what I observed at last week’s Next Generation Data Center conference into one thought, it would be this: Although technologies to virtualize, optimize and automate datacenters do exist and are mature, organizations are still leery about making the transformation, despite coveting the associated improvements in performance, flexibility and overall efficiency.

As evidence that these technologies are mature, one really need look no further than the event’s opening keynotes, in which Amazon CTO Werner Vogels and eBay distinguished research scientist Paul Strong discussed the Web-scale datacenters being operated by their respective employers. Strong, for his part, delved a little deeper into the nitty-gritty details, all the while stressing the importance of building a datacenter that: (1) runs processes driven by SLAs; (2) operates as a value center rather than a cost center; and (3) enables the rolling out of new utilities, platforms, etc. To achieve this next-generation datacenter, he said, many technologies can and should be considered, including (but certainly not limited to) grid computing, utility computing, real-time solutions and virtualization.

Now, it’s unlikely that most organizations have the resources (or the need; as Strong said, “If we don’t keep up, our business is gone”) to develop and/or manage a datacenter like eBay’s, a highly automated and virtualized environment consisting of several thousand blade servers, but that doesn’t mean companies with less demand on their infrastructures can’t learn some lessons from the online auction leader. For starters, said Strong, automation is the key to efficiency and, in a point he made sure to drive home, it is important to “manage relationships, not things.” This advice should be particularly relevant as today’s average datacenters continue to evolve toward advanced models like that of eBay. After all, when you’re staring down thousands of physical machines (and likely significantly more virtual ones), you can’t possibly expect to manage each one individually.

As for Vogels, whose presentation kicked off the doubleheader, he targeted his comments toward companies that aren’t too keen on managing their own resources, and he used the opportunity to push Amazon’s stable of Web services. For most companies, he estimates, 70 percent of time and money goes toward “heavy lifting” operations like maintenance, load balancing or software management, none of which offers much in terms of innovation or helps differentiate your company from competitors. Unless you’re in an industry where having a customized, highly efficient datacenter directly translates into dollars, he suggested, running your own might be a waste of resources on all levels. “At [Amazon’s] scale, datacenters matter,” he stated. “They don’t for everyone.”

Following his statement that “I hate datacenters,” Vogels cited the recent power outage at a popular San Francisco datacenter (which led to temporary shutdowns of several Web 2.0 leaders, including Craigslist, Second Life and Netflix) as one example of what can go wrong. Power-wise, he added, even if you have generators to ensure you keep running, you still need batteries to handle the gap between the power going out and the generators kicking in. Staying in the realm of physical issues, Vogels noted that datacenter managers also need to worry about sufficient cooling and how to handle a fire or other disaster. And that’s not even addressing business-side concerns, such as whether one datacenter is enough, or how you’re going to push out enough bandwidth to handle demand. Oftentimes, he said, companies need to overprovision to handle peak loads or in case they become successful.

His solution? Use services such as Amazon’s Elastic Compute Cloud (EC2), as well as its other Web services offerings, to handle your computing needs, paying only for what you actually use. Vogels talked about this notion as the “push versus pull” model of resource management, where “pushing” refers to the old-style method of preemptively pushing resources toward problems and “pulling” refers to the more progressive concept of pulling in resources from a centralized pool as needed, just like Amazon itself does. For organizations that don’t necessarily benefit from managing their own datacenters, he said, this utility model will allow them to get the computing resources they need elsewhere, thus freeing up time and money to spend on innovation.

However, while Vogels’ solution to datacenter woes might sound logical and easy enough to incorporate, I wouldn’t hold my breath waiting for this idea to gain mainstream acceptance, much less adoption. Why, you ask? Because organizations are having a difficult enough time taking advantage of next-generation tools within their own walls — something far less scary than the notion of relying on someone else to make sure their applications get the attention they deserve. This was made abundantly clear during a presentation by OGF President Mark Linesch, who was simply trying to lay out the business case for and current status of grid computing technologies.

During Linesch’s presentation, an audience member (who actually has some firsthand grid experience under his belt) asked how he is supposed to sell the idea of grid or virtualization to his application developers, some of whom still oppose running their applications on distributed or virtualized platforms. (In fact, this gentleman had just denied a request for 60-plus servers to develop and test a new application, preferring instead that the work be done on existing, virtually partitioned machines.) Linesch, backed up by Ravi Subramaniam, who has plenty of insight to offer after years of managing Intel’s in-house grid, gave really the only answer one can give in this situation, regardless of how frustrating it might be to someone desperately seeking a cost-efficient, dynamic IT platform for his or her organization: You have to start slow, showing success in one area at a time, perhaps with just one application, and illustrate how that translates into other areas. Not exactly the best way to show off a technology’s full range of capabilities, but not exactly uncommon, either.

This point was hammered home when I sat down to speak with Jay Fry and Ken Oestreich of Cassatt, vice president of marketing and director of product management and marketing, respectively. With its capabilities in areas like capacity on-demand, application virtualization, service level automation and utility metering, Cassatt’s Collage software certainly falls under the “next-generation” umbrella, but customers aren’t always ready to experience it in full force from the get-go. In fact, customers have been known to ask for a pared-down version of Collage, something Cassatt might have to provide in order to show them, one step at a time, that its software is for real.

I was happy to hear, though, that Cassatt is making inroads on another front: the battle to ease customers’ minds about the cultural and organizational changes that come along with the technical changes of a shared IT platform. Gone are the siloed applications and their siloed personnel. Gone are the days of server hugging. Gone are the days of low utilization and high overhead. While these all sound like great things, that kind of change apparently can be quite daunting for IT departments, which is why many are hesitant to cross over into the promised land. Cassatt and consultancy partner BearingPoint have been walking customers through this process, which they believe needs to happen in parallel with the technical changes, in their New York-based customer experience center, and the reaction has been very positive thus far. You can read more about the Cassatt/BearingPoint partnership here, and you can expect to see more about Cassatt’s take on utility computing in the weeks to come.

Speaking of application virtualization and its associated functionalities, the topic came up in a panel discussion featuring three distinct virtualization users and, wouldn’t you know, it seems to be a little much for them at this point. When the topic of “virtualization 2.0” was brought up, which was defined as including the grid-like abilities (e.g., high availability, SLA management and scalability) often associated with application virtualization solutions, the response was not overly positive. While Brian Harris, president and founder of Virtual Ngenuity, stated that he believes these functionalities are currently driving business decisions, two of his fellow panelists showed that this might not be entirely the case. Richard Robinson, chief operations officer for the Department of Telecommunications and Information Services, City and County of San Francisco, commented that while his department is working toward these goals, it is not there yet and might not be for quite a while. After all, he noted, the “if it’s not broken, don’t fix it” axiom carries a lot of weight in the local government sector. Sudip Chahal, a senior architect in Intel’s IT Strategy, Architecture and Innovation organization, espoused his belief that this type of architecture isn’t well-suited for traditional business applications, a view not shared by audience member Dave Pearson of Oracle or, I would assume, most of the distributed IT community.

Of course, there was more going on at LinuxWorld/NGDC than just vendors showing off their cutting-edge technologies to, and discussing them with, skeptical end users. For example, some cool news also came out of the show, such as Appistry adding power-saving functionality to its Enterprise Application Fabric; ServePath announcing high-performance hosting via its virtualized GoGrid service; EnterpriseDB challenging Oracle with GridSQL; and IBM tackling your mountains of widely dispersed data with its grid- and virtualization-powered Information Server Blade, which Big Blue says has demonstrated significant improvements in batch process performance, hardware price performance and budget expenditure. If you’re still hungry for more after reading these announcements, don’t fret, as we’ll have more on all of them, as well as a look at the growing grid hosting business, in the weeks and months to come.

Outside of NGDC news, be sure to check out these noteworthy items: “GigaSpaces Powers Sun’s Market Data Solution”; “NCAR Adds Resources to TeraGrid”; “Imense Using Grid to Become ‘Google of Image Searching’”; “Trigence Intros Optimized App Virtualization Software”; “Sun Releases Fastest Commodity Microprocessor”; and “Layered Tech Announces Super Grid.”

-----

Comments about GRIDtoday are welcomed and encouraged. Write to me, Derrick Harris, at [email protected].

HPCwire is a registered trademark of Tabor Communications, Inc. Use of this site is governed by our Terms of Use and Privacy Policy.

Reproduction in whole or in part in any form or medium without express written permission of Tabor Communications, Inc. is prohibited.
