Getting Beyond the Compute Grid: The Challenge of ‘Grid 2.0’

By By William Fellows, Principal Analyst, The 451 Group

September 25, 2006

Early adopters of Grid computing have indicated that many are looking for ways to move beyond the use of grids specifically for computational tasks. They want to eliminate silos and organize IT around shared resources, simplify access to data, and provide a single, consistent view of the business for the entire enterprise. The challenge they face is essentially getting “beyond the compute grid,” a topic we've looked at in a number of reports.

Grid computing supports the key functional drivers required to achieve these goals, including virtualization, service-oriented architecture (SOA) and utility delivery models. To do this, grids need to be absorbed into the enterprise infrastructure itself and not used only as a stand-alone computational engine. In such a strategic role, grids must support automated data, storage and service activities just as capably as handling computational tasks. These challenges are being addressed by what is being called “Grid 2.0.”

If “Grid 1.0” is principally concerned with the virtualization, aggregation and sharing of compute resources, Grid 2.0 is focused on the virtualization, aggregation and sharing of all compute, storage, network and data resources. It is both service-oriented — uses Web services and provides access to IT as a service — and automated.

Grid Adoption

The 451 Group has tracked more than 250 early adopters of Grid computing and related technologies in the 451 Grid Adoption Research Service (GARS) and has developed a model to track deployment progress:

  • Level 5 — Enterprise-wide grids, SOA, shared internal utility, outsourced utility.
  • Level 4 — Linked grids, within or between departments and with multiple applications.
  • Level 3 — Siloed grids: single grid, or grids in multiple departments that aren't linked.
  • Level 2 — Single application running.
  • Level 1 — Proofs of concept, trials.

This is not necessarily a sequential deployment path. Plenty of the organizations that 451 analysts have tracked are at level 2 or 3 and are not expecting to evolve through levels 4 and 5. Once organizations begin to link grids, their use can be thought of as relatively mature. At this stage, grids are well understood and often no longer the responsibility of a core R&D team. Once early adopters reach level 4, there are a range of level 5 strategic IT objectives that can be supported on top or outsourced.

Grid 1.0 usage is mostly at levels 1 through 3, and Grid 2.0 activities characterize levels 4 and 5. Financial services, pharmaceutical, electronic design automation, oil and gas, manufacturing and telecom vertical markets were the earliest adopters of grids. The applications they were and are using are primarily high-performance computing (HPC), embarrassingly parallel and batch-oriented in nature. Deployments are typically limited to one or two applications using traditional Grid middleware/schedulers. Culturally, they are very line-of-business oriented and in reality are mostly at level 2.

Grid vendors are seeking ways to expand beyond initial application beachheads. They are seeking additional early adopters within vertical markets beyond the early adopters, as well as application uses “downstream” of HPC.

In the Grid 2.0 era, use is extending across all verticals including retail, customer record management, insurance, travel, transport and government. Business intelligence, text analysis and search are some of the additional application tasks being supported on grids. Existing applications and application servers are also being deployed onto Grid-enabled containers without requiring prior knowledge of the infrastructure, and legacy applications are being wrapped for grids. Additional vendors are supplying tools to solve problems that are “beyond the compute grid,” while existing Grid vendors are diversifying to other markets and activities. Culturally, top-down, centralized approaches are driven by the CxO office and compliance is a key concern. The reality is that grids are being used to support new models such as shared internal utilities services.

So what are the kinds of activities for which grids are being deployed, and how do users describe their own deployments? When asked, 70 percent of early adopters who responded to a survey said there is a better term than “Grid” to describe their distributed computing architectures: 23 percent said virtualization, 23 percent said HPC, 19 percent said utility computing, 19 percent said clustering, and 15 percent said SOA.

Nothing Virtual about Virtualization

Virtualization enables emulation (makes something look like something else), aggregation (many things look like one thing) and segregation (one thing looks like many).

Is there a catch? Like grids, there is a good deal of confusion about what the term virtualization means, and two uses are emerging — virtualization inside a box and virtualization across multiple systems.

Furthermore, the effect virtualization will have on software licensing (and compounded by multi-core) has yet to be resolved. But it certainly will not be virtual. The major software vendors are terrified users will be able to extract more value from their software without paying any additional dollars. They're hoping that a mix of per-processor and per-user pricing mechanisms can satisfy users.

Who is Using Virtualization and Grids?

A leading European investment bank (that did not want to be named in this research) is using virtualization to create a “virtual resource market” that will allocate resources based on business demand, as part of its IT economy project. Other companies working on shared internal utility models using grids and virtualization include HSBC Bank, JPMorgan Chase, Royal Bank of Scotland, Bank of Montreal and BNP Paribas. Outside of financial services, users include Micro Technology Unlimited, Corus, DaimlerChrysler, GlaxoSmithKline and Arcelor.

Their challenges include the restrictions imposed by proprietary virtualization code, the ability to link grids over LANs and WANs and manage data and storage access, and how to create a virtual resource pool from a diverse range of IT assets and encourage the use of non-dedicated assets. HSBC's approach to the latter problem has been to offer services supported on shared assets at 50 percent of the cost IT will charge to groups requiring dedicated assets. One looming issue is the tax implications of hosting a grid in a different country from the users, which is not good from a users' point of view and indeed has forced HSBC to “park” its global shared grid strategy for the time being.

In the managed services and utility computing space (shared external utilities), the likes of Savvis Communications, Electronic Data Systems, Computer Sciences Corp., Hewlett-Packard, IBM, Oracle, Sun Microsystems, AT&T and BT are or expect to offer access to virtualized, shared resources. 451 analysts believe that there is a lot of marketing around utility computing, but not much of a market yet. In reality, there is little ability to pay only for what you use. Indeed, shared, internal utilities are today's hotspot.

Service-Oriented Architecture

Much like grids, SOA means different things to different people. What we do know is that you cannot buy an SOA. Different “ingredients” must be put together in order to build it. SOA is regarded as a way to implement services that are componentized, reusable and interoperable. The notion is that businesses using an SOA can be more competitive by responding more quickly to the changing market conditions through reuse and commodity services. It also potentially frees up internal IT resources to focus on innovating for the business rather than managing existing resources.

For the early adopters The 451 Group has interviewed, SOA is principally about the implementation of Web services. For them, Web services are afforded fault tolerance and scalability by running on a grid. Together with virtualization, this can deliver an SOA. Grids are a means to this end for many early adopters — especially in financial services. Grids are being regarded as the technology infrastructure and SOA as an application infrastructure.

451 analysts have noticed that SOA is principally concerned with new application development, while traditional HPC or compute grids are concerned mostly with chopping up existing applications. The bottom line is that grids allow users to run Web services better, faster and cheaper because they are cross-platform, can scale and offer better utilization. They also allow users to dodge platform and language wars.

Who is Using SOA and Grids?

At Bank of America, grids are seen as a corporate success story, although the grid itself is not an answer to anything. What is key is its ability to support service-oriented environments/SOA. The issue for Bank of America is that downstream data consumers cannot keep up with the volumes the grid generates. A plethora of support services are also required to keep grids primed, and resource ownership is an impediment to resource sharing.

Royal Bank of Scotland has a company-wide Web services (SOA) strategy, using grids for some part of it. Its big issue is that Web services equals federation which equals unreliability, it says. It is using grids and virtualization as the way around it.

Union Bank of Switzerland hopes to create an SOA that will support multiple teams and departments. It believes grids can support this. Its issue is that the value mostly resides in existing custom applications — and why should this be moved? It says Web services communication mechanisms are immature and that grids are not yet the tip of the spear for SOA.

A European investment bank (that did not want to be named) is creating a “virtual resource market” that will allocate resources based on business demand. It believes using SOA will allow IT to return to innovating for the business. Its issues are that data distribution can quickly become a headache, licensing for grids is a hairball and there will be vendor lock-in until standards shake out.

Other companies using grids to support an SOA strategy include Barclays Capital, Citigroup, Commerzbank, Deutsche Bank, ING Direct and JPMorgan. There are just as many that are specifically not going down the SOA route, including HSBC, TDS, BMO, DE Shaw, Markit Group and Bowne.

Business Drivers

Initial conversations with early adopters revealed the business drivers for adopting grids were mostly concerned with price/performance (better utilization and optimization of resources, maximizing investments), cost sharing (multiple groups share cost of non-dedicated assets), and improved management (multiple systems under single management environment). If early adopters did not specifically mention saving money, they spoke of cost control and emphasized “hard” benefits.

Once grids have landed and expanded beyond beachheads, continued adoption has typically been framed by a very different set of business drivers. Collaboration and the linking of resources, being able to better respond to changing conditions, enabling shorter time to market and doing more new things are most often discussed. It suggests “making money” — and not only “saving money” — is the key here. HSBC reinforces this idea, saying “using grids doesn't save us money, but it will enable us to make more money.”

For more information, visit www.the451group.com/intake/gridtoday-gridworld-sept06.

About William Fellows

William Fellows is a principal analyst at New York-based The 451 Group — an independent technology industry analyst company focused on the business of enterprise IT innovation.

** GRIDwire, GRIDtoday's exclusive coverage from GridWorld 2006, can be seen at www.gridtoday.com/gridworld/06/index.html.

 

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industry updates delivered to you every week!

Empowering High-Performance Computing for Artificial Intelligence

April 19, 2024

Artificial intelligence (AI) presents some of the most challenging demands in information technology, especially concerning computing power and data movement. As a result of these challenges, high-performance computing Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that have occurred about once a decade. With this in mind, the ISC Read more…

2024 Winter Classic: Texas Two Step

April 18, 2024

Texas Tech University. Their middle name is ‘tech’, so it’s no surprise that they’ve been fielding not one, but two teams in the last three Winter Classic cluster competitions. Their teams, dubbed Matador and Red Read more…

2024 Winter Classic: The Return of Team Fayetteville

April 18, 2024

Hailing from Fayetteville, NC, Fayetteville State University stayed under the radar in their first Winter Classic competition in 2022. Solid students for sure, but not a lot of HPC experience. All good. They didn’t Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use of Rigetti’s Novera 9-qubit QPU. The approach by a quantum Read more…

2024 Winter Classic: Meet Team Morehouse

April 17, 2024

Morehouse College? The university is well-known for their long list of illustrious graduates, the rigor of their academics, and the quality of the instruction. They were one of the first schools to sign up for the Winter Read more…

Kathy Yelick on Post-Exascale Challenges

April 18, 2024

With the exascale era underway, the HPC community is already turning its attention to zettascale computing, the next of the 1,000-fold performance leaps that ha Read more…

Software Specialist Horizon Quantum to Build First-of-a-Kind Hardware Testbed

April 18, 2024

Horizon Quantum Computing, a Singapore-based quantum software start-up, announced today it would build its own testbed of quantum computers, starting with use o Read more…

MLCommons Launches New AI Safety Benchmark Initiative

April 16, 2024

MLCommons, organizer of the popular MLPerf benchmarking exercises (training and inference), is starting a new effort to benchmark AI Safety, one of the most pre Read more…

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

April 15, 2024

As the AI revolution marches on, it is vital to continually reassess how this technology is reshaping our world. To that end, researchers at Stanford’s Instit Read more…

Intel’s Vision Advantage: Chips Are Available Off-the-Shelf

April 11, 2024

The chip market is facing a crisis: chip development is now concentrated in the hands of the few. A confluence of events this week reminded us how few chips Read more…

The VC View: Quantonation’s Deep Dive into Funding Quantum Start-ups

April 11, 2024

Yesterday Quantonation — which promotes itself as a one-of-a-kind venture capital (VC) company specializing in quantum science and deep physics  — announce Read more…

Nvidia’s GTC Is the New Intel IDF

April 9, 2024

After many years, Nvidia's GPU Technology Conference (GTC) was back in person and has become the conference for those who care about semiconductors and AI. I Read more…

Google Announces Homegrown ARM-based CPUs 

April 9, 2024

Google sprang a surprise at the ongoing Google Next Cloud conference by introducing its own ARM-based CPU called Axion, which will be offered to customers in it Read more…

Nvidia H100: Are 550,000 GPUs Enough for This Year?

August 17, 2023

The GPU Squeeze continues to place a premium on Nvidia H100 GPUs. In a recent Financial Times article, Nvidia reports that it expects to ship 550,000 of its lat Read more…

Synopsys Eats Ansys: Does HPC Get Indigestion?

February 8, 2024

Recently, it was announced that Synopsys is buying HPC tool developer Ansys. Started in Pittsburgh, Pa., in 1970 as Swanson Analysis Systems, Inc. (SASI) by John Swanson (and eventually renamed), Ansys serves the CAE (Computer Aided Engineering)/multiphysics engineering simulation market. Read more…

Intel’s Server and PC Chip Development Will Blur After 2025

January 15, 2024

Intel's dealing with much more than chip rivals breathing down its neck; it is simultaneously integrating a bevy of new technologies such as chiplets, artificia Read more…

Choosing the Right GPU for LLM Inference and Training

December 11, 2023

Accelerating the training and inference processes of deep learning models is crucial for unleashing their true potential and NVIDIA GPUs have emerged as a game- Read more…

Baidu Exits Quantum, Closely Following Alibaba’s Earlier Move

January 5, 2024

Reuters reported this week that Baidu, China’s giant e-commerce and services provider, is exiting the quantum computing development arena. Reuters reported � Read more…

Comparing NVIDIA A100 and NVIDIA L40S: Which GPU is Ideal for AI and Graphics-Intensive Workloads?

October 30, 2023

With long lead times for the NVIDIA H100 and A100 GPUs, many organizations are looking at the new NVIDIA L40S GPU, which it’s a new GPU optimized for AI and g Read more…

Shutterstock 1179408610

Google Addresses the Mysteries of Its Hypercomputer 

December 28, 2023

When Google launched its Hypercomputer earlier this month (December 2023), the first reaction was, "Say what?" It turns out that the Hypercomputer is Google's t Read more…

AMD MI3000A

How AMD May Get Across the CUDA Moat

October 5, 2023

When discussing GenAI, the term "GPU" almost always enters the conversation and the topic often moves toward performance and access. Interestingly, the word "GPU" is assumed to mean "Nvidia" products. (As an aside, the popular Nvidia hardware used in GenAI are not technically... Read more…

Leading Solution Providers

Contributors

Shutterstock 1606064203

Meta’s Zuckerberg Puts Its AI Future in the Hands of 600,000 GPUs

January 25, 2024

In under two minutes, Meta's CEO, Mark Zuckerberg, laid out the company's AI plans, which included a plan to build an artificial intelligence system with the eq Read more…

China Is All In on a RISC-V Future

January 8, 2024

The state of RISC-V in China was discussed in a recent report released by the Jamestown Foundation, a Washington, D.C.-based think tank. The report, entitled "E Read more…

Shutterstock 1285747942

AMD’s Horsepower-packed MI300X GPU Beats Nvidia’s Upcoming H200

December 7, 2023

AMD and Nvidia are locked in an AI performance battle – much like the gaming GPU performance clash the companies have waged for decades. AMD has claimed it Read more…

DoD Takes a Long View of Quantum Computing

December 19, 2023

Given the large sums tied to expensive weapon systems – think $100-million-plus per F-35 fighter – it’s easy to forget the U.S. Department of Defense is a Read more…

Nvidia’s New Blackwell GPU Can Train AI Models with Trillions of Parameters

March 18, 2024

Nvidia's latest and fastest GPU, codenamed Blackwell, is here and will underpin the company's AI plans this year. The chip offers performance improvements from Read more…

Eyes on the Quantum Prize – D-Wave Says its Time is Now

January 30, 2024

Early quantum computing pioneer D-Wave again asserted – that at least for D-Wave – the commercial quantum era has begun. Speaking at its first in-person Ana Read more…

GenAI Having Major Impact on Data Culture, Survey Says

February 21, 2024

While 2023 was the year of GenAI, the adoption rates for GenAI did not match expectations. Most organizations are continuing to invest in GenAI but are yet to Read more…

The GenAI Datacenter Squeeze Is Here

February 1, 2024

The immediate effect of the GenAI GPU Squeeze was to reduce availability, either direct purchase or cloud access, increase cost, and push demand through the roof. A secondary issue has been developing over the last several years. Even though your organization secured several racks... Read more…

  • arrow
  • Click Here for More Headlines
  • arrow
HPCwire