Why is Project MegaGrid so MegaQuiet?

By By Peter Meade**

March 14, 2005

When Project MegaGrid was announced at OpenWorld late last year, it was executed with all the ingredients of an industry-shaking and -shaping blockbuster. Imagine Dell Inc., EMC Corp., Intel Corp. and Oracle Corp. entwining their individually monumental powers and products to, as was articulated at the time, “develop a standard approach to building and deploying an enterprise Grid computing infrastructure.”

Now, fast-forward about three months to today. Why hasn't there been a steady stream of progress reports made on MegaGrid detailing all that has been accomplished since Dell founder and chairman Michael Dell made his introductory speech that day in San Francisco? Perusing the Web sites of the four partners reveals an equally conspicuous absence of updated information on MegaGrid. So perhaps a reminder of the original intention is necessary.

Initially, MegaGrid was to be made up of Intel-powered dual Xeon and four-way Itanium processor-based Dell PowerEdge servers with network-attached storage and management software from EMC as well as Oracle's 10g technology. The goal: enlighten enterprises on how to take advantage of all the superlatives of next generation's distributed computing technology.

Most of EMC's product line was to be involved, most notably Clariion CX and Symmetrix DMX networked storage systems, Celerra NS Series/Gateway network attached storage systems as well as ControlCenter and Navisphere information management software. Aside from the servers, Dell was tossing in related I/O technology. In addition to hosting the Grid, Oracle supplied its 10g suite of software, application server, database, Real Application Clusters and Enterprise Manager. Additional technology was provided by Cramer, in the form of its application software, and F5 Networks Inc. with its Big-IP switches.

Where is this much-hailed project today? Has it accomplished its goal of showcasing how a Grid infrastructure can stand up to performance and price of traditional data centers? If so, it's been done mega-quietly. Phase one has been completed, according to Randy Hietter, director of product management for Oracle, and it seems with no fanfare.

“Oracle's contribution was [to provide] primary engineering support to plan, run and document the tests,” he explained. Aside from serving as provider of the physical hosting facility — the company's Global IT data center in Austin, Texas — Oracle also provided the database, infrastructure and management software. According to Hietter, the company supplied Oracle Database 10g, Oracle Real Application Clusters, Oracle Enterprise Manager 10g with Grid Control and Oracle Application Server 10g along with project management.

The first part of the project featured Dell and the others building a system that clustered up to 32 of the company's PowerEdge 1750 servers. At the same time, best practices guides were constructed detailing the design, deployment, testing and management of mega-clusters featuring technology from EMC, Intel and Oracle. While some of the specific details remain sketchy, one thing is sure: the companies did come through on their promise to offer white papers, cross-referenced Web site copy, how-tos and a demonstration or two. Unfortunately, real-world Grids require so much more.

According to Shawn Douglass, director of partner engineering for EMC, the events of phase one are documented in a series of three white papers. The first, on design considerations, maps out the standard operating system configuration and methodology. The second deals with capacity planning, including directions for scaling the configuration. The third lays out performance management, specifically detailing the myriad workload capabilities.

“What we have today is an operational methodology,” he explained. “Today you can't buy a Grid, you must build one. We have put the pieces in place, tested them and documented the results.” Yet despite this monumental task and the massive marketing muscle of the four partners, in uncharacteristically quiet fashion, the big four summarily moved onto phase two.

“Phase two entails further building out the Grid further and expanding the workload,” said Oracle's Hietter. “For now, the team is heads-down in doing work.” More details about what has and will transpire are on tap for early fall, he added.

This phase reportedly focuses on scaling the enterprise Grid to an excess of 100 servers running a mixed workload. “Grids don't run just one thing at a time,” said EMC's Douglass. “With this mixed workload we show how Grid lets enterprises allocate and reclaim resources as needed in a fully automated fashion.” The four partners expect to leverage a lot of what has taken place in Grid development in the academic world and translate it to a business situation, he added.

“We need to meet the changing needs of business while maintaining the expected quality of service levels,” said Douglass. From this exercise will come a document explaining resource provisioning and management, with emphasis on the operational procedures.

The second phase involves clusters of PowerEdge 1850 servers. While Oracle's Hietter categorized the progress as “on track,” he added “it was never sized at 128 servers,” as previously reported. Instead, he explained, “the Grid will grow in 2005 to at least 128 servers in phases subsequent to phase two.” Guidelines are expected to follow detailing best practices procedures for a variety of enterprise applications. The goal: deliver ways that enterprises can consolidate databases, applications, servers and storage onto a common Grid platform.

Originally hailed as “an open-ended venture,” MegaGrid was expected to embrace other players as members. Yet the reality is there has been a curious lack of announcements, which cannot bode well for wider-spread acceptance of the project. Perhaps the project needs more than the two existing levels of participation — full partners such as Oracle, Dell, EMC and Intel, the founding four — and technology contributors, such as F5 Networks Inc. and Cramer Systems.

While not utilized much in the first phase, F5 expects to make a more pivotal contribution in phase two. According to Bill Evidon, senior business development manager for F5, the company's switch will become “the broker for a lot of resources,” helping the MegaGrid handle huge traffic loads. “F5 brings hardened, high-performance network development to MegaGrid,” he explained. “Big-IP's intelligent traffic handling ability and Web services API let the other vendors make pro-active network adjustments.”

According to Jeff Browning, F5 product manager, a crucial part of phase two is expanding network nodes with more workloads and applications. F5 will pay particular focus to the management of service level agreements and automation. The Big-IP switch really shines in these aspects, he added, with its dynamic workload balancing, which keys on the applications or services being loaded while monitoring areas of excess capacity. Perhaps focusing on the lesser-name players, such as F5 and Cramer, will provide MegaGrid with some diversity until the big four can cajole some new comrades.

“We are always evaluating possible new partners,” said Hietter. “Right now we are in the midst of speaking with some other potential members. We aren't in a position to disclose any further details on those discussions at this time.” While phase two originally was ticketed for the inclusion of operating systems from Microsoft Corp. and Novell Inc., no word has emerged from either camp about becoming any level of MegaGrid member.

However, there remains hope of turning the MegaGrid quartet into a quintet or more. Don't be surprised if “a strong network player” is added in time to be announced at this year's Oracle OpenWorld, which is scheduled for September, said EMC's Douglass.

“There has been a groundswell of interest,” added Douglass, who just returned from Asia, where she said he met with several interested parties.  “We need to figure out how to best plug them in.”

Granted, it's no simple task to combine the complex core technologies and massive technical resources of this foursome. But the incentive is huge. Let's face it, the quartet of some of the computing industry's biggest names are not banding together just for the goodness of providing cost-effective Grid computing solutions. More succinctly, it's because they are feeling the collective pain from selling vs. the pricey, hulking Unix boxes hawked by Hewlett-Packard Co., IBM and Sun Microsystems Inc., as well as the foes' complete Grid solutions.

The aforementioned trio individually already has customers with large Grid installations, so the MegaGrid foursome must do a better job of tooting their horn on the progress being made. So far, the only MegaGrid solutions that have been trumpeted are online store Overstock.com and BT, where an online transaction processing application comprised of 10 Dell servers (cost: $69,000) replaced a $2.9 million Sun Solaris setup. This is an impressive cost saving, so rear back and blow.

According to F5's Browning, his company already has a great comfortable level with MegaGrid partner Oracle. The Big-IP switch has been used internally at the software giant for many mission-critical applications. Even so, he added F5 welcomes the challenges ahead in phase two. “We have done a lot of horizontal scaling of databases,” he explained. “But traditionally, it's been with big, proprietary Unix boxes.” F5 is looking forward to showing its process in this high-profile Grid configuration.

While no new product introductions are expected from Project MegaGrid, the process of working as a team should give the partners excellent usage data for designing and configuring various combinations of Grid hardware and software. As MegaGrid further addresses specific business needs, there may emerge bundled product packages that are “MegaGrid tested” for use outside the project.

It's clear that Dell, EMC and Oracle all can achieve raised status in the Grid market, if they want it badly enough. The picture remains unclear, however, whether this high-profile project will succeed to where businesses will consider the MegaGrid partners to deliver the framework for new Grid implementations.  Regarding Intel, as in almost every market it has entered, the “inside” giant is entrenched in a “no lose” situation. Do any of the other MegaGrid members see that Intel wins in Grid as long as businesses select anyone other than Sun? That said, it's more constructive to make sure Intel remains true to its MegaGrid commitments. Then, if the project has something great to announce, well, go on and say it.

“Four big players, all together, have agreed, build, documented and tested,” said EMC's Douglass of MegaGrid's achievements so far. “Most such projects of this magnitude don't ever get out of the gate.”

** Peter Meade is editor of DSstar, a sister publication of GRIDtoday that covers the enterprise storage market. He is a frequent contributor to this publication.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Supercomputer Research Reveals Star Cluster Born Outside Our Galaxy

July 11, 2020

The Milky Way is our galactic home, containing our solar system and continuing into a giant band of densely packed stars that stretches across clear night skies around the world – but, it turns out, not all of those st Read more…

By Oliver Peckham

Max Planck Society Begins Installation of Liquid-Cooled Supercomputer from Lenovo

July 9, 2020

Lenovo announced today that it is supplying a new high performance computer to the Max Planck Society, one of Germany's premier research organizations. Comprised of Intel Xeon processors and Nvidia A100 GPUs, and featuri Read more…

By Tiffany Trader

Xilinx Announces First Adaptive Computing Challenge

July 9, 2020

A new contest is challenging the computing world. Xilinx has announced the first Xilinx Adaptive Computing Challenge, a competition that will task developers and startups with finding creative workload acceleration solutions. Xilinx is running the Adaptive Computing Challenge in partnership with Hackster.io, a developing community... Read more…

By Staff report

Reviving Moore’s Law? LBNL Researchers See Promise in Heterostructure Oxides

July 9, 2020

The reality of Moore’s law’s decline is no longer doubted for good empirical reasons. That said, never say never. Recent work by Lawrence Berkeley National Laboratory researchers suggests heterostructure oxides may b Read more…

By John Russell

President’s Council Targets AI, Quantum, STEM; Recommends Spending Growth

July 9, 2020

Last week the President Council of Advisors on Science and Technology (PCAST) met (webinar) to review policy recommendations around three sub-committee reports: 1) Industries of the Future (IotF), chaired be Dario Gil (d Read more…

By John Russell

AWS Solution Channel

Best Practices for Running Computational Fluid Dynamics (CFD) Workloads on AWS

The scalable nature and variable demand of CFD workloads makes them well-suited for a cloud computing environment. Many of the AWS instance types, such as the compute family instance types, are designed to include support for this type of workload.  Read more…

Intel® HPC + AI Pavilion

Supercomputing the Pandemic: Scientific Community Tackles COVID-19 from Multiple Perspectives

Since their inception, supercomputers have taken on the biggest, most complex, and most data-intensive computing challenges—from confirming Einstein’s theories about gravitational waves to predicting the impacts of climate change. Read more…

Penguin Computing Brings Cascade Lake-AP to OCP Form Factor

July 7, 2020

Penguin Computing, a subsidiary of SMART Global Holdings, Inc., announced yesterday (July 6) a new Tundra server, Tundra AP, that is the first to implement the Intel Xeon Scalable 9200 series processors (codenamed Cascad Read more…

By Tiffany Trader

Max Planck Society Begins Installation of Liquid-Cooled Supercomputer from Lenovo

July 9, 2020

Lenovo announced today that it is supplying a new high performance computer to the Max Planck Society, one of Germany's premier research organizations. Comprise Read more…

By Tiffany Trader

President’s Council Targets AI, Quantum, STEM; Recommends Spending Growth

July 9, 2020

Last week the President Council of Advisors on Science and Technology (PCAST) met (webinar) to review policy recommendations around three sub-committee reports: Read more…

By John Russell

Google Cloud Debuts 16-GPU Ampere A100 Instances

July 7, 2020

On the heels of the Nvidia’s Ampere A100 GPU launch in May, Google Cloud is announcing alpha availability of the A100 “Accelerator Optimized” VM A2 instance family on Google Compute Engine. The instances are powered by the HGX A100 16-GPU platform, which combines two HGX A100 8-GPU baseboards using... Read more…

By Tiffany Trader

Q&A: HLRS’s Bastian Koller Tackles HPC and Industry in Germany and Europe

July 6, 2020

In this exclusive interview for HPCwire – sadly not face to face – Steve Conway, senior advisor for Hyperion Research, talks with Dr.-Ing Bastian Koller about the state of HPC and its collaboration with Industry in Europe. Koller is a familiar figure in HPC. He is the managing director at High Performance Computing Center Stuttgart (HLRS) and also serves... Read more…

By Steve Conway, Hyperion

OpenPOWER Reboot – New Director, New Silicon Partners, Leveraging Linux Foundation Connections

July 2, 2020

Earlier this week the OpenPOWER Foundation announced the contribution of IBM’s A21 Power processor core design to the open source community. Roughly this time Read more…

By John Russell

Hyperion Forecast – Headwinds in 2020 Won’t Stifle Cloud HPC Adoption or Arm’s Rise

June 30, 2020

The semiannual taking of HPC’s pulse by Hyperion Research – late fall at SC and early summer at ISC – is a much-watched indicator of things come. This yea Read more…

By John Russell

Racism and HPC: a Special Podcast

June 29, 2020

Promoting greater diversity in HPC is a much-discussed goal and ostensibly a long-sought goal in HPC. Yet it seems clear HPC is far from achieving this goal. Re Read more…

Top500 Trends: Movement on Top, but Record Low Turnover

June 25, 2020

The 55th installment of the Top500 list saw strong activity in the leadership segment with four new systems in the top ten and a crowning achievement from the f Read more…

By Tiffany Trader

Supercomputer Modeling Tests How COVID-19 Spreads in Grocery Stores

April 8, 2020

In the COVID-19 era, many people are treating simple activities like getting gas or groceries with caution as they try to heed social distancing mandates and protect their own health. Still, significant uncertainty surrounds the relative risk of different activities, and conflicting information is prevalent. A team of Finnish researchers set out to address some of these uncertainties by... Read more…

By Oliver Peckham

[email protected] Turns Its Massive Crowdsourced Computer Network Against COVID-19

March 16, 2020

For gamers, fighting against a global crisis is usually pure fantasy – but now, it’s looking more like a reality. As supercomputers around the world spin up Read more…

By Oliver Peckham

[email protected] Rallies a Legion of Computers Against the Coronavirus

March 24, 2020

Last week, we highlighted [email protected], a massive, crowdsourced computer network that has turned its resources against the coronavirus pandemic sweeping the globe – but [email protected] isn’t the only game in town. The internet is buzzing with crowdsourced computing... Read more…

By Oliver Peckham

Supercomputer Simulations Reveal the Fate of the Neanderthals

May 25, 2020

For hundreds of thousands of years, neanderthals roamed the planet, eventually (almost 50,000 years ago) giving way to homo sapiens, which quickly became the do Read more…

By Oliver Peckham

DoE Expands on Role of COVID-19 Supercomputing Consortium

March 25, 2020

After announcing the launch of the COVID-19 High Performance Computing Consortium on Sunday, the Department of Energy yesterday provided more details on its sco Read more…

By John Russell

Honeywell’s Big Bet on Trapped Ion Quantum Computing

April 7, 2020

Honeywell doesn’t spring to mind when thinking of quantum computing pioneers, but a decade ago the high-tech conglomerate better known for its control systems waded deliberately into the then calmer quantum computing (QC) waters. Fast forward to March when Honeywell announced plans to introduce an ion trap-based quantum computer whose ‘performance’ would... Read more…

By John Russell

Neocortex Will Be First-of-Its-Kind 800,000-Core AI Supercomputer

June 9, 2020

Pittsburgh Supercomputing Center (PSC - a joint research organization of Carnegie Mellon University and the University of Pittsburgh) has won a $5 million award Read more…

By Tiffany Trader

Global Supercomputing Is Mobilizing Against COVID-19

March 12, 2020

Tech has been taking some heavy losses from the coronavirus pandemic. Global supply chains have been disrupted, virtually every major tech conference taking place over the next few months has been canceled... Read more…

By Oliver Peckham

Leading Solution Providers

Contributors

10nm, 7nm, 5nm…. Should the Chip Nanometer Metric Be Replaced?

June 1, 2020

The biggest cool factor in server chips is the nanometer. AMD beating Intel to a CPU built on a 7nm process node* – with 5nm and 3nm on the way – has been i Read more…

By Doug Black

Nvidia’s Ampere A100 GPU: Up to 2.5X the HPC, 20X the AI

May 14, 2020

Nvidia's first Ampere-based graphics card, the A100 GPU, packs a whopping 54 billion transistors on 826mm2 of silicon, making it the world's largest seven-nanom Read more…

By Tiffany Trader

‘Billion Molecules Against COVID-19’ Challenge to Launch with Massive Supercomputing Support

April 22, 2020

Around the world, supercomputing centers have spun up and opened their doors for COVID-19 research in what may be the most unified supercomputing effort in hist Read more…

By Oliver Peckham

Australian Researchers Break All-Time Internet Speed Record

May 26, 2020

If you’ve been stuck at home for the last few months, you’ve probably become more attuned to the quality (or lack thereof) of your internet connection. Even Read more…

By Oliver Peckham

15 Slides on Programming Aurora and Exascale Systems

May 7, 2020

Sometime in 2021, Aurora, the first planned U.S. exascale system, is scheduled to be fired up at Argonne National Laboratory. Cray (now HPE) and Intel are the k Read more…

By John Russell

Summit Supercomputer is Already Making its Mark on Science

September 20, 2018

Summit, now the fastest supercomputer in the world, is quickly making its mark in science – five of the six finalists just announced for the prestigious 2018 Read more…

By John Russell

TACC Supercomputers Run Simulations Illuminating COVID-19, DNA Replication

March 19, 2020

As supercomputers around the world spin up to combat the coronavirus, the Texas Advanced Computing Center (TACC) is announcing results that may help to illumina Read more…

By Staff report

$100B Plan Submitted for Massive Remake and Expansion of NSF

May 27, 2020

Legislation to reshape, expand - and rename - the National Science Foundation has been submitted in both the U.S. House and Senate. The proposal, which seems to Read more…

By John Russell

  • arrow
  • Click Here for More Headlines
  • arrow
Do NOT follow this link or you will be banned from the site!
Share This