The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
HPC Matters is a joint blog consisting of contributors from the Tabor Communications team on their observations and insights into HPC matters.
April 10, 2009
Do you know what the saddest part is of Rackable’s attempt to acquire SGI’s assets out of bankruptcy for a paltry $25 million in cash? There would be very little effect on the HPC market as a result. Thanks to a cascading series of problems -- bad marketing, operational misfires, bad technology bets, and ordinary bad luck -- SGI didn’t have much left to lose.
It’s been a long ride down for SGI, and now that it’s finally over, it comes almost as a relief, like when a beloved relative who has been suffering in pain for years with a terminal disease, a mere shadow of former self, finally passes. It gives us one last chance to light candles, join hands, and remember an icon that once exemplified Silicon Valley greatness.
The industry is full of opinions on what went wrong with SGI, and I’ve got my stories too. Before becoming an HPC industry analyst, I worked at SGI myself for a little over six years in product management and product marketing roles in SGI’s server group.
When I joined Silicon Graphics in August 1997, the company appeared on the cover of BusinessWeek magazine. No, not that cover story. People remember SGI being anointed “the Gee-Whiz Company” by BusinessWeek in 1994, but some forget that the same publication announced the company’s downfall only three years later in “The Sad Saga of Silicon Graphics.” Marketing was to blame, along with reckless spending; that much was common knowledge. On both these points, I’d have to agree.
I didn’t witness the most reckless of spending myself -- Silicon Graphics’ final infamous Winterfest party, with its headliners, revelry, and raucous lip-synch contests, was just before my time -- but I can tell you what marketing was doing about our position in HPC. We were killing it as fast as ever we could.
Silicon Graphics’ technology leadership in workstations and servers was widely acknowledged, but our executive leadership suffered from a collective inferiority complex when compared to Sun, Silicon Graphics’ twin sister. (Both companies were founded in the same year, with neighboring headquarters, and they competed in several markets.) Sun was flying high with enterprise servers based on SPARC and Solaris, and we wanted to catch up. The training I was given that first summer directed me to assume, as an inalienable truth, that the market didn’t care about performance. We had to beat Sun with enterprise features, with an emphasis on RAS (reliability, availability, serviceability). I was the IRIX product manager at the time. I could probably still whiteboard the “Cellular IRIX” presentation for you.
You have to give CEO Ed McCracken credit; he wanted the company to be responsive to customer needs, and he listened to marketing. Marketing just blew it. We told engineering to back off on performance and give us RAS. We made “Sun Sucks” bandit videos and rewrote the marketing literature. Instead of taking on Alpha in HPC, we took on Solaris in enterprise. We were destined to fail.
Interestingly, one of the main platforms we competed against, the Sun Ultra Enterprise 10000, was based on technology we had sold to Sun. I’m in the minority opinion in that I don’t think we should have kept the technology for ourselves. I just think we should have gotten more money for it. Probably a lot more.
I should also mention Cray at this point. We owned them, and I worked in the server group. I have to say, I really didn’t know those guys back then. Again, it’s not the purchase I had a problem with, but rather the implementation. We never integrated those teams. The eventual planned merger of the roadmaps got delayed more than a few times. As of this moment, it was slated to come to market in SGI’s Ultraviolet product, which might yet never see the light of day.
The next chapter is well-documented. McCracken was out, and our new CEO was Rick Belluzzo, who was further convinced that volume was the key to success. He put Silicon Graphics’ resources on souped-up PCs -- I was still working on servers, trying to sell our customers IRIX-based systems as Belluzzo told the world we wouldn’t invest in them -- and he boldly announced the plan to the world. He abruptly quit weeks later. He was not popular around the water cooler after that.
Then came Bob Bishop, and I have to tell you, things started to get better, at least with strategy and marketing. It was 1999 when he took over. Bishop loved the big systems, and he brought us back to an HPC strategy. We built excitement in our roadmap. And we made it to what was supposed to be a critical turning point: the launch of Origin 3000 at the end of June 2000.
Reckless spending? Yeah, there was some. The annual sales conference, where we did the internal launch, featured magicians Penn and Teller and motivational speaker Tony Robbins (not to be confused with namesake Anthony Robbins, who ran federal sales), among others, in a supercool tent event set up adjacent to our supercool new buildings, a campus known today as the GooglePlex. Still, it could have been worse.
Bad marketing? I have to say, the NUMAflex messaging was a little off. Not entirely my fault, but by the time everyone had their say, "NUMAflex" represented a suite of tangentially related benefits, only one of which was performance. Still, it could have been worse.
And the launch? What a success. We had enough sales already booked to make our next two quarters. That's when different culprits got us: operations and plain old bad luck.
We'd been shipping O3Ks for only a few weeks - weren't even at full speed yet - when we hit a part outage. It was the ceramic packaging for the chips. We had a single-source supplier, IBM, and there was no ceramic packaging to be had. We went on stop-ship, with the whole backlog of new orders waiting to be fulfilled. It took months. By the time we started the line again, a lot of those sales were gone.
No matter, we had the second punch coming, our first systems based on Itanium chips. First-generation Itanium chips. Anyone here remember Merced? Intel delayed it, then delayed it again, then pulled the plug on it. There went another batch of orders. We retooled for McKinley, also delayed.
We finally got our first Itanium and Linux-based Altix 3000 systems out at LinuxWorld in January 2002. It was a great product, and if I do say so myself, marketing was really ticking. The press was good, we won awards, and everyone knew what we were doing and what the value proposition was. It was just a tough sale.
By 2002, x86 clusters were the norm. Our standard pitches that explained why you shouldn't convert your code to MPI were moot. The codes were MPI, and they weren't coming back. Furthermore, Itanium had marginalized in the market - SGI picked the wrong chip. Back to the drawing board.
The rest is not as interesting, and it's more recent, so you remember it. SGI started working on clusters. Bishop was out, replaced by Dennis McKenna, who guided the company through its first bankruptcy, then Bo Ewald. Under Ewald, SGI purchased the remains of Linux Networx and integrated the software into its own cluster stack, but it was too little too late. Customers had been hurt too many times, and they weren't coming back.
The last five years have been a grind for SGI, and many of us - especially us former employees with purple blood - silently rooted for the company to succeed. "Not dead yet" was the constant diagnosis. Until this month. Now SGI is gone.
What happened? As with any prolonged misery, a lot of things went wrong. If you want my opinion, look at the things we did wrong, and the few we did right, in the six-year span from 1997 to 2003.
But forget about that now. Remember the good times. Remember what you loved about the Gee-Whiz Company. Tell a friend your favorite SGI story, and then listen to someone else's. Have an SGI-style TGIF beer bash and raise a glass to a Silicon Valley icon.
And then bury it, 'cause it's gone.
Posted by Addison Snell - April 10 @ 9:54AM
(Digg, Technorati, more)
There are 6 discussion items posted.
Torn between IRIX and Linux
Submitted by Merblich on 04/09/2009 - 9:57PM
I don't have the Crystal Ball, and I don't know how many computer companies which didn't start with Intel based CPUs will be around in 5 / 10 years, but,, SGI had a alot of fights: IRIX vs Linux, MIPS vs Intel, IRIX which wasn't a SVR4.x release, NVidia vs SGI, NUMA vs shared memory, ...
For a single company, it maybe was just trying to move into all possible directions at the same time and resulting in tearing itself apart?
Post #1
How it all went wrong for SGI
Submitted by jmh900 on 04/10/2009 - 7:10PM
In 1996 SGI bought Cray and, collectively, had access to an astonishingly broad and competent set of technologies. I watched from afar (as a Crayon) as the honeymoon faded into something that was not wedded bliss and the arrogance and hubris of Silicon Valley rode roughshod over the conservative and deeply experienced Silicon Prairie.
Leaving aside canceling Cray's vector architecture product and the Alpha-based T3E (Cray's most successful product ever), I think the example that most typifies the ineptitude of SGI's management was selling Cray Business Systems to SUN. Whatever SGI's motives, this was an "audacious" initiative by the SGI executive team. SGI's actions were like placing the business end of a loaded gun in your mouth and inviting your fiercest enemy to pull the trigger - SUN obliged. It is said that when asked for his opinion on the price that SGI might get for the CS6400 product and development team etc, a very senior SGI scientist replied that "it didn't matter what price they got as it wasn't worth a s**t anyway".
Extraordinary days, the most amazing this is that SGI lasted as long as it did.
Post #2
How it went wrong for SGi
Submitted by shawnu on 04/10/2009 - 8:01PM
Nice post, Addison. My career there was a little bit of a mirror of yours, but on graphics. Funny how some of the perspective differs, but how it all converged in the end.
Post #3
How it went wrong for SGi
Submitted by atzanov on 04/29/2009 - 10:37AM
There is only one reason for SGI's fail - bad management. But hey, these executive MBAs were the same "entiteled to be executive" but inept people who destroy GM, Crysler etc.
Post #4
The battleground...
Submitted by rgreen1 on 04/30/2009 - 9:30AM
Good post. I remember joining before the 3000 launch and thinking we were destined to succeed. With about a billion in the bank, the most amazing products, and the phenomenal campus, how could we not succeed?
The graphics story at SGI was quite similar to the HPC story.
You can't forget about the name change from Silicon Graphics to SGI, or the highly unsuccessful acquisition of Alias Wavefront either.
Another battleground was keeping developers and ISVs working on IRIX. With lower volumes, mixed messages, and fewer customers, it was hard to keep applications. Another battle lost.
Maybe Rackable will change their name to SGI so we can live our memories vicariously through another company?
Post #5
SGI, Not Alone
Submitted by EricS on 05/02/2009 - 1:05PM
Addison, SGI made many bone head moves, significantly more then you mentioned. I believe the biggest (and, SGI is not alone in this) is forgetting to look at the history of high tech companies while planning forward looking campaigns.
I went to work at SGI when the late, great Digital Equip. Corp. was acquired by Compaq. Watched the demise of their rival Data General as it was acquired by EMC, on and on.
Great products or technology no longer define business success. Today it's more about picking markets with a sustainable business plan. SGI was strong in graphics and HPC. Missteps and market changes in the graphics segment left only HPC. From a marketing pyramid standpoint, HPC is at the pointy little top, not a big market segment, lots of competitors. And, most of the those competitors have significantly larger commercial markets to shore up their HPC business.
Without a commercial market segment, SGI needed a right-sized organization to stay profitable in HPC. Not an easy thing to do with the commoditization and proliferation of beowulf clusters.
Sun has a different set of issues. Looks like both companies are tracking - from inception to demise.
Post #6
PGI Accelerator™ Fortran 95/03 and C99 compilers for x64+NVIDIA
Accelerate applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing Fortran and C programs. Available now for Linux, MacOS and Windows. Download a free 15 day trial.
Platform HPC Workgroup Manager
Platform HPC Workgroup Manager integrates all the cluster productivity tools you need to deploy, run and manage your HPC environment.
Addison Snell is the CEO of InterSect 360 Research
More Addison Snell
Re: Multicore Watershed by Nastyanna
HPC? not so much by ewahl
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Podcast: A Trio of HPC Apps by sibat0705
Re: Cray Corrals Big Defense Deal by watchesuk
We think by watchesuk
Re: IBM and HPC by truly64
HPC = servers but a lot more by lawries
Lena by Nastyanna
Lena by Nastyanna
Multi core deployment becomes a memory game by truly64
Re: Venture Capital Drought? Not So Much. by Ron Van Holst
Re: AMD Confirms 12-Core Opteron Production by Nastyanna
Re: Cray Corrals Big Defense Deal by Nastyanna
Re: Podcast: Cray Awarded Defense Deal; SGI Makes Storage Buy; IBM Invents New Algorithm by Nastyanna
Painful Truth by jeffrey.mcallister
SGI = graphics + HPC by johnbarr
HPC = servers but a lot more by truly64
Oracle SPARC != Fujitsu SPARC by Alan M. Feldstein
Sun & HPC != Oracle & HPC by Merblich
a third vendor for lossless low latency 10GbE fabric by lee.fisher@hp.com
Response to GAH by KevinButerbaugh
Response to KevinButerbaugh by GAH
Response to KevinButerbaugh by GAH
Response to GAH by KevinButerbaugh
Response to bdrupp by KevinButerbaugh
Climate Crisis and Exaflops by bdrupp
Climate Crisis and Exaflops by John Hules
Climate Crisis and Exaflops by GAH
Climate Crisis by KevinButerbaugh
IBM "Brain Simulation" article is not properly presented. by Merritt
563 out of 1206 by vvolkov
Little Iron by gadunk
At least it's not "cloud" by KevinButerbaugh
Native QPI Interface? by commike
Mmmmmm by hellcats
New transistorized IC chip scales. by symmecon
Itanium at IDF by Alan M. Feldstein
Communication time by jnapper
"The financial meltdown and computing" by donpellegrino
Human Models by mdgabriel
High-End SPARC Chip for Scientific Applications by Alan M. Feldstein
RapidMind by Mr LolO
Rapidmind by dminor
Longer run times by JohnWest
re: Algo trading Angst by jshore
Results of Testing by in_the_crease
The Moscow State University supercomputer, Lomonosov, has been selected for a high-performance makeover, with the goal of tripling its processing power to achieve petaflop-level performance in 2010. T-Platforms, who developed and manufactured the supercomputer, is the odds-on favorite to lead the project.
Read More...
Right on schedule, Intel has launched its Xeon 5600 processors, codenamed "Westmere EP." The 5600 represents the 32nm sequel to the Xeon 5500 (Nehalem EP) for dual-socket servers. Intel is touting better performance and energy efficiency, along with new security features, as the big selling points of the new Xeons.
Read More...
The ACM Turing Award goes to the creator of the modern personal computer; and Voltaire announces a mid-range InfiniBand switch and new technology that accelerates distributed applications. We recap those stories and more in our weekly wrapup.
Read More...
Mar 18 | ChannelWeb | Westmere parts already showing up in HPC machines. Read more...
Mar 17 | The Register | But what about the tier ones? Read more...
Mar 17 | Cadalyst Magazine | A new generation of workstations is changing the nature of technical computing. Read more...
Mar 17 | Linux Magazine | Latest iteration of Sun Grid Engine able to tap into Cloud. Read more...
Mar 16 | Bio-IT World | Biotech firm builds genetic models from patient data. Read more...
Jan 12 | | In-depth look at vSMP Foundation server virtualization technology, technical implementation, use cases and capabilities. The technical whitepaper provides an architectural overview and details on the three vSMP Foundation products: vSMP Foundation for SMP, vSMP Foundation for Cluster and vSMP Foundation for Cloud.
Jan 18 | | This white paper discusses Gore’s copper cable assemblies, and how they continue to exceed the standards for providing reliable, cost-effective solutions for high-performance computer applications.
Join this online panel discussion for live Q&A with leading industry experts, analysts, and end-users to discuss the latest innovations, best practices, barriers to implementation, and measurable benefits of server virtualization with a particular focus on today's real world solutions.
Learn about scalable fault-tolerant architectures and examples of energy efficient and scalable supercomputing clusters using dual QDR InfiniBand to combine capacity computing with network failover capabilities with the help of programming languages such as MPI and a robust Linux cluster management package.
LIVE@SCO9: The IBM team discusses new innovations in hardware, software and services that help clients better understand their workloads and get insight from their R&D efforts. Technology demonstrations include the soon-to-be-released Power7 HPC processor, the DCS990 system with 2.4 petabytes of storage, the xCAT management tool, secure HPC cloud computing and more. Winners of two HPCwire Readers' and Editors’ Choice Awards! Take the IBM virtual tour at SC09 or more information go online to: http://www-03.ibm.com/systems/deepcomputing/sc09.html