Visit additional Tabor Communication Publications
March 31, 2011
The HPC conference season started in earnest this Tuesday with the HPCC event, a two-and-a-half day conference that attracts a rather elite lineup of speakers and attendees. HPCC, aka the Newport conference (after its Rhode Island locale), is celebrating its 25th anniversary this year, and as someone pointed out, that makes it a couple of years older than the much better-known (and much larger) SC conference of November.
HPCC is geared for the government supercomputing crowd, but topic coverage is broadly spread across the HPC application spectrum. The first day was no exception; it included everything from climate modeling and computational engineering to exascale computing.
Two of the sessions on Tuesday had to do with the infamous "missing middle" problem in HPC. That termed, coined by the Council on Competitiveness, refers to the group of HPC users between low-end, mostly workstation-bound HPC users, and the kind of high-end HPC typically performed at national labs and some universities. The problem is that transitioning from desktop HPC to server-based HPC is filled with roadblocks, especially for commercial users looking to make the leap to small clusters.
At HPCC, we got two perspectives on the problem -- one from a vendor, the other from a user. The user in this case was Caterpillar, a company that is fairly well along in the adoption of HPC. In that sense, they're no longer missing, but they're definitely in the middle. According to Keven Hoffstetter, Caterpillar's research program manager for their Virtual Product Development group, they use HPC-based virtual tools to drive much of their product development.
Caterpillar is the largest manufacturer of construction and mining equipment, which may make it seem like they're not a "middle" HPC user at all. But keep in mind that the term refers to a capability level, not to company size or R&D budget, which in the case of Caterpillar is significant on both counts (93K employees and $1 billion-plus for R&D). Even though they use a number of virtual tools to model things like engines, cooling systems, and linkage systems, they do so at a relatively low capability level.
For example, Caterpillar does modeling of a differential system using a machine with just 64 cores. That simulation takes 4 weeks. According to Hoffstetter, they would like to increase the performance by at least an order of magnitude so that they can run that model much faster and do hundreds of them at a time. The idea is to go through many iterations as possible in order to optimize the design.
Despite the modest level of use, they've managed to realize significant gains from these tools. They're no longer tied to a 10-year product development cycle, which used to be the accepted timeline at Caterpillar. The virtual modeling speeds up all development phases: concept, design, and testing.
But for Caterpillar, there are still major hurdles to get to greater levels of HPC including getting software to scale to greater level, software licensing costs (which Hoffstetter said are already greater than hardware costs), software usability, multi-physics integration and dealing with increasing hardware complexity (including divergent technologies like CPUs and GPUs).
But Hoffstetter contends the biggest problem there is the cultural change required to move to a complete virtual design paradigm. Basically the fear of doing something differently and the concern that the virtual models do not accurately represent the physical product hinder the acceptance of these tools internally. Besides, the models don't yet simulate everything, so the engineers end up having to build a physical prototype anyway. In such an environment, the adoption of this technology is by degrees, rather than en masse.
From the vendor side, we had Intel's Stephen Wheat talking up the democratization of HPC. Intel, of course, has a big stake in broadening the HPC market as far and as fast as it can go. Wheat said HPC currently represents north of 25 percent of their processor revenue and volume. Obviously, expanding that market means big bucks for the chipmaker.
To Intel, this missing middle appears to be low-hanging fruit. A specific focus lately has been the manufacturing industry, which represents a particularly large group of users who have a quantified need for HPC-level simulation and modeling. Intel estimates of that there are about 280,000 small and mid-sized manufacturers in the US alone, and nearly half would use advanced simulation and modeling. That represents a market nearly as large as the entire global HPC segment today.
To that end, Intel has been a big driver behind the newfound Alliance for High Performance Digital Manufacturing (AHPDM), a group devoted to bringing high performance computing to manufacturers. It is especially targeted to these smaller players without the resources or expertise to swallow HPC whole. The alliance consists of industry players, ISVs, academia, and HPC centers of various stripes. The idea is to leverage the existing HPC expertise and infrastructure in order to connect to dots for the manufacturing companies.
Intel is not the only vendor in this group. Microsoft, HP, Dell, NVIDIA, ANSYS and Cray (to name a few) are also part of this. (Disclaimer: HPCwire's publisher, Tabor Communications, is an Alliance member too.) In any case, I suspect we'll be hearing a lot more about AHPDM in the future as some of the efforts get going.
As Wheat said, the definition of success if fairly simple: "When the middle is no longer missing."
Posted by Michael Feldman - March 31, 2011 @ 9:37 AM, Pacific Daylight Time
Michael Feldman is the editor of HPCwire.
No Recent Blog Comments
In a recent solicitation, the NSF laid out needs for furthering its scientific and engineering infrastructure with new tools to go beyond top performance, Having already delivered systems like Stampede and Blue Waters, they're turning an eye to solving data-intensive challenges. We spoke with the agency's Irene Qualters and Barry Schneider about..
Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
The Xeon Phi coprocessor might be the new kid on the high performance block, but out of all first-rate kickers of the Intel tires, the Texas Advanced Computing Center (TACC) got the first real jab with its new top ten Stampede system.We talk with the center's Karl Schultz about the challenges of programming for Phi--but more specifically, the optimization...
May 22, 2013 |
At some point in the not-too-distant future, building powerful, miniature computing systems will be considered a hobby for high schoolers, just as robotics or even Lego-building are today. That could be made possible through recent advancements made with the Raspberry Pi computers.
May 16, 2013 |
When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
May 15, 2013 |
Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
May 10, 2013 |
Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.
04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.
In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.
The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.