The Leading Source for Global News and Information Covering the Ecosystem of High Productivity Computing
March 07, 2008
For some time there has been a growing disparity between processor power and memory capacity. As processors and systems have become more computationally dense, memory capacities have been growing more slowly. This has been less of an issue in PCs, where both compute power and memory are relatively abundant, but as high performance computing, virtualization, and large transactional databases take hold in the datacenter, memory has increasingly become the bottleneck to system performance. While lack of memory bandwidth may be the ultimate constraint for maximum CPU performance, for memory-constrained applications, sufficient RAM is required just to get past the formidable disk bottleneck.
Unfortunately scaling memory is not nearly as simple as scaling processing power. An economic discontinuity exists in the DRAM market. While 1 Gbit DRAM chips cost around $3, a 2 Gbit DRAM chips cost $30 to $50 dollars. And since commodity systems with DDR2 memory controllers are designed with only a limited number of DIMMs (which themselves contain a fixed number of DRAM chips), the only way to get more memory is to go with the outrageously expensive 2 Gbit chips. Since the memory controllers are designed to talk with a limited number of DRAMs, this effectively limits affordable capacity.
But MetaRAM, a 2-year-old fabless semiconductor startup, is attempting to change the game. Last week, the company unveiled a new product line that is able to aggregate 1 Gbit DRAMs so as to increase affordable main memory by a factor of two to four. The 34-person company has developed new memory technology -- called MetaSDRAM -- that allows 8GB and 16GB DIMMs to be built from low-cost 1 Gbit memory components. According to the company, a four-socket server can be populated with 256 GB of MetRAM memory for only $50,000. Using conventional DDR2 memory, this same system would cost $500,000 -- $480,000 of which is just DRAM.
The new MetaRAM device contains a special chipset that virtualizes the 1 Gbit components so that a DDR2 memory controller thinks its talking with 2 Gbit or 4 Gbit DRAMs instead of multiple 1 Gbit DRAMs. (Fortunately, the memory controllers aren't smart enough to know that 4 Gbit DRAMs don't even exist yet.) Internally, the 1 Gbit DRAMs think they're talking directly with the memory controller. The chipset in the middle contains the smarts to do the needed DDR2 protocol translations and keep the memory accesses coherent.
"So we look like a DRAM to the memory controller, and a memory controller to the DRAM," says Suresh Rajan, MetaRAM VP Marketing.
Rajan and CEO Fred Weber founded MetaRAM in January 2006 with the idea of leapfrogging the "memory capacity gap." Weber, a former CTO at AMD who helped drive the 64-bit x86 revolution at his former company, saw the new opportunity in the processor-memory disparity and decided that the growth of Web 2.0, datacenter virtualization, and high performance computing were going to create a large demand for x86-based systems with much larger memory footprints. In the HPC realm, memory-hungry applications in aerospace/automotive, financial services, digital content creation and rendering, oil and gas exploration, and semiconductor design and simulation would especially benefit from lots of RAM.
"Where this really gets exciting is that we can build very powerful clusters at very affordable prices to solve problems that are memory bound," says Rajan. "Obviously some applications are compute bound, but others are memory bound, and for the latter, this is a great technology."
The first two MetaRAM products announced are the 8 GB and 16 GB R-DIMMs. The 8 GB chipset is currently in full production and is available for $200 , while the 16 GB is qualified, but not yet in production, and will be priced at $450. Hynix Semiconductor and SMART Modular Technologies have partnered with MetaRAM to produce the memory modules for system vendors. Hynix is a large semiconductor manufacturer with relationships with many of the tier one OEMs; SMART Modular is focused on the tier two players, plus system builders and other channel customers.
Server/cluster vendors that are currently committed to carrying the first 8GB DIMMs include Appro, Rackable, Colfax International and Verari. System availability is expected in Q1 of this year, that is, any day now. Rajan says they're talking with all the usual HPC system vendors, and he says we can anticipate more OEMs to announce MetaRAM support at some point. He notes that they currently have a working eight-socket Opteron Sun machine in-house with 512 GB of MetaRAM memory.
Some OEMs might be resistant to offering high-memory servers at commodity pricing, since this might cut into their high-end server margins. But if more system vendors offer MetaRAM-based systems, it would be difficult for their competitors to resist, especially in the highly price-sensitive x86 server market.
Page: 1 of 2(Digg, Technorati, more)
New Paper: Parallel Computing Without Parallel Programming
Learn how domain experts can run VHLL programs like MATLAB® on a variety of high-performance platforms without low-level reprogramming and how to work with the largest datasets and complex algorithms without sacrificing ease of use or reducing productivity.
Jul 09 | Engineer Live | The demand for computational tools to underpin the 3D seismic interpretation process has never been more apparent. Read more...
Jul 08 | EE Times | Unemployment for U.S. engineers has reached record levels, according to government figures. Read more...
Jul 08 | Network World | Global spending for 2009 projected to drop 6 percent, for a total of $3.2 trillion. Read more...
Jul 08 | Linux Magazine | Portability or efficiency? Neither is guaranteed when writing explicit parallel code. Read more...
Jul 07 | Ars Technica | Japanese company builds custom ASIC to accelerate real-time ray traced rendering for the auto industry. Read more...
Jul 10 | | Engineers, scientists, and other domain experts depend on the productivity enabled by very high-level language (VHLL) tools like MATLAB® and Python. However, as datasets grow larger and programs get more sophisticated, ordinary desktop computers can no longer keep up. The paper explores how to run VHLL programs on high-performance platforms without low-level reprogramming. Work with large datasets and complex algorithms without sacrificing ease of use or reducing productivity.
Apr 14 | | Many HPC IT departments are feeling the rising pressure to deliver more capacity computing and performance while trying to reduce the total cost of ownership. This white paper discusses how an environmentally-friendly and open-standards HPC building block based computing system using flexible interconnect options helps address capacity computing needs.
Source: Addison Snell, GM/VP, Tabor Research; sponsored by Dell
Many organizations that could benefit from the use of HPC clusters find that it is complicated to get the systems up and running because of limited IT resources or the complexities of the clusters themselves. Learn how the Intel Cluster Ready program, for which Dell was an original partner, seeks to address this challenge for entry level and mid-range HPC users.
BlueArc's Titan architecture represents an evolutionary step in file servers by creating a hardware-based file system that can scale bandwidth, IOPS, and overall data capacity well beyond conventional software-based devices. With its ability to virtualize a massive storage pool of up to four usable petabytes of tiered storage, Titan can scale with growing data requirements, offering a competitive advantage for businesses, researchers, or other enterprises seeking to better manage data growth while still ensuring optimal performance.
Sun Studio Compilers and Tools and Sun HPC ClusterTools allow you to create high performance parallel applications for OpenSolaris, Solaris and Linux. Sun Studio Express 11/08 includes MPI performance analysis capabilities and full OpenMP 3.0 compiler support. Learn about all this and the latest in Sun HPC ClusterTools 8.1.