Aspen
Texas Advanced Computing Center
HPCwire

Since 1986 - Covering the Fastest Computers
in the World and the People Who Run Them

Language Flags

Visit additional Tabor Communication Publications

Datanami
Digital Manufacturing Report
HPC in the Cloud
Green Computing Report

Tabor Communications
Corporate Video

ClusterVision Leads HPC Cluster Partnership at U Nottingham


NOTTINGHAM, United Kingdom, Jan. 24 – ClusterVision, Europe’s dedicated specialist in high performance computing solutions, has announced the successful completion of Minerva, the latest generation HPC cluster at the University of Nottingham.

Minerva represents a significant progression of the existing capability from the previous “Jupiter” systems and continues the trend for pioneering HPC development at the University. Inspiration for the name of the latest system originates from Minerva, the daughter of Jupiter and the patron goddess of wisdom and craft.

The new cluster was officially opened by Professor Tom Rodden of the Interactive Systems Mixed Reality Laboratory (MRL), and Professor Saul Tendler, Pro-Vice-Chancellor for Research, as the highlight of the University’s annual HPC Conference. In addition to presentations from ClusterVision and the other technology partners, the 2-day Conference showcased a wide variety of the University’s current research applications, including presentations on quantum and astrophysics simulation, genome modelling, advanced mathematical solutions, and molecular chemistry.

As the prime contractor for the design, build and management of the Minerva system, ClusterVision managed a complex collaboration of 17 hardware and software partners. Key contributors to the Minerva project included Dell, Intel, Qlogic, NVIDIA, Panasas, Bright Computing, Altair Engineering and Allinea.

The realisation of the Minerva project benefitted from a long-standing relationship between ClusterVision, the University of Nottingham and many of the other collaboration partners. Following the successful trial of Bright Cluster Manager (from Bright Computing) and PBS Professional (from Altair Engineering) on a Dell server, ClusterVision were invited to continue to work with the University of Nottingham HPC team in order to help develop the functional requirements for a next generation cluster system. Although originally issued in October 2011 the public tender was subsequently delayed to consider the potential benefit from newly emerging technologies, one of the key objectives of the process, most notably the Sandybridge Xeon E5 processor series from Intel.

In a detailed tender response ClusterVision proposed the design and performance of a system based on a combination of hardware, software and service components which would surpass the functional requirements, together with a vision of the long term benefit that such a system and collaborative partnership would have on the University and its extended scientific user community. ClusterVision were confirmed as the prime contractor to the project in April 2012, with the system build, configuration and performance testing being successfully completed during the second half of the year.

“The ClusterVision solution won out in a very close competitive tender process, as the technology was judged to be the best match to our requirements.  In addition, ClusterVision and their selected partners showed a real commitment to collaborate with the University not only to deliver excellent hardware and software, but also a service package which met our specific requirements,” Dr. Colin Bannister, Senior HPC Development Officer, IT Services, The University of Nottingham.

The server anatomy of the cluster is based on Dell PowerEdge and PowerVault series components. ClusterVision and Dell drew upon a long established partnership, and the confidence of a number of collaborations on other major international academic cluster systems such as the University of Bordeaux in France, CRP Gabriel Lippmann in Luxembourg, and the King Abdullah University of Science and Technology in Saudi Arabia.

The Minerva system comprises 2 redundant master nodes; Dell Powerdege R720’s, with a single master node shared storage provided by the 2U 12 disk Dell PowerVault MD3200. The compute capacity is shared between 156 Dell PowerEdge nodes, arranged in Dell C6220 servers, with 12 high memory fast I/O nodes also in Dell 6220’s, and 6 additional GPU accelerated nodes. Originally designed using C6100 servers, the Dell compute node specification was subsequently upgraded to Dell PowerEdge C6620’s which were introduced as a vehicle for the latest Intel Xeon E5 Sandybridge processors. Each 2.6 Ghz compute unit contains a 500 GB local disk. The fast I/O nodes have 500 GB SATA and 4 100 GB SSD’s and are designed specifically for the high intensity needs of the applications. The 6 GPU accelerated nodes comprise a Supermicro base chassis, also incorporating the 8-core Intel Xeon E5 processor, together with 2 Tesla M2090 series GPU’s from NVIDIA.

Scalable parallel file storage is provided by 4 Panasas ActiveStor12 series shelves, incorporated as a complete storage appliance with the required management systems and Ethernet switching. Each Panasas ActiveStore12 shelf provides 60 TB capacity and 80 GB cache, giving a total theoretical and usable storage capacity of 240 TB and approximately 180 TB respectively. System interconnect is a dual-level combination of 1GB Ethernet for the administration and management communications and an Intel/Qlogic QDR InfiniBand fabric and switching system for the main application communications. All of the system components are mounted in 9 42U black server racks.

For the software environment ClusterVision selected 3 key providers, Bright Computing, Altair Engineering and Allinea.

Provisioning and cluster management is provided by 176 advanced version licences of the cluster management suite Bright Cluster Manager from Bright Computing. Bright Cluster Manager was used to manage the Linux environment and initial configuration process, and provides much of the software infrastructure for the everyday monitoring and healthcare of the system.

Bright Cluster Manager is also the enabler for ClusterVision’s innovative Remote Administration (RSA) service. RSA is a secure off-site cluster management service, designed to enhance the overall experience of cluster ownership by relieving much of the burden of in-house cluster operation. The inclusion of RSA in the Minerva project delivery allows ClusterVision’s engineers to continually monitor details of the cluster from its headquarters in Amsterdam, and to proactively diagnose and address potential performance and healthcare issues without the inconvenience and cost of an on-site visit. RSA is delivered as a suite of scalable work packages so is easy to customise to a range of basic, intermediate and advanced management requirements.

Although it is beyond the immediate scope of the current Minerva project, the Amazon EC2 cloud bursting functionality of Bright Cluster Manager’s advanced version also creates a working foundation for an anticipated cloud based extension at a later date.

A high level of user management and detailed usage analytics were identified as important operational requirements of the system. To address these needs ClusterVision worked in partnership with Altair Engineering to incorporate licences of Altair PBS Professional, PBS Compute Manager, and PBS Analytics. The software stack was completed with licences of the PGI CUDA Compiler, from the Portland Group, and Allinea’s Optimisation and Profiling, and Distributed Debugging tools, Allinea OPT and Allinea DDT.

The turn-key nature of the Minerva system is completed with ClusterVision’s on-site and post-delivery Professional Service offerings. In addition to the component delivery and pre-assembly of the Supermicro-NVIDIA GPU nodes, ClusterVision provided all the on-site server and storage racking, interconnect cabling, and software installation. This included system configuration, performance testing, acceptance certification, and on-site training on Bright Cluster Manager. Post-delivery services include extended support and maintenance for the various software components, and Dell Pro Support and Panasas PAS12 Silver Service warranty and on-site repair arrangements for the server and storage components, with ClusterVision providing continuity in a rationalised single point of contact and first level of service for all of the above.

The Minerva system, which is anticipated to operate at around 45 Tflops performance, will be a valuable local resource for students and research staff at the University of Nottingham. It will also provide a substantial compute capability for an extended network of collaborating United Kingdom Universities and enterprise businesses.

“We are immensely proud to have had the opportunity to design and build the University of Nottingham’s Minerva system, and to lead the collaboration of such a high calibre collection of partners. We are confident that the innovative systems and software technologies incorporated within Minerva, as well as the on-going provision of professional services, including our Remote System Administration service, will deliver a high performing and robust facility, and will keep the University of Nottingham at the forefront of HPC technology”, Christopher Huggins, Commercial Director ClusterVision.

About The University of Nottingham

The University of Nottingham, described by The Sunday Times University Guide 2011 as ‘the embodiment of the modern international university’, has 40,000 students at award-winning campuses in the United Kingdom, China and Malaysia. It is ranked in the UK's Top 10 and the World's Top 75 universities by the Shanghai Jiao Tong (SJTU) and the QS World University Rankings. It was named ‘the world’s greenest university’ in the UI GreenMetric World University Ranking 2011. More than 90 per cent of research at The University of Nottingham is of international quality, according to the most recent Research Assessment Exercise. The University’s vision is to be recognised around the world for its signature contributions, especially in global food security, energy & sustainability, and health. The University won a Queen’s Anniversary Prize for Higher and Further Education in 2011, for its research into global food security.

About Dell

For more than 28 years, Dell has empowered countries, communities, customers and people everywhere to use technology to realize their dreams. Customers trust us to deliver technology solutions that help them do and achieve more, whether they're at home, work, school or anywhere in their world.


About ClusterVision

ClusterVision specialises in the design, build and management of High Performance Compute (HPC) clusters. Compute clusters are used by pioneering organisations in academia and industry to run high-intensity computing applications in fields such as scientific research and development, manufacturing, healthcare and finance. By combining cutting-edge hardware and software components with a range of customised professional services, ClusterVision helps its customers create top-quality, efficient and reliable HPC solutions. In addition to systems technologies from leading manufacturers, ClusterVision's solutions typically include a range of HPC software components, such as easy to use cluster provisioning, management and monitoring. ClusterVision offers a full end-to-end portfolio of professional services - from system design, assembly and certification, to operational management, support, and training. With a background in applied scientific research, and practical experience in a wide range of HPC technologies, the ClusterVision team has designed and built some of the largest and most complex computational, storage and database clusters in Europe.

-----

Source: ClusterVision

 

Sponsored Links

Accelerate your science with Seneca
One of the first HPC providers installing a 4X NVIDIA Kepler K-20 cluster. Invites you to a free evaluation on Seneca’s NVIDIA K20 Kepler cluster, pre-loaded with AMBER, NAMD, LAMMPS

High-Performance Computing in Action
Businesses that want to be on the cutting edge of their industries are increasingly turning to high-performance computing (HPC) solutions to handle complex compute processes and speed up their rate of innovation. Download this Executive Brief to see how businesses in energy, life sciences and entertainment put HPC solutions to work in their operations.

Webinar: Programming Heterogeneous X64+GPU Systems Using OpenACC
Join Michael Wolfe as he compares the advantages and costs of using both low-level models and the directive-based OpenACC model for programming accelerated heterogeneous systems. Registration is free.

May 23, 2013

May 22, 2013

May 21, 2013

May 20, 2013

May 17, 2013

May 16, 2013

May 15, 2013

May 14, 2013

May 13, 2013

May 10, 2013


Most Read Features

Most Read Around the Web

Most Read This Just In

Supermicro

Feature Articles

Exascale Advocates Stand on Nuclear Stockpiles

In quieter times, sounding the bell of funding big science with big systems tends to resonate further than when ears are already burning with sour economic and national security news. For exascale's future, however, the time could be ripe to instill some sense of urgency....
Read more...

NSF Forges Further Beyond FLOPs

In a recent solicitation, the NSF laid out needs for furthering its scientific and engineering infrastructure with new tools to go beyond top performance, Having already delivered systems like Stampede and Blue Waters, they're turning an eye to solving data-intensive challenges. We spoke with the agency's Irene Qualters and Barry Schneider about..
Read more...

CERN, Google Drive Future of Global Science Initiatives

Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...

Short Takes

NASA Builds 'Climate in a Box'

May 23, 2013 | he study of climate change is one of those scientific problems where it is almost essential to model the entire Earth to attain accurate results and make worthwhile predictions. In an attempt to make climate science more accessible to smaller research facilities, NASA introduced what they call ‘Climate in a Box,’ a system they note acts as a desktop supercomputer.
Read more...

Building Supercomputers with Raspberries

May 22, 2013 | At some point in the not-too-distant future, building powerful, miniature computing systems will be considered a hobby for high schoolers, just as robotics or even Lego-building are today. That could be made possible through recent advancements made with the Raspberry Pi computers.
Read more...

Running Computational Fluid Dynamics in the Cloud

May 16, 2013 | When it comes to cloud, long distances mean unacceptably high latencies. Researchers from the University of Bonn in Germany examined those latency issues of doing CFD modeling in the cloud by utilizing a common CFD and its utilization in HPC instance types including both CPU and GPU cores of Amazon EC2.
Read more...

Computing the Physics of Bubbles

May 15, 2013 | Supercomputers at the Department of Energy’s National Energy Research Scientific Computing Center (NERSC) have worked on important computational problems such as collapse of the atomic state, the optimization of chemical catalysts, and now modeling popping bubbles.
Read more...

Internet2 Awards Program Seeks Innovative Applications

May 10, 2013 | Program provides cash awards up to $10,000 for the best open-source end-user applications deployed on 100G network.
Read more...

Sponsored Whitepapers

Best Practices in Big Data Storage

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Progress in Parallel: the Bull Parallel Programming Center

04/15/2013 | Bull | “50% of HPC users say their largest jobs scale to 120 cores or less.” How about yours? Are your codes ready to take advantage of today’s and tomorrow’s ultra-parallel HPC systems? Download this White Paper by Analysts Intersect360 Research to see what Bull and Intel’s Center for Excellence in Parallel Programming can do for your codes.

Sponsored Multimedia

SGI DMF ZeroWatt Disk Solution

In this demonstration of SGI DMF ZeroWatt disk solution, Dr. Eng Lim Goh, SGI CTO, discusses a function of SGI DMF software to reduce costs and power consumption in an exascale (Big Data) storage datacenter.

Cray CS300-AC Cluster Supercomputer Air Cooling Technology Video

The Cray CS300-AC cluster supercomputer offers energy efficient, air-cooled design based on modular, industry-standard platforms featuring the latest processor and network technologies and a wide range of datacenter cooling requirements.

SC12 Editorial Feature HPCwire Soundbite sponsored by ISC Xyratex

HPC Job Bank


Featured Events


  • June 16, 2013 - June 20, 2013
    ISC'13
    Leipzig,
    Germany

  • June 17, 2013 - June 18, 2013
    Forecast 2013
    San Francisco, CA
    United States





HPCwire Events