Python Snakes Its Way Into HPC

By Nicole Hemsoth

November 17, 2010

Interpreted programming languages usually don’t find too many friends in high performance computing. Yet Python, one of the most popular general-purpose interpreted languages, has garnered a small community of enthusiastic followers. True believers got the opportunity to hear about the language in the HPC realm in a tutorial session on Monday and a BoF session on Wednesday. Argonne National Lab’s William Scullin, who participated in both events, talked with HPCwire about the status of Python in this space and what developers might look forward to.

HPCwire: Python is not a language normally associated with high performance or scientific computing. What does it have to offer this user community not being fulfilled by traditional languages, like C, Fortran or other high productivity, interpreted languages like MATLAB?

William Scullin: In a way, Python’s growing adoption in the high performance and scientific computing space is a homecoming. Guido Van Rossum originally began Python as a way of providing an administrative scripting language for the Amoeba distributed operating system. Then as now, it combines simple, easy to learn and maintain syntax with access to the same powerful libraries and function calls you would find in any C or Fortran implementation. While there has always been an emphasis on reducing the time it takes to perform a computation, Python has truly shined in improving scientific computing by taking the work out of programming and reducing the time to solution.

Often, projects fail when they try to be all things to all people. MATLAB, Mathematica, SPSS, and Maple are all very useful tools, in part because they are focused on meeting the needs of a well funded community with very specific goals. Python, arising from a very diverse community that ranges from astrophysicists to game programmers to web designers to entry level computer science students, has been very successful due to the diversity of users. The standard library has become amazingly extensive without becoming inconsistent.

Likewise, the amount of software that has come out of the community is amazing, most of which is open source, and the vast majority of which follows the same coding guidelines as the core modules. This makes it possible to easily develop an interface to an embedded microcontroller to turn off the desk lamp when your simulation finally ends and automatically push results to a web server in less than an hour — or alternately turn on a coffee pot and resubmit your job when the simulation fails — all in one language.

HPCwire: Obviously, performance is a driving issue in HPC. How is the issue of execution performance being addressed?

Scullin: Performance is a matter of perspective. A favorite maxim in the Python community is that the greatest performance improvement comes from going to the working from the non-working state. A second maxim, from Knuth, is that premature optimization is the root of all evil. While the execution speed of a Python application may not be as fast as one written in C, C++, or Fortran, its ease of use and low learning curve sharply improves overall time to solution. It’s a question of developer time versus compute time.

Side stepping the issue, it’s ridiculously easy to extend Python with modules written in C, C++, and Fortran. It’s common in our community to utilize compiled high performance numerical kernels, then use Python to handle areas like I/O, workflow management, computational analysis, and steering. When areas become performance bottlenecks, those areas tend to be rewritten in C.

Conversely, I’ve seen C and Fortran projects where code complexity has prevented maintenance and functionality, leading to thousands of lines of compiled code being replaced with less than a hundred lines of Python. In many ways, Python is coming to fulfill the roles that frameworks like Cactus and Samurai sought to fill at the start of the decade — letting scientists worry about their problems while letting the language and interpreter do the heavy lifting.

HPCwire: Do you think a compiled implementation of Python would be a step in the right direction?

Scullin: There will always be a place for the interpreted reference implementation, especially in development, but if a Python compiler comes along that provides better performance without compromising the language, I can’t see it finding much resistance.

That said, there are currently projects such as Unladen Swallow, PyPy, Stackless Python, Jython, and Iron Python that provide alternatives to the CPython interpreter. Unladen Swallow, backed in part by Google, and PyPy both seek to close the performance gap with compiled languages. Unladen Swallow is particularly exciting as it’s backended into the Low Level Virtual Machine, which is the basis for multiple compilers including Clang, currently the default compiler under Apple’s OS X. This makes a Python compiler more a matter of when than if.

HPCwire: Can you describe some of the more important Python initiatives — language extensions, libraries, tools, etc. — that are aimed at the HPC domain?

Scullin: I cannot speak highly enough of NumPy, which is almost the Swiss army knife of Python for scientific and high performance computing. It’s been under active development for years now with each release providing better performance, automatic integration of popular high performance libraries like BLAS and LAPACK, more features, and greater portability. NumPy is further extended by SciPy, which provides additional tools and lab kits addressing almost every science domain.

Likewise, I think very highly of mpi4py, PyMPI, PyCUDA and its sister PyOpenCL, petsc4py, and PyTrilinos. All of these keep improving the options we have to accelerate our code using the very same tools and interfaces that are available through traditional compiled languages with none of the complexity.

HPCwire: Are there vendors out there with commercially-supported solutions?

Scullin: Indeed, and more importantly, most of them are active contributors to and supporters of the Python community. I can no longer count the number of consulting firms that provide Python solutions. It’s also been very encouraging seeing vendors add Python support to their products. Two companies well known in the HPC space, Rogue Wave and ParaTools, have both been very responsive.

Rogue Wave has provided access to their mathematical libraries, IMSL via PyIMSL. Furthermore, they have brought a number of people into the Python community via PyIMSL Studio which they market officially as a prototyping tool. I’ve encountered PyIMSL studio users so happy with their prototype Python applications with PyIMSL Studio, that they ran with the Python code as production code. I should also mention that while the TotalView debugger is not officially a Python tool, it’s seen a lot of use by Python HPC users and it will be interesting to see where it goes since Rogue Wave’s acquisition of Acumen.

ParaTools, a major contributor to the TAU Performance System and a leading consultant in the area of parallel and high performance codes has done a very good job of adding Python support to TAU.

Without hesitation, I have used their tools with C, Fortran, and Python and found their support to be helpful and responsive regardless of language.

While not directly in the HPC market, Enthought, deserves special mention. They host an array of Python projects with engineering and science applications. They provide a commercial packaging of the Python interpreter with commonly used libraries and utilities along with technical support as the Enthought Python Distribution. Most of all, they are active developers of NumPy and SciPy. Without their support and involvement, I am not sure that NumPy would have come together as nicely as it has.

While relatively new, I’ll also be interested to see what the future holds for MBA Sciences’s SPM.Python toolkit for bringing parallelism into serial Python programs. I’ll be keeping a close watch on PiCloud, a firm which provides an amazingly easy to user cloud computing platform that makes running Python codes on a compute cloud ridiculously easy. PiCloud users have their computations offloaded without any serious code changes, having to be involved in any aspect of setting up a cloud infrastructure, or doing any server management. They’ve seriously made it as simple as coding and running.

Finally, though it hasn’t been making a lot of noise lately, NVIDIA has been putting effort behind Copperhead, which while not a complete Python, allows for the rapid development of CUDA kernels in Python-like code.

HPCwire: Do you think most uses of Python in HPC will eventually involve either integration with C or Fortran or source code translation to those languages?

Scullin: I believe that HPC users will continue to choose the best possible tool to address a need in a given situation. Python is flexible enough that there will be continued integration with C, Fortran, and other languages. At the same time, interpreter performance is being rapidly addressed, which makes the issues that come with language translation into C and Fortran cause that sort of project to be less attractive to active Python developers. What will be interesting to watch is how codes written in a mix of C, C++, Fortran, Python and other languages perform and evolve as the LLVM platform continues to mature.

HPCwire: Can you point to any successful case studies or projects where Python has been employed in this arena?

Scullin: At Argonne, we are involved in the development of GPAW, a density-functional theory Python code based on the projector-augmented wave method. Originating out of an international collaboration, it is mostly a mix of C and Python with the vast majority of the code being Python. It has been run at scale successfully and routinely on our Blue Gene platform. While the porting of any application to platforms like the Cray XT series or the Blue Gene is an interesting exercise in computer science, it’s far more remarkable that the performance has been on par from what I’ve seen in C or C++ codes. Moreover, it is being used to produce reliable data used to generate publications.

The other community that a lot of people think of when looking for successful Python applications in the HPC space is bioinformatics. While I’ve not been involved with many bioinformatics codes, the last four or five years have seen a rising number of chemists and biologists appearing on Python-related mailing lists and at conferences discussing how they have been using Python to power their science. While Perl still holds sway in the field, Python is quickly becoming almost as popular.

HPCwire: For those HPC developers interested in learning more about what’s available in the Python ecosystem, can you point to some resources they could tap into?

Scullin: Depending on their particular interests, one of the best places to start is by visiting www.scipy.org. From there, you can find links to numerous mailing lists, information about conferences, code recipes, documentation, and much more. In the Chicago and Bay Areas there are very active Python users groups with sizable memberships with an interest in HPC and scientific computing. Finally, given Python’s ease of use, one of the best things you can do is to spend an afternoon with the interpreter, simply playing with code and seeing what the language can do for you without any effort. The joy of doing powerful things with simple code is one of the most admirable traits of the language.

Subscribe to HPCwire's Weekly Update!

Be the most informed person in the room! Stay ahead of the tech trends with industy updates delivered to you every week!

Which Schools Produce the Top Coders in the World?

December 8, 2016

Ever wonder which universities worldwide produce the best coders? The answers may surprise you, at least as judged by the results of a competition posted yesterday on the HackerRank blog. Read more…

By John Russell

Enlisting Deep Learning in the War on Cancer

December 7, 2016

Sometime in Q2 2017 the first ‘results’ of the Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) will become publicly available according to Rick Stevens. He leads one of three JDACS4C pilot projects pressing deep learning (DL) into service in the War on Cancer. The pilots, supported in part by DOE exascale funding, not only seek to do good by advancing cancer research and therapy but also to advance deep learning capabilities and infrastructure with an eye towards eventual use on exascale machines. Read more…

By John Russell

DDN Enables 50TB/Day Trans-Pacific Data Transfer for Yahoo Japan

December 6, 2016

Transferring data from one data center to another in search of lower regional energy costs isn’t a new concept, but Yahoo Japan is putting the idea into transcontinental effect with a system that transfers 50TB of data a day from Japan to the U.S., where electricity costs a quarter of the rates in Japan. Read more…

By Doug Black

Infographic Highlights Career of Admiral Grace Murray Hopper

December 5, 2016

Dr. Grace Murray Hopper (December 9, 1906 – January 1, 1992) was an early pioneer of computer science and one of the most famous women achievers in a field dominated by men. Read more…

By Staff

Ganthier, Turkel on the Dell EMC Road Ahead

December 5, 2016

Who is Dell EMC and why should you care? Glad you asked is Jim Ganthier’s quick response. Ganthier is SVP for validated solutions and high performance computing for the new (even bigger) technology giant Dell EMC following Dell’s acquisition of EMC in September. In this case, says Ganthier, the blending of the two companies is a 1+1 = 5 proposition. Not bad math if you can pull it off. Read more…

By John Russell

AWS Embraces FPGAs, ‘Elastic’ GPUs

December 2, 2016

A new instance type rolled out this week by Amazon Web Services is based on customizable field programmable gate arrays that promise to strike a balance between performance and cost as emerging workloads create requirements often unmet by general-purpose processors. Read more…

By George Leopold

AWS Launches Massive 100 Petabyte ‘Sneakernet’

December 1, 2016

Amazon Web Services now offers a way to move data into its cloud by the truckload. Read more…

By Tiffany Trader

Weekly Twitter Roundup (Dec. 1, 2016)

December 1, 2016

Here at HPCwire, we aim to keep the HPC community apprised of the most relevant and interesting news items that get tweeted throughout the week. Read more…

By Thomas Ayres

Enlisting Deep Learning in the War on Cancer

December 7, 2016

Sometime in Q2 2017 the first ‘results’ of the Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) will become publicly available according to Rick Stevens. He leads one of three JDACS4C pilot projects pressing deep learning (DL) into service in the War on Cancer. The pilots, supported in part by DOE exascale funding, not only seek to do good by advancing cancer research and therapy but also to advance deep learning capabilities and infrastructure with an eye towards eventual use on exascale machines. Read more…

By John Russell

Ganthier, Turkel on the Dell EMC Road Ahead

December 5, 2016

Who is Dell EMC and why should you care? Glad you asked is Jim Ganthier’s quick response. Ganthier is SVP for validated solutions and high performance computing for the new (even bigger) technology giant Dell EMC following Dell’s acquisition of EMC in September. In this case, says Ganthier, the blending of the two companies is a 1+1 = 5 proposition. Not bad math if you can pull it off. Read more…

By John Russell

AWS Launches Massive 100 Petabyte ‘Sneakernet’

December 1, 2016

Amazon Web Services now offers a way to move data into its cloud by the truckload. Read more…

By Tiffany Trader

Lighting up Aurora: Behind the Scenes at the Creation of the DOE’s Upcoming 200 Petaflops Supercomputer

December 1, 2016

In April 2015, U.S. Department of Energy Undersecretary Franklin Orr announced that Intel would be the prime contractor for Aurora: Read more…

By Jan Rowell

Seagate-led SAGE Project Delivers Update on Exascale Goals

November 29, 2016

Roughly a year and a half after its launch, the SAGE exascale storage project led by Seagate has delivered a substantive interim report – Data Storage for Extreme Scale. Read more…

By John Russell

Nvidia Sees Bright Future for AI Supercomputing

November 23, 2016

Graphics chipmaker Nvidia made a strong showing at SC16 in Salt Lake City last week. Read more…

By Tiffany Trader

HPE-SGI to Tackle Exascale and Enterprise Targets

November 22, 2016

At first blush, and maybe second blush too, Hewlett Packard Enterprise’s (HPE) purchase of SGI seems like an unambiguous win-win. SGI’s advanced shared memory technology, its popular UV product line (Hanna), deep vertical market expertise, and services-led go-to-market capability all give HPE a leg up in its drive to remake itself. Bear in mind HPE came into existence just a year ago with the split of Hewlett-Packard. The computer landscape, including HPC, is shifting with still unclear consequences. One wonders who’s next on the deal block following Dell’s recent merger with EMC. Read more…

By John Russell

Intel Details AI Hardware Strategy for Post-GPU Age

November 21, 2016

Last week at SC16, Intel revealed its product roadmap for embedding its processors with key capabilities and attributes needed to take artificial intelligence (AI) to the next level. Read more…

By Alex Woodie

Why 2016 Is the Most Important Year in HPC in Over Two Decades

August 23, 2016

In 1994, two NASA employees connected 16 commodity workstations together using a standard Ethernet LAN and installed open-source message passing software that allowed their number-crunching scientific application to run on the whole “cluster” of machines as if it were a single entity. Read more…

By Vincent Natoli, Stone Ridge Technology

IBM Advances Against x86 with Power9

August 30, 2016

After offering OpenPower Summit attendees a limited preview in April, IBM is unveiling further details of its next-gen CPU, Power9, which the tech mainstay is counting on to regain market share ceded to rival Intel. Read more…

By Tiffany Trader

AWS Beats Azure to K80 General Availability

September 30, 2016

Amazon Web Services has seeded its cloud with Nvidia Tesla K80 GPUs to meet the growing demand for accelerated computing across an increasingly-diverse range of workloads. The P2 instance family is a welcome addition for compute- and data-focused users who were growing frustrated with the performance limitations of Amazon's G2 instances, which are backed by three-year-old Nvidia GRID K520 graphics cards. Read more…

By Tiffany Trader

Think Fast – Is Neuromorphic Computing Set to Leap Forward?

August 15, 2016

Steadily advancing neuromorphic computing technology has created high expectations for this fundamentally different approach to computing. Read more…

By John Russell

The Exascale Computing Project Awards $39.8M to 22 Projects

September 7, 2016

The Department of Energy’s Exascale Computing Project (ECP) hit an important milestone today with the announcement of its first round of funding, moving the nation closer to its goal of reaching capable exascale computing by 2023. Read more…

By Tiffany Trader

HPE Gobbles SGI for Larger Slice of $11B HPC Pie

August 11, 2016

Hewlett Packard Enterprise (HPE) announced today that it will acquire rival HPC server maker SGI for $7.75 per share, or about $275 million, inclusive of cash and debt. The deal ends the seven-year reprieve that kept the SGI banner flying after Rackable Systems purchased the bankrupt Silicon Graphics Inc. for $25 million in 2009 and assumed the SGI brand. Bringing SGI into its fold bolsters HPE's high-performance computing and data analytics capabilities and expands its position... Read more…

By Tiffany Trader

ARM Unveils Scalable Vector Extension for HPC at Hot Chips

August 22, 2016

ARM and Fujitsu today announced a scalable vector extension (SVE) to the ARMv8-A architecture intended to enhance ARM capabilities in HPC workloads. Fujitsu is the lead silicon partner in the effort (so far) and will use ARM with SVE technology in its post K computer, Japan’s next flagship supercomputer planned for the 2020 timeframe. This is an important incremental step for ARM, which seeks to push more aggressively into mainstream and HPC server markets. Read more…

By John Russell

IBM Debuts Power8 Chip with NVLink and Three New Systems

September 8, 2016

Not long after revealing more details about its next-gen Power9 chip due in 2017, IBM today rolled out three new Power8-based Linux servers and a new version of its Power8 chip featuring Nvidia’s NVLink interconnect. Read more…

By John Russell

Leading Solution Providers

Vectors: How the Old Became New Again in Supercomputing

September 26, 2016

Vector instructions, once a powerful performance innovation of supercomputing in the 1970s and 1980s became an obsolete technology in the 1990s. But like the mythical phoenix bird, vector instructions have arisen from the ashes. Here is the history of a technology that went from new to old then back to new. Read more…

By Lynd Stringer

US, China Vie for Supercomputing Supremacy

November 14, 2016

The 48th edition of the TOP500 list is fresh off the presses and while there is no new number one system, as previously teased by China, there are a number of notable entrants from the US and around the world and significant trends to report on. Read more…

By Tiffany Trader

Intel Launches Silicon Photonics Chip, Previews Next-Gen Phi for AI

August 18, 2016

At the Intel Developer Forum, held in San Francisco this week, Intel Senior Vice President and General Manager Diane Bryant announced the launch of Intel's Silicon Photonics product line and teased a brand-new Phi product, codenamed "Knights Mill," aimed at machine learning workloads. Read more…

By Tiffany Trader

CPU Benchmarking: Haswell Versus POWER8

June 2, 2015

With OpenPOWER activity ramping up and IBM’s prominent role in the upcoming DOE machines Summit and Sierra, it’s a good time to look at how the IBM POWER CPU stacks up against the x86 Xeon Haswell CPU from Intel. Read more…

By Tiffany Trader

Beyond von Neumann, Neuromorphic Computing Steadily Advances

March 21, 2016

Neuromorphic computing – brain inspired computing – has long been a tantalizing goal. The human brain does with around 20 watts what supercomputers do with megawatts. And power consumption isn’t the only difference. Fundamentally, brains ‘think differently’ than the von Neumann architecture-based computers. While neuromorphic computing progress has been intriguing, it has still not proven very practical. Read more…

By John Russell

Dell EMC Engineers Strategy to Democratize HPC

September 29, 2016

The freshly minted Dell EMC division of Dell Technologies is on a mission to take HPC mainstream with a strategy that hinges on engineered solutions, beginning with a focus on three industry verticals: manufacturing, research and life sciences. "Unlike traditional HPC where everybody bought parts, assembled parts and ran the workloads and did iterative engineering, we want folks to focus on time to innovation and let us worry about the infrastructure," said Jim Ganthier, senior vice president, validated solutions organization at Dell EMC Converged Platforms Solution Division. Read more…

By Tiffany Trader

Container App ‘Singularity’ Eases Scientific Computing

October 20, 2016

HPC container platform Singularity is just six months out from its 1.0 release but already is making inroads across the HPC research landscape. It's in use at Lawrence Berkeley National Laboratory (LBNL), where Singularity founder Gregory Kurtzer has worked in the High Performance Computing Services (HPCS) group for 16 years. Read more…

By Tiffany Trader

Micron, Intel Prepare to Launch 3D XPoint Memory

August 16, 2016

Micron Technology used last week’s Flash Memory Summit to roll out its new line of 3D XPoint memory technology jointly developed with Intel while demonstrating the technology in solid-state drives. Micron claimed its Quantx line delivers PCI Express (PCIe) SSD performance with read latencies at less than 10 microseconds and writes at less than 20 microseconds. Read more…

By George Leopold

  • arrow
  • Click Here for More Headlines
  • arrow
Share This