by Paul Tooby, SDSC
La Jolla, CA — One of the most significant unsolved scientific problems of our time does not lie billions of light-years away, nor is it hidden in the microscopic world of the atom or the gene. It is something that all of humanity is immersed in every day: Earth’s climate system.
For years, however, U.S. climate-modeling efforts have been falling behind European models and computing resources. The National Research Council has recently cautioned that “the United States lags behind other countries in its ability to model long-term climate change” and that it is inappropriate to rely so heavily on foreign climate modeling. Because climate science has major implications for energy policy, without a climate modeling capability the U.S. is seriously hampered when considering energy issues or negotiating future international energy policy.
To address the need for improved climate modeling, the Accelerated Climate Prediction Initiative (ACPI), a multi-institution effort including both government laboratories and universities, is working to create a state-of-the-art climate modeling capability for the U.S. Major participants include the Department of Energy (DoE), National Center for Atmospheric Research (NCAR), Los Alamos National Laboratory (LANL), Scripps Institution of Oceanography (SIO), Naval Postgraduate School (NPS), Pacific Northwest National Laboratory (PNNL), the University of Washington, U.S. Geological Survey (USGS), and the Desert Research Institute (DRI).
To “jump start” this major long-term program, SIO researcher Tim Barnett is coordinating a collaboration of researchers in an ACPI pilot demonstration. Co-PIs in the pilot program include Robert Malone of LANL, Bill Pennell of PNNL, Bert Semtner of NPS, Detlef Stammer of SIO, and Warren Washington of NCAR. “The pilot program we’re doing as part of the larger ACPI is intended to demonstrate all the key parts of the larger climate prediction initiative in an ‘end-to-end’ process. The different model components-ocean, atmosphere, land, sea ice, and other parts-have been integrated into a global climate model, the Parallel Climate Model (PCM), which we’ll start by using today’s observed ocean as initial conditions, then run climate simulations under various scenarios of greenhouse gas changes through 2050 or 2100, and finally downscale from global to regional predictions,” Barnett says.
There are three basic parts of this pilot program. The first step is to start the Parallel Climate Model with the initial conditions of the world’s oceans as they are now, which is important because the oceans hold some 99 percent of the heat in the combined ocean-atmosphere system. Including the actual ocean state data is new science that hasn’t been done before because these data have only recently become available. Previously, to model the impact of greenhouse gas emissions, researchers would have to start their climate models in 1860, before greenhouse gases had changed, and integrate forward. So, in addition to saving more than 100 years of computing simulation time, being able to use current ocean data ties the model more realistically to the real world. “Our simulations both ways, integrating forward from 1860 and starting with today’s ocean data, have turned out surprisingly close, giving us confidence in this climate model, the PCM,” Barnett says. The ocean data comes from SIO researcher Detlef Stammer’s Ocean General Circulation Model (OGCM), which assimilates or combines all existing data about the oceans into an overall description of the present ocean state.
The second step of the pilot program is to run the full global climate model, the PCM, using different scenarios of greenhouse gas emissions that come from fossil fuel burning and other industrial and agricultural activities, and vary depending on such factors as economic and population growth and conservation measures. The PCM contains models for major components of the climate system-ocean, atmosphere, land, and sea ice-and predicts global climate decades in the future at a spatial resolution of about 280 km. Typical scenario lengths are between 50 and 100 years, taking about 320 processor hours per model year. To average out model variability, ensembles of five to 10 simulations need to be run for each scenario, placing additional demands on computing resources.
In the third step, the global-scale climate prediction data from the PCM is passed to another set of models called regional climate models developed to “downscale” the global climate predictions to the regional scale. This is important because regional predictions are what governments, policymakers, and industry need in order to make decisions. “In the pilot program we’re focusing on the hydrology of the western U.S., and after the initialization and global PCM steps, eventually we predict the amount of water that will be running down, say, the Merced River 50 years from now. So we’ll be producing predictions that have practical use right from the start,” Barnett says. But downscaling uses much higher resolution, requiring more computing time. SIO researcher David Pierce points out that “doubling the resolution of the grid in 2-D space means you also have to cut the time steps in half, so it’s a factor of eight more calculations.” This fine resolution is required to provide meaningful data to the hydrology models that do the final step of predicting streamflow in Western rivers in the decades to come.
Although U.S. climate models have lagged behind European models, researchers at NCAR working with other groups have now assembled model components for the oceans, atmosphere, land, and sea ice into the fully coupled Parallel Climate Model (PCM). The atmospheric component of the PCM is the Community Climate Model 3 (CCM3) atmospheric general circulation model (AGCM) developed at NCAR, which is used at T42 resolution (a grid in which each side of a cell measures about 2.8 degrees or 280 kilometers). The CCM3 includes a land surface model that accounts for soil moisture and vegetation types as well as a river transport model developed at the University of Texas at Austin. The ocean component is the Parallel Ocean Program (POP) model developed jointly by LANL, NPS, and NCAR. The final model component is a sea ice model developed at NPS. The full PCM has been configured to run with a serial flux coupler designed to calculate the climate system components as efficiently as possible on a variety of parallel high capacity supercomputers.
While climate models are still maturing and have significant limitations, several factors, including progress in climate science, the availability of more and better climate data, advances in modeling and computing technologies, as well as the development of terascale computing resources, are all converging to put climate modeling on the verge of major progress in predictive capability.
“I think we’ve already solved the hardest scientific problem of our pilot program, which is taking the ocean data and initializing the PCM with that,” Barnett says. If researchers put the ocean data directly into the PCM, there is a mismatch that produces model drift. Fortunately, SIO researchers have found a solution, and SIO’s David Pierce is now running tests. Once this method is working it will be transferred to NCAR partners Warren Washington and his group, who operate the PCM. “They’ll go into production and run a series of climate scenarios from 2000 to at least 2050 and maybe 2100 if the computer time’s available. And we can now do runs like this here at SDSC because we have the PCM model up and running on Blue Horizon,” Barnett says.
Climate modeling involves many large interlocking models maintained by different groups that must be ported to various supercomputing platforms. SDSC scientist John Helly has worked with SIO’s Tim Barnett and others to ensure that the Parallel Climate Model will run on NPACI resources. In a Strategic Applications Collaboration (SAC), Giri Chukkapalli and Dong Ju Choi, members of SDSC’s Scientific Computing Department, worked with NCAR researchers Tom Bettge, Tony Craig, and Vincent Wayland to port the PCM to the IBM SP and NPACI IBM Blue Horizon teraflops platform. “To enable larger simulations, the POP ocean component of the PCM maintained at LANL is being modified to scale past 64 processors, and we’ve worked closely with researchers there to help them make these changes,” Chukkapalli says.
Barnett and his colleagues in the ACPI pilot program will use the largest NPACI resources including Blue Horizon for three purposes. While all the pieces of this project have been used separately, they’ve never been put together in a seamless, programmatic form. “So, one step is to do the studies on how we initialize this model, that’s the real science in this pilot program. That involves a number of test runs on NPACI resources,” Barnett says. Second, researchers may also do global scenario runs with the PCM on NPACI resources, where they change the greenhouse gases and integrate forward to the next century. “And the third and possibly most important use of the NPACI resources is to run the downscaling model to produce the regional impact statements, which involves the high-resolution nested models that take a great deal of computing time,” Barnett says.
Climate models are inherently among the most computationally demanding scientific models in both the length of simulations required and the volume of model data produced. The reason for this lies in climate dynamics: Researchers need to discern the relatively small global climate change “signal” or “signature” above the larger “noise” of the significant natural variability of the climate system. While the “memory” of the atmosphere is less than one month, the “memory” of the oceans, due to their high heat capacity, spans many hundreds of years, requiring long simulation times. Moreover, researchers need to run ensembles of five to 10 identical simulations to average out model variability, essentially noise generated by the nonlinear properties of the model itself.
Thus, climate simulations require very large amounts of computer time, and part of the ACPI pilot program involves not only porting and optimizing codes to run efficiently on various supercomputing platforms, but also learning how to compute these large models from different groups, exchange information, and make everything work seamlessly “end-to-end.” And because the models are too large and numerous to run at one place, part of the research is learning to work efficiently with different computing centers. “NPACI has a good allocation policy where you have a year to use the time, which makes it much more feasible for us to manage this complicated project,” Barnett says.
As new technologies have enabled larger and more complex research problems such as multi-component climate modeling to be undertaken, researchers are learning to work in larger collaborations across many disciplines and institutions. “We’ve deliberately organized this pilot program in a low profile way, with distributed management. So, we’re really testing the concept that a group of people can come together, make a decision on what needs to be done, each having a responsibility, and take that away,” Barnett says. “And so far it’s working very well.”
In the two year pilot program, ACPI researchers will demonstrate an end-to-end climate modeling capability, starting with real initial ocean conditions and proceeding through global simulations with various greenhouse scenarios to regional-scale predictions for the Western U.S. “The answers we get will be useful, but certainly not the final answers, and really what we’re doing is breaking ground to identify what the problems are in the full program,” Barnett says. “We’ve already got a page or two of things that should be handled differently in later stages. We’ll wind up with a blueprint for a much larger national program to bring a state-of-the-art climate modeling capability to the U.S.”
============================================================