Cloud computing is an attractive computational resource for e-Science because of the ease with which cores can be accessed on demand, and because the virtual machine implementation that underlies cloud computing reduces the cost of porting a numeric or analysis code to a new platform. It is difficult to use cloud computing resources for large-scale, high throughput ensemble jobs however. Additionally, the computationally oriented researcher is increasingly encouraged to make data sets available to the broader community. For the latter to be achieved, using capture tools during experimentation to harvest metadata and provenance reduces the manual burden of marking up results. Better automatic capture of metadata and provenance is the only means by which sharing of scientific data can scale to meet the burgeoning explosion of data.

This project develops a pipeline framework for running ensemble simulations on the cloud; the framework has two key components: ensemble deployment and metadata harvest. Regarding the former, on commercial cloud platforms typically a much smaller number of jobs than desired can be started at any one time. An ensemble run will need to be pipelined to a cloud resource, that is, executed in well-controlled batches over a period of time. We will use platform features of Azure, and employ machine learning techniques to continuously refine the pipeline submission strategy and workflow strategies for ensemble parameter specification, pipelined deployment, and metadata capture. Regarding the latter key component, we expect to reduce the burden of sharing scientific datasets resulting from the use of cloud resources through automatic metadata and provenance capture and representation that aligns the metadata with emerging best practices in data sharing and discovery. Ensemble simulations result in complex data sets, whose reuse could be increased by expressive, granule and collection level metadata, including the lineage of the resulting products, to contribute towards trust.

In this project we focus on a compelling and timely application from climate research: One of the more immediate and dangerous impacts of climate change could be a change in the strength of storms that form over the oceans. In addition, as sea level rises due to global warming and melting of the polar ice caps, coastal communities will become increasingly vulnerable to storm surge. There have already been indications that even modest changes in ocean surface temperature can have a disproportionate effect on hurricane strength and the damage inflicted by these storms. In an effort to understand these impacts, modelers turn to predictions generated by hydrodynamic coastal ocean models such as the Sea, Lake and Overland Surges from Hurricanes (SLOSH) model. The proposed research advances the knowledge and understanding of probabilistic storm surge products by enhancements to the SLOSH model itself and through mechanisms that take advantage of commercial cloud resources. This knowledge is expected to have application in research, the classroom, and in operational settings.

The broader significance of the project is several-fold. Cloud computing is an important economic driver but it remains difficult for use in computationally driven scientific research. This project lowers the barriers to conducting e-Science research that utilizes cloud resources, specifically Azure. It will contribute tools to help researchers share, preserve, and publicize the scientific data sets that result from their research. Because we focus on and improve an application that predicts storm surge in response to sea level changes and severe storms, our work contributes to societal responses and adaptations to climate change, including planning and building the sustainable, hazard-resilient coastal communities of the future.

Project Report

Look to the Clouds for Forecasting Storm Surge Outcome: Large hurricanes, such as Superstorm Sandy (2012), often push a dangerous dome of water ahead of them that can rapidly flood coastal communities, causing massive death and destruction. To predict this complex phenomenon, storm surge forecasters are looking to the clouds – cloud computing, that is. Impact/Benefit: Research scientists from the University of Miami, computer scientists from Indiana University, and Storm Surge Specialists from the National Hurricane Center have developed a new way to run numerical storm surge prediction models in a cloud-computing environment. When a hurricane threatens the US coastline and lives are in danger, forecasters can go into "burst mode", instantly create powerful virtual supercomputing clusters in the cloud consisting of thousands of CPUs, simulate how high the water will rise, issue warnings so people can escape before the storm hits, then tear down the cluster to save the taxpayers money. Explanation: Cloud computing is a flexible, cost-effective way of accessing computational resources on demand, without incurring the high setup and maintenance costs of a locally owned compute cluster. Storm surge forecasting is a difficult "grand challenge" problem because of the non-linear interactions that occur between the oceanic circulation, atmospheric dynamics, terrain, bathymetry, land cover, and coastline geometry. Numerical models, such as the National Weather Service’s SLOSH (Sea, Lake and Overland Surges from Hurricanes), can simulate this complex interplay between these physical aspects of the Earth system. Nevertheless, it can require tens of thousands of storm surge simulations to accurately predict how much flooding will occur during a landfalling hurricane because there is so much uncertainty in a storm’s future intensity, size, direction and speed of motion. Cloud computing makes it possible to run thousands of these "ensemble" simulations at the same time, giving the forecasters much more precise information about the threat from a storm in a fraction of the time it would take on a desktop PC. This technology is also being used by climate scientists at the University of Miami’s Rosenstiel School of Marine and Atmospheric Science to estimate how much coastal inundation will occur under future sea level rise scenarios.

Agency
National Science Foundation (NSF)
Institute
Division of Advanced CyberInfrastructure (ACI)
Type
Standard Grant (Standard)
Application #
1148085
Program Officer
Daniel Katz
Project Start
Project End
Budget Start
2012-05-01
Budget End
2014-04-30
Support Year
Fiscal Year
2011
Total Cost
$199,725
Indirect Cost
Name
University of Miami
Department
Type
DUNS #
City
Key Biscayne
State
FL
Country
United States
Zip Code
33149