The goal of this project is to develop the mathematical, statistical and computational tools needed to assess and predict the risk associated with geophysical hazards such as volcanic pyroclastic flows. Based on a preliminary data analysis, the investigators develop stochastic models beginning with stationary independent increment processes employing (possibly tapered) Pareto distributions for the volumes of pyroclastic flows exceeding some observational threshold, in the domain of attraction of an alpha-stable process governing the aggregate flow volume of multiple smaller eruptions. Von Mises distributions are used for flow initiation angles. The deterministic TITAN2D two-dimensional computational environment is employed, which uses available digital elevation maps to predict the impact at various sites of interest from flows of specified volume and initiation angles. TITAN2D is a depth-averaged, thin-layer computational fluid dynamics code based on an adaptive grid Godunov solver, suitable for simulating geophysical mass flows. A rapid emulator based on a simple Gaussian random-field approximation to the TITAN2D model enables the investigators to emulate hundreds of thousands of TITAN2D runs and construct an estimate of the set of possible flow volumes and initiation angles that would lead to significant impact; a hierarchical Bayesian statistical model then reflects the probability of such an impact over a specified period of time.

Recent advances in computing power and algorithms have led to the application of mathematical and computer modeling to such highly complex phenomena as storms, floods, earthquakes and volcanic eruptions. It is increasingly being understood that development of mathematical models of these phenomena is only one part of a much more complex process needed for making reliable estimates and predictions of risk. This project develops the mathematical, statistical and computational tools needed for assessing and predicting the risk associated with such natural hazards. A particular focus of the work is the study of how these risks vary in space and time, and of how uncertain they are. This methodology is developed in context of the specific problem of volcanic avalanches and pyroclastic flows (so-called geophysical mass flows), but much of it will be applicable more broadly to problems in the analysis and quantification of risk in problems featuring spatial variability and model uncertainty. It brings together a unique team of scientists with specialties including volcanology, to guide the development of realistic models for the geophysical processes under study; in stochastic processes, to reflect uncertainty and variability about initial conditions, flow frequencies, and other features in realistic and verifiable ways; in deterministic computer modeling, for the difficult task of making detailed spatial predictions of the consequences of the most probable and of the most hazardous possible events; in computer model emulation, to accelerate many thousand-fold the computations necessary for predicting the risk of rare events under a wide range of possible scenarios; and in statistical modeling and analysis, to reflect honestly all the different sources of uncertainty and variability in this analysis, leading to a full quantification of the risk of hazardous events. Only with such a broad range of expertise can investigators build the tapestry of science that is required to develop tools for studying devastating natural hazards.

Project Report

Pyroclastic flows (PFs) are among the most common, destructive, and terrifying forms of volcanic eruption. These fast-moving (>200 mph) flows of rock, water, and gas mixtures can exceed 1000 F. The goal of our research was to develop improved methods for predicting the hazard of PF events like those that occurred at Mt. St. Helen's. The new methods can also be applied to other natural hazards, and our advances in statistical science and applied mathematics will have wide benefit. Our six-investigator team of statisticians, applied mathematicians, and a geologist collaborated to build "hazard maps" indicating the probability of inundation by catastrophic PF events within a specific time period (5, 10, or 20 years) at each "target" physical location within range of an active volcano. We broke the problem down into two parts: Deterministic Flow Modeling: Pyroclastic Flows are characterized by their volume (in millions of cubic meters) and the direction of initial flow (any compass direction is possible). For each target location, we determined how large PFs would have to be, and with what initial directions, to reach that target. Stochastic Volume & Frequence Modeling: The primary volcano we studied was the Soufrière Hills volcano on the Caribbean island of Montserrat, discovered by Columbus in 1493. After centuries of dormancy this volcano erupted in July 1995 and, since then, has been among the most active and best studied volcanoes in the world. Using data shared with us by scientists at the Montserrat Volcano Observatory we have studied and modeled the frequency, volumes, and directions of hundreds of PFs over a period of 15 years to support prediction of flow hazards for the coming decades. Addressing each of these has led to advances in applied mathematics and statistics, and has advanced the quantitative study of geophysical hazards. The physics and mechanics that govern the flow of granular material like PFs differ from those that govern familiar fluids like water and air. PI Pitman co-developed one of the leading computer models for predicting granular flow, "TITAN-2D". In about 1h hour of supercomputer time this model can predict the entire time-course of a Pyroclastic Flow on (a digital representation of) the topography, for specified values of the initial flow direction and volume and other uncertain quantities like the friction parameters that govern how quickly the kinetic energy of motion is transformed into heat. For our purposes, this 1h running time is simply too slow. To predict the probability of devastation at thousands of target locations, from each of thousands of possible PF volumes in thousands of possible directions, would require billions of hours of supercomputing time. Even mere hundreds of locations, volumes, and directions would take millions of hours, or centuries. We solved this problem by building a Bayesian statistical model (called an "emulator") for the TITAN-2D simulation model. The emulator is trained to imitate TITAN-2D by showing it the simulation model's output at a few hundred carefully selected input values (including PF volumes, initial directions, friction parameters). The emulator then reproduces these values exactly, and predicts what TITAN-2D would return at new, untried input values (along with meaningful estimates of how accurate the predictions are). The key is that the emulator does this prediction in seconds, not hours, speeding computations by over three orders of magnitude, to complete a century's computation in days. Specific scientific advances include: - Previous emulator investigators have taken short-cuts intended to simplify computations, hoping not to compromise predictions. We found ways to implement fully-Bayesian emulation without short-cuts, adding very little computational burden. - The input parameter "initial direction" is periodic: +270° from East is identical to -90° from East (both are due South). Previous investigators used ad hoc methods to approximate periodic input; we found efficient, elegant ways to emulate periodic inputs exactly. - The most extreme values (such as the very largest PF volumes) play a special role in hazard assessment, and are especially difficult to model or assess precisely because of their rarity. We found new ways to model uncertainty about these extremes and to reflect that uncertainty honestly in our hazard reports. - To predict probabilities of inundation at thousands of target points, we need not just one but rather thousands of emulators, each trained at design points chosen to best reveal a specific target. We found a way to automate selection of design points so as to do this automatically in parallel at thousands of target points. - We discovered novel ways to model and reflect uncertainty about digital topography representations. - Interaction between the input parameters flow volume and friction is not well understood--- large-volume flows travel farther and faster than expected, as if they had less friction. We found ways to use data from other similar volcanoes to help resolve the uncertain relationship between these variables to improve our predictions.

Agency
National Science Foundation (NSF)
Institute
Division of Mathematical Sciences (DMS)
Application #
0757527
Program Officer
Gabor J. Szekely
Project Start
Project End
Budget Start
2008-07-01
Budget End
2012-06-30
Support Year
Fiscal Year
2007
Total Cost
$219,468
Indirect Cost
Name
National Institute of Statistical Sciences
Department
Type
DUNS #
City
Research Triangle Pk
State
NC
Country
United States
Zip Code
27709