A Complete Spatial Downscaler
Transcription
A Complete Spatial Downscaler
A Complete Spatial Downscaler Yen-Ning Huang, Brian J Reich, Montserrat Fuentes Sankar Arumugam 2 1 Department 2 Department 1 of Statistics, NC State University of Civil, Construction, and Environmental Engineering, NC State University April 28th , 2016 A Complete Spatial Downscaler 1 / 30 Outline Introduction Complete spatial downscaler Marginal distributions Spatial correlation Calibration Application A Complete Spatial Downscaler 2 / 30 Introduction Computer models are essential for environmental science to evaluate climate/emission scenarios and make predictions where data are not available. Calibration Evaluation of the performance of physical models is needed to obtain reliable forecasts Model calibration is challenging when the process of interest is dynamic across space and time Reproducing the spatial behavior of the physical process of interest is important Downscaler Combine disparate spatial data sources to improve the spatial prediction A Complete Spatial Downscaler 3 / 30 Challenges Comparing monitor data and computer output is difficult because they often measure different spatial scales Computer models notoriously struggle to explain extreme behaviour Models run at coarse scales have trouble capturing local phenomenon, as thus may have different spatial correlation from monitor data A Complete Spatial Downscaler 4 / 30 Data sources Our motivating example is a study of summer temperatures in the US. Climate model output: a 30-year Hindcast model of monthly average temperature in summer (Jun-Aug) from 1981 to 2010 Monitor data: monthly average temperature in summer from 1950 to 1999 (interpolated to the same grid points as model output) Our goal is not to match data sources for individual months, but rather to match their distributions. A Complete Spatial Downscaler 5 / 30 Monitor observations (top) and model output (bottom) means 35 30 25 20 15 10 35 30 25 20 15 10 A Complete Spatial Downscaler 6 / 30 Monitor observations (top) and model output (bottom) standard deviations 4 3 2 1 0 4 3 2 1 0 A Complete Spatial Downscaler 7 / 30 Variograms for monitor observations and model outputs 20 15 10 5 Semi−variogram 25 Monitor observations Model outputs 0 500 1000 1500 2000 Distance (km) A Complete Spatial Downscaler 8 / 30 Complete spatial downscaler We propose a Bayesian approach to adjust for biases of the marginal distribution as well as the spatial correlation of computer model outputs Our approach has the following three components: (1) Model the spatially varying and potentially non-Gaussian marginal distributions of the model outputs and monitor data (2) Model the spatial covariance of both data sources (3) Perform calibration to rectify differences between the two data sources These three components are implemented simultaneously using Bayesian hierarchical modeling A Complete Spatial Downscaler 9 / 30 Marginal distributions Xt (s): computer model output Yt (s): monitor observations Let Fs (x) and Gs (y ) be the CDF of Xt (s) and Yt (s), respectively We can transform both processes to standard normal distribution: ZXt (s) = Φ−1 {Fs [Xt (s)]} and ZYt (s) = Φ−1 {Gs [Yt (s)]} Similarly, Xt (s) = Fs−1 {Φ[ZXt (s)]} and Yt (s) = Gs−1 {Φ[ZYt (s)]} A Complete Spatial Downscaler 10 / 30 Assuming normality When both processes and are Gaussian, Xt (s) = µX (s)+σX (s)ZXt (s) and Yt (s) = µY (s)+σY (s)ZYt (s) where µX (s) and σX (s) > 0 are spatially-varying mean and standard deviation of Xt (s) µY (s) and σY (s) > 0 are spatially-varying mean and standard deviation of Yt (s) A Complete Spatial Downscaler 11 / 30 Normal Q-Q plots for monitor observations ( −82.5, 32.5) 23 24 25 26 27 28 30 29 (−111.5, 31.5) 23 24 25 26 27 28 29 24 28 30 (−103.5, 46.5) 10 14 14 18 18 22 (−120.5, 40.5) 26 10 12 14 16 18 20 14 16 18 20 22 24 A Complete Spatial Downscaler 12 / 30 Normal Q-Q plots for model outputs ( −82.5, 32.5) 22 22 26 26 30 30 (−111.5, 31.5) 22 24 26 28 30 22 26 28 30 (−103.5, 46.5) 10 16 14 20 18 24 22 28 (−120.5, 40.5) 24 10 12 14 16 18 20 22 24 16 18 20 22 24 26 28 A Complete Spatial Downscaler 13 / 30 Skew-t model The Gaussian model fails to fit the tails For a richer class of marginal distributions we also consider the spatial skew-t process (Jones and Faddy 2003; Azzalini and Capitanio 2003): Xt (s) = µX (s) + ξX |rXt | + σXt σX (s)ZXt (s) indep (1) iid 2 ), σ 2 ∼ InvGamma(ν /2, ν /2) where rXt ∼ N(0, σXt X X Xt In this setting the marginal distribution of Xt (s) follow skew-t distribution with parameters {µX (s), σX (s), νX , ξX } A skew-t model also permits asymptotic spatial dependence A Complete Spatial Downscaler 14 / 30 Skew t Q-Q plots for monitor observations ( −82.5, 32.5) 23 24 25 26 27 28 30 29 (−111.5, 31.5) 24 25 26 27 28 29 24 25 26 27 28 29 30 (−103.5, 46.5) 10 14 14 18 18 22 (−120.5, 40.5) 12 14 16 18 20 14 16 18 20 22 24 A Complete Spatial Downscaler 15 / 30 Skew t Q-Q plots for model outputs ( −82.5, 32.5) 22 22 26 26 30 30 (−111.5, 31.5) 22 24 26 28 30 22 24 28 30 (−103.5, 46.5) 10 16 14 20 18 24 22 28 (−120.5, 40.5) 26 10 12 14 16 18 20 22 24 16 18 20 22 24 26 28 A Complete Spatial Downscaler 16 / 30 Spatial correlation Assume ZXt (s) and ZYt (s) have mean zero and variance one Expand the two processes using Karhunen-Loéve decomposition: ZXt (s) = L X φl (s)xtl and l=1 ZYt (s) = L X φl (s)ytl (2) l=1 where φl (s) are orthonormal functions xtl and ytl are pairwise independence with mean zero and Var(xtl ) = fl and Var(ytl ) = gl A Complete Spatial Downscaler 17 / 30 Spatial covariance The covariance functions can be written as: Cov[ZXt (si ), ZXt (sj )] = L X φl (si )φl (sj )fl l=1 Cov[ZYt (si ), ZYt (sj )] = L X φl (si )φl (sj )gl l=1 In particular, the basis functions we consider in data analysis are cos(s0 ωl/2 ) l is even φl (s) = sin(s0 ωdl/2e ) l is odd. where ω = (ω1 , . . . , ωn ) is the set of Fourier frequencies A Complete Spatial Downscaler 18 / 30 Matérn spectral density function For the covariances fl and gl we consider the spectral density of Matérn covariance on lattice (Guinness et al. 2014): fl (ω) = fl (ω1 , ω2 ) = 1+ αf 2 δ sin2 σf2 δω1 2 + sin2 δω2 2 νf +1 evaluated over the set of Fourier frequencies ω A Complete Spatial Downscaler 19 / 30 Calibration Conditional on the parameters in the marginal distributions Fs and Gs , the covariances f and g , and the latent xtl and ytl , the standardized computer model output is calibrated as Z̃Xt (s) = L X wl φl (s)xtl l=1 p wl = gl /fl is the ratio of spectral densities which is used to calibrate the spatial correlation of the model output The covariance of Z̃Xt (s) matches with ZYt (s) since Cov[Z̃Xt (si ), Z̃Xt (sj )] = L X wl2 φl (si )φl (sj )fl l=1 = Cov[ZYt (si ), ZYt (sj )] A Complete Spatial Downscaler 20 / 30 Calibration The calibrated computer model output X̃t (s) = Gs−1 {Φ[Z̃Xt (s)]} has Yt (s)0 s marginal distribution and spatial correlation If both data sources follow skew-t distribution, then the calibrated computer model output is X̃t (s) = µY (s) + ξY |rYt | + σYt σY (s) L X wl φl (s)xtl (3) l=1 A Complete Spatial Downscaler 21 / 30 Calibration for skew-t model If skew-t assumption is appropriate, then we can have 2 Xt ∼ N(µX + ξX |rXt |, σXt (ΦF Φ0 + τX2 In )) and 2 Yt ∼ N(µY + ξY |rYt |, σYt (ΦG Φ0 + τY2 In )) 2 ) and σ 2 ∼ InvGamma( νX , νX ). where rXt ∼ N(0, σXt Xt 2 2 Each MCMC iteration gives a sample for all model parameters (µX , σX2 , etc), as well as random effects xtl This gives a posterior predictive distribution of X̃t (s) in (3) A Complete Spatial Downscaler 22 / 30 Monitor observations (top) and calibrated model output (bottom) means 35 30 25 20 15 10 35 30 25 20 15 10 A Complete Spatial Downscaler 23 / 30 Monitor observations (top) and calibrated model output (bottom) standard deviations 4 3 2 1 0 4 3 2 1 0 A Complete Spatial Downscaler 24 / 30 Calibrated quantiles (skew t) (−111.5, 31.5) (−82.5, 32.5) 30 22 22 26 26 30 Monitor observations Model outputs Calibrated model outputs 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 probs probs (−120.5, 40.5) ( −103.5, 46.5) 1.0 15 25 10 15 20 25 0.0 0.0 0.2 0.4 0.6 probs 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 probs A Complete Spatial Downscaler 25 / 30 Calibrated variograms (skew t) 20 15 10 5 Semi−variogram 25 Monitor observations Model outputs Calibrated model outputs 0 500 1000 1500 2000 Distance (km) A Complete Spatial Downscaler 26 / 30 Representative Concentration Pathways (RCPs) The RCPs are greenhouse gas concentration trajectories adopted by the IPCC in 2014 for climate modeling and research Depending on how much greenhouse gases are emitted in the years to come, RCPs can be used to describe different possible future climates We apply our downscaler to summer monthly temperature data from RCP8.5 (2005 - 2034) . A Complete Spatial Downscaler 27 / 30 Future RCP8.5 means before and after calibration 35 30 25 20 15 10 35 30 25 20 15 10 A Complete Spatial Downscaler 28 / 30 Acknowledgment This work was supported by grants DOI (14-1-04-9), NIH (R21ES022795-01A1, 5R01ES014843-02), and NSF (DMS1107046). A Complete Spatial Downscaler 29 / 30 Thank you! For comments or questions, please contact me at yehuang@ncsu.edu A Complete Spatial Downscaler 30 / 30