Volume 5, Issue 6
Application
Free Access

Bayesian hierarchical models for spatially misaligned data in R

Andrew O. Finley

Corresponding Author

Department of Forestry, Michigan State University, 126 Natural Resources Building, East Lansing, MI, 48824‐1222 USA

Correspondence author. E‐mail: finleya@msu.eduSearch for more papers by this author
Sudipto Banerjee

Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building, MMC 303, 420 Delaware Street S.E., Minneapolis, MN, 55455 USA

Search for more papers by this author
Bruce D. Cook

Biospheric Sciences Laboratory, National Aeronautics and Space Administration, Goddard Space Flight Center, Code 618, Greenbelt, MD, 20771 USA

Search for more papers by this author
First published: 10 May 2014
Citations: 8

Summary

  1. Spatial misalignment occurs when at least one of multiple outcome variables is missing at an observed location. For spatial data, prediction of these missing observations should be informed by within location association among outcomes and by proximate locations where measurements were recorded.
  2. This study details and illustrates a Bayesian regression framework for modelling spatially misaligned multivariate data. Particular attention is paid to developing valid probability models capable of estimating parameter posterior distributions and propagating uncertainty through to outcomes' predictive distributions at locations where some or all of the outcomes are not observed.
  3. Models and associated software are presented for both Gaussian and non‐Gaussian outcomes. Model parameter and predictive inference within the proposed framework is illustrated using a synthetic and forest inventory data set.
  4. The proposed Markov chain Monte carlo samplers were written in c++ and leverage R's Foreign Language Interface to call fortran blas (Basic Linear Algebra Subprograms) and lapack (Linear Algebra Package) libraries for efficient matrix computations. The models are implemented in the spMisalignLM and spMisalignGLM functions within the spBayes r package available via the Comprehensive R Archive Network (cran) (http://cran.r-project.org).

Introduction

Investment in long‐term monitoring networks and advancement in sensor technologies are creating data‐rich environments that provide extraordinary opportunities to understand the complexity of large and spatially indexed ecological data. Building such understanding often requires the analysis of spatially indexed data sets with multiple variables measured at each location. In such settings, it is commonly posited that there is association between the measurements at a given location as well as association among measurements across locations. In ecological analysis, we often seek inference about the association among these multiple variables or wish to predict their values at new locations. For example, consider the analysis of (i) species co‐occurrence where species' presence/absence or abundance is recorded at each location, for example Ovaskainen, Hottola & Siitonen (2010); (ii) soil nutrient impact on local tree growth and competition where soil nutrient measurements coincide with tree inventory locations, for example Baribault, Kobe & Finley (2012); or (iii) relationship between multiple environmental stressors and measures of focal species fitness, for example Swope & Parker (2012). In each case, development of a statistical model typically requires the full set of outcomes, for example species presence/absence, and covariates, for example soil nutrients or environmental stressors, at a set of locations.

Given such multivariate settings, it is common that different subsets of the outcome variables, or covariates, are available at different locations. In the statistical literature, this situation is referred to as spatial misalignment. Following the examples above, say observers record the presence/absence of different subsets of species at different locations, or for a subset of locations, only some of the soil nutrients or plant stressors were measured – perhaps due to different sampling protocol or if data were drawn from different data bases. In such cases, it is necessary to somehow impute or predict the value of the missing observations. Note, if there is misalignment among the covariates, then we might view them as outcomes in a model used to predict their missing observations. Regardless of where the misalignment occurs, these predictions should be informed using the within location association among variables and from proximate locations where measurements were recorded. Further, it is common to seek prediction for the entire set of outcomes at new locations where no measurements were recorded. In both cases, an assessment of prediction uncertainty is also often desired.

Here, we consider point‐point misalignment to distinguish it from point‐areal which refers to the situation where some variables may be referenced by their points, while others may have been aggregated over spatial regions. Although the term point‐areal misalignment is used in the literature, following Gotway & Young (2002), we prefer to classify this as a change‐of‐support problem. See also Mugglin, Carlin & Gelfand (2000), Gelfand, Zhu & Carlin (2001), Zhu, Carlin & Gelfand (2003) and the references therein for methods for change‐of‐support problems.

A salient feature of our data is that every location generates, at most, only one replicate of the multiple outcomes. For empirical estimation of the association among these outcomes using sampling‐based multivariate analysis methods, one must consider the observations at different spatial locations as independent replicates. This will, however, preclude estimation of the spatial associations. Under this setting, inference on associations should deploy fully model‐based approaches using the flexibility of spatial stochastic processes.

Existing model‐based methods for handling spatial point‐point misalignment primarily aim to align disparate variables by accounting for additional uncertainty when kriging or other smoothing methods are used to align the spatially referenced data (Madsen, Ruppert & Altman 2008; Buonaccorsi 2009; Gryparis et al. 2009; Paciorek et al. 2009; Szpiro et al. 2011; Lopiano, Young & Gotway 2011 2013). These approaches build conditional regression‐like models where the marginal distribution of the first outcome is specified, followed by the conditional distribution of the second outcome given the first and so on. This approach is easily interpretable and ensures the legality of the resulting joint distributions from the process realizations. However, the approach is more suitable when the number of outcomes is small and there is a natural ‘ordering' that would suggest the sequence for constructing the conditional distributions.

Settings such as ours lack such information on ordering, so joint modelling of the outcomes is preferable to avoid the explosion in models emerging from alternate ordering schemes. Joint models attempt to directly construct cross‐covariance functions that describe the covariances between different outcomes at two, possibly different, locations. A model‐based Bayesian approach for point‐point misalignment was presented in Banerjee & Gelfand (2002). More recently, joint modelling of point patterns and misaligned covariates are considered in Illian, Sorbye & Rue (2012), while Ren & Banerjee (2013) considered modelling spatial misalignment using a class of spatial latent factor models.

While the problem of spatial misalignment is ubiquitous, software to implement model‐based analysis of such data is absent. Our current work focuses upon point‐point misalignment and extends and integrates some of the aforementioned methodological work into a Bayesian hierarchical modelling framework. In addition, we demonstrate how this is implemented in our spBayes package for the r statistical programming language and environment.

Multivariate spatial regression with misalignment

Let S1,S2,…,Sm denote sets comprising n1,n2,…,nm locations where m outcomes have been observed. We collect all observations for the first outcome into an n1 × 1 column vector y1, those for the second outcome into an n2 × 1 column vector y2, and so on until we collect observations corresponding to the m‐th outcome into an nm × 1 column vector ym. Each of these is stacked into an N × 1 column vector y where urn:x-wiley:2041210X:media:mee312189:mee312189-math-0001. The covariates corresponding to the i‐th outcome yi are collected into an ni × pi matrix Xi, and we let βi denote the pi × 1 regression slope vector associated with Xi.

The other key ingredient in the multivariate spatial regression model is the vector of unobserved spatial random effects. For any location s, indexed by some coordinate frame, we have a spatial random effect wi(s) associated with the i‐th outcome yi(s) for i = 1,2,…,m. We collect the random effects corresponding to the i‐th outcome into an ni × 1 vector wi so that it corresponds to yi. The multivariate spatial linear regression model is given by
urn:x-wiley:2041210X:media:mee312189:mee312189-math-0002(eqn 1)
where εi is an ni × 1 column of zero‐centred residual random errors corresponding to the i‐th outcome such that the covariance between an element in εi and an element in εj is zero whenever i and j correspond to different outcomes. Two elements within εi represent random errors associated with the i‐th outcome measured at two different locations. The covariances between any two such elements and the variances of each element in εi are placed as off‐diagonal and diagonal entries in an ni × ni matrix Ψi, which is the variance–covariance matrix of εi. Each of the εi's is assumed to be normally distributed, independent of the others, with mean zero and variance–covariance matrix Ψi.

Model eqn 1 can be extended to accommodate non‐Gaussian outcomes such as (i) binary data modelled using logit or probit regression, and (ii) count data modelled using Poisson regression. Diggle, Tawn & Moyeed (1998) unify the use of generalized linear models in spatial data contexts. See also Lin et al. (2000), Kamman & Wand (2003), and Banerjee, Carlin & Gelfand (2004). Essentially we replace model eqn 1 with the assumption that E[yi(s)] is linear on a transformed scale, that is, g(E[yi(s)]) = xi(s)βi + wi(s), where g(·) is a suitable link function and xi(s) is the pi × 1 vector that includes outcome‐ and location‐specific covariates.

Spatial association is captured by the spatial effects, that is, the wi's in eqn 1. Any two entries in wi correspond to the spatial random effects for outcome i from two different locations. These are assumed to be associated or correlated based upon a function of the separation or distance between the two locations. The essence of multivariate spatial modelling is to prescribe these covariances in such a way that the joint distribution of the wi's, for i = 1,2,…,m, in eqn 1 is a multivariate normal distribution.

The key modelling ingredient here is a multivariate spatial process, see, for example, Chilés & Delfiner (1999), Cressie & Wikle (2011), and Banerjee, Carlin & Gelfand (2004). In our context, the multivariate spatial process is an infinite collection of m × 1 vectors w(s) indexed by spatial coordinates s residing in two or three dimensional Euclidean space. The spatial random effects arise as a finite subset of this set indexed by the locations where the outcomes have been observed. A spatial process is well‐defined whenever any finite collection of random effects has a legitimate probability distribution. When these distributions always belong to a multivariate normal family, we say that the spatial process is a Gaussian process. In eqn 1, each wi is an ni × 1 vector of spatial random effects collected over the locations where outcome i has been observed.

The covariance among outcomes' spatial random effects provides learning about missing observations. The details on constructing and estimating the covariance among spatial random effects are given in Appendix S1. In brief say we wish to model the covariance between spatial random effects corresponding to two different outcomes at two different locations. That is, for outcomes i and j, and locations sk and sl, we must specify cov{wi(sk),wj(sl)} in a manner that will ensure a legitimate probability distribution for the joint distribution of {wi:i = 1,2,…,m}. This covariance is specified using a spatial cross‐covariance function that is constructed using outcome‐specific spatial correlation functions which include parameters to control the random effects' spatial dependence, for example rate of spatial decay. Given parameter estimates, the cross‐covariance functions provide inference about how outcomes covary in space, after accounting for covariates, and inform prediction.

We adopt the Bayesian paradigm for inference, see, for example, Gelman et al. (2004), and build hierarchical models by modelling the parameters using probability distributions. Inference about the regression slopes, the spatial random effects, and the variances and covariances is based on Markov chain Monte Carlo (MCMC) sampling from posterior distributions.

As noted in Introduction, a primary aim of our analysis is interpolation and prediction. Following terminology used in Banerjee & Gelfand (2002), when we estimate the value of an outcome at a location where some of the other outcomes have been observed, we call it interpolation. When we seek to estimate the value of an outcome at a new location, where none of the outcomes have been observed, we call it prediction. In sampling‐based Bayesian inference, we draw samples from the posterior predictive distributions of the outcome variable at unobserved locations given the observed data. The posterior predictive distribution is in fact the posterior distribution of yi(s0) given y, where s0 is the location we want to interpolate or predict. Additional details can be found in the Appendix S1.

Software implementation

The models described in the preceding section are available in the spBayes (version ≥ 0.3‐8) r package spMisalignLM and spMisalignGLM functions for Gaussian and non‐Gaussian outcomes, respectively. These functions are written in c++ and leverage r's Foreign Language Interface to call fortran blas (Basic Linear Algebra Subprograms, see Blackford et al. 2002) and lapack (Linear Algebra Package, see Anderson et al. 1999) libraries for efficient matrix computations. A heavy reliance on blas and lapack functions allows the software to leverage multiprocessor/core machines via threaded implementations of blas and lapack, for example Intel's Math Kernel Library (MKL; http://software.intel.com/en-us/intel-mkl). Use of MKL, or similar threaded libraries, can dramatically reduce sampler run‐times. For example, the illustrative analyses offered in subsequent sections were conducted using r, and hence spBayes, compiled with MKL on an Intel Ivy Bridge i7 quad‐core processor with hyperthreading. The use of these parallel matrix operations results in a near linear speedup in the MCMC sampler's run‐time with the number of CPUs. In addition to Appendix S1, Finley, Banerjee & Gelfand (2013) provide specifics on efficient implementation of the multivariate Gaussian process parameter estimation.

Illustrative analyses

Synthetic data

We consider a synthetic data set comprising three outcome variables observed over unique and common locations within a unit square domain. The analysis of these data demonstrates how the strength of correlation between outcomes' spatial random effects and range of spatial dependence influences the accuracy and precision of prediction and interpolation. The r code to reproduce this and subsequent analyses is available in Finley, Banerjee & Cook (2014).

Following model eqn 1 and using the true parameter values given in the first column of Table 1, we generated outcomes at all locations in Fig. 1(a). These outcomes are shown in Fig. 1(b–d). Outcome observations were then subsampled to create misalignment following the design in Fig. 1(a). Here, each circle contains those locations where the given outcome – identified by the circles' number – is observed. Regions where the circles overlap identify those locations where multiple outcomes were observed.

image
(a) Locations of observed and unobserved outcome variables. Data associated with each outcome are observed within its respective circle, indicated by numbers 1, 2 and 3. Intersecting regions contain locations where two or more outcomes are observed. Surfaces for outcomes 1, 2 and 3 are given in (b), (c) and (d), respectively.
Table 1. Parameter values used to generated the synthetic data in the column labelled True along with spMisalignLM estimated parameter posterior distribution 50 (2·5, 97·5) percentiles. The β·,0 correspond to outcomes' regression intercepts, ρ is the cross‐correlation between the outcomes' spatial random effects, ϕ is the spatial cross‐correlation decay parameter, and Ψ is the non‐spatial residual variances associated with each outcome. Subscripts indicate the associated outcome variable
True Estimate
β1,0 1 1·08 (0·67, 1·57)
β2,0 5 4·96 (3·33, 6·39)
β3,0 10 10·98 (9·19, 12·91)
ρ1,2 −0·8 −0·86 (−0·97, −0·66)
ρ1,3 0·9 0·87 (0·63, 0·97)
ρ2,3 −0·4 −0·62 (−0·87, −0·17)
ϕ1 6 7·83 (4·39, 14·21)
ϕ2 6 12·23 (4·72, 29·47)
ϕ3 6 12·97 (5·34, 25·21)
Ψ1 0·01 0·005 (0·002, 0·012)
Ψ2 0·01 0·004 (0·001, 0·021)
Ψ3 0·01 0·005 (0·002, 0·027)

The true spatial cross‐covariances used to generate the data can be converted to cross‐correlations to facilitate interpretation. These correlations are provided in Table 1 and also displayed in their respective regions of overlap in Fig. 1(a).

Given spatially misaligned data, the spMisalignLM function called in the r code below generates posterior samples from the parameters of the posited model. This function takes each outcome's symbolic regression model and locations where data are observed. Additionally, parameter starting values, prior distributions, MCMC Metropolis algorithm proposal distribution variances, spatial correlation function and number of desired MCMC samples are also passed to the spMisalignLM function. A full explanation of argument syntax and output is available in the function's manual available via cran.image

The resulting MCMC samples were summarized using functions in the coda package and displayed in Table 1. Here, we can see that parameters' estimated 95% credible intervals include the true parameter values. As we will see in the subsequent data analysis, Penobscot Experimental Forest LiDAR and biomass data, the parameter estimates associated with the spatial random effects' cross‐correlations can be used to explore hypotheses about association after accounting for the impact of covariates. Also, one can look to the spatial cross‐correlation decay parameters to make inference about the geographical range of dependence among observations.

Given the spMisalignLM object m.miss and spatial coordinates with associated covariates, one can interpolate and predict using the spPredict function. In the code below, spPredict is used to generate posterior predictive samples for all three outcomes at all locations in Fig. 1(a).image

Figures 2 and 3 show the median and dispersion of the resulting posterior predictive distributions. The interpolated and predicted outcomes shown in Fig. 2(a–c) closely approximate the observed data Fig. 1(b–d). We summarize the prediction uncertainty using the width between the lower and upper 95% posterior predictive credible intervals; given in Fig. 3(a), (b) and (c) for outcomes 1, 2 and 3, respectively. These surfaces show that stronger cross‐correlation between outcomes result in more precise interpolation. For example, the spatial random effects associated with outcome 1 are strongly correlated with those of outcomes 2 and 3, that is, estimated cross‐correlation of ∼ 0·86 and ∼ −0·87, respectively. As a result, Fig. 3(a) shows greater precision in interpolation of outcomes 2 and 3 when outcome 1 is observed (notice the lighter colours in circles 2 and 3). In contrast, when the cross‐correlation is weak, there is less information available to inform interpolation. For example, a cross‐correlation of ∼ −0·62 between outcomes 2 and 3 results in only marginal narrowing of the interpolation precision in either Fig. 3(b) or (c).

image
Misalignment model posterior predictive distribution median surfaces for outcomes 1, 2 and 3 in (a), (b) and (c), respectively.
image
Misalignment model posterior predictive distribution uncertainty surfaces for outcomes 1, 2 and 3 in (a), (b) and (c), respectively.

We are in a prediction setting when none of the outcomes are observed at a given location. In Fig. 1(a), prediction occurs for all locations outside of the three circles. In the absence of covariates, prediction is only informed by proximate observed locations. The stronger the spatial dependence, the more information for prediction is gleaned from observed locations. For example, the spatial decay point estimate for outcome 1 is ∼7·83, which corresponds to an effective spatial range ∼0·38 domain distance units (where we define effective spatial range as the distance at which the spatial correlation drops to 0·05). The result of this relatively long spatial range is that predictions made just outside of circle 1 show more precise posterior predictive intervals (notice the halo around the circle).

To assess the usefulness of estimating the covariance among the outcomes' spatial random effects for interpolation, we compare the misalignment model results to predictions generated by outcome‐specific univariate spatial regressions. The univariate models are equivalent to model eqn 1 but assume there is no covariance among the outcomes' random effects. These univariate models can be fit using the spLM function in spBayes. Summaries of prediction performance are given in Table 2 and show the misalignment model improves prediction accuracy and precision for each outcome, as reflected by lower RMSE and narrower 95% credible intervals compared to those of the univariate model.

Table 2. Univariate spatial regression model prediction and misalignment model interpolation performance. Performance metrics are (i) root mean squared error (RMSE) between the observed and predicted or interpolated outcomes; (ii) mean width between the lower and upper 95% posterior predictive distribution credible intervals (CI width); and (iii) the percentage of observations covered by their respective 95% credible interval (CI cover)
Univariate outcome Misalignment outcome
1 2 3 1 2 3
RMSE 0·34 1·60 1·86 0·22 1·05 1·12
CI width 1·79 6·83 6·79 0·98 4·22 5·30
CI cover 100·00 94·59 93·69 96·52 93·69 98·20

Penobscot Experimental Forest LiDAR and biomass data

This illustrative analysis considers data from a ∼60‐ha area within the US Forest Service Penobscot Experimental Forest (PEF; http://www.fs.fed.us/ne/durham/4155/penobsco.htm), ME, USA. The PEF has been studied extensively beginning in the 1950s and is under active forest management as part of several long‐term silvicultural experiments. A variety of forest variables are recorded on over 600 permanent georeferenced sample plots across the PEF. Light Detection and Ranging (LiDAR) data from the National Aeronautics and Space Administration (NASA) airborne Laser Vegetation Imaging Sensor (LVIS; http://lvis.gsfc.nasa.gov), and LiDAR, hyperspectral and thermal (G‐LiHT; Cook et al. 2013) sensors are also available for the PEF.

The objectives of this illustrative analysis are to produce predictive maps, with associated uncertainty, of (i) forest canopy height metrics from sparsely sampled LiDAR, for example, G‐LiHT, and (ii) forest variables measured at forest sample plots. For brevity, we consider only a subset of the available PEF data. The location and extent of these data are show in Fig. 4(a) and include:

image
Penobscot Experimental Forest LiDAR and sample plot data extent and locations (a). Misalignment model posterior distribution median (black point symbol) and 95% credible intervals for Gp95 and BIO in (b) and (c), respectively.
  • forest canopy height 25th and 95th percentiles, labelled Lp25 and Lp95, respectively, measured in 2003 using LVIS at a 25‐m‐diameter footprint across the extent of the study area;
  • forest canopy height 95th percentile, labelled Gp95, measured in 2012 using the G‐LiHT sensor at a 25‐m‐diameter footprint along a single transect across the study area;
  • metric tons of live above‐ground tree biomass per ha, BIO, estimated at each of the 87 permanent sample plots between 2008 and 2012.

Here, we are interested in predicting both Gp95 and BIO at a fine spatial resolution across the study area. We expect a positive relationship between Gp95, which is a proxy for canopy height, and BIO. Further, although the forest structure has changed since 2003 due to timber harvesting, the complete coverage LVIS Lp25 and Lp95 variables might explain some variability in the more current G‐LiHT Gp95, and therefore, we use these metrics as covariates in the subsequent regression. This model is specified in the code below, along with parameter starting values, prior distributions, MCMC algorithm specifics and the spatial correlation function. Although not shown, variogram analysis of univariate non‐spatial model residuals and other exploratory data analysis tools can help guide choice of prior distributions and associated hyperparameters for the spatial and non‐spatial covariances. Again, a full explanation of argument syntax and output is available in the function's manual available via cran.image

The resulting MCMC samples were summarized using functions in the coda package and displayed in Table 3. Here, we see the LVIS Lp25 covariate explains a substantial portion of variability in G‐LiHT Gp95, that is, the 95% credible intervals of the βGp95,Lp25 do not include zero. Given timber harvesting activity in the study area over the 9 years between the LiDAR measurements, the lack of relationship between the sensors' 95th canopy height percentiles is not too surprising. The long effective spatial ranges estimated for Gp95 and BIO suggest there is substantial spatial structure among the residuals. The effective spatial ranges are calculated using the cross‐covariance and spatial correlation functions' parameter estimates, see Finley, Banerjee & Cook (2014) and Gelfand et al. 2004, p. 292). Further, Gp95's and BIO's spatial random effects are moderately correlated ρGp95,BIO≈0·36. Estimating this cross‐correlation is useful for exploring hypotheses about strength and direction of association among the outcomes' residual spatial structure, perhaps after accounting for some covariates. In this analysis, we could say there is a positive and significant, that is, credible intervals do not include zero, correlation between the residual spatial structure of Gp95 and BIO.

Table 3. Estimated parameter posterior distribution 50 (2·5, 97·5) percentiles for Penobscot Experimental Forest misalignment model. The β·,0 correspond to outcomes' regression intercepts, ρGp95,BIO is the cross‐correlations between the outcomes' spatial random effects, and Ψ is the non‐spatial residual variances associated with each outcome. Estimates for ϕGp95 and ϕBIO have been transformed to their respective effective spatial range in km
Estimate
βGp95,0 16·18 (12·45, 19·98)
βGp95,Lp25 0·45 (0·2, 0·68)
βGp95,Lp95 −0·04 (−0·13, 0·05)
βBio,0 9·83 (8·95, 10·53)
ΨGp95 0·75 (0·28, 0·91)
ΨBIO 0·48 (0·26, 1·09)
ρGp95,BIO 0·36 (0·16, 0·9)
Gp95 eff. range (km) 2·99 (1·3, 4·86)
BIO eff. range (km) 0·49 (0·17, 1·54)

Given the spMisalignLM object m.miss and spatial coordinates with associated covariates, one can interpolate and predict using the spPredict function. In the supplemental analysis code (Finley, Banerjee & Cook 2014), spPredict is used to generate posterior predictive samples for Gp95 and BIO at all 2261 locations where Lp25 and Lp95 were observed. Surfaces of the resulting posterior predictive distributions' median and width between the lower and upper 95% credible intervals are given for Gp95 and BIO in Fig. 5. The posterior predictive medians shown in Fig. 5(a) and (b) closely approximate the observed data, see, for example, model fitted versus observed values in Fig. 4(b) and (c). However, more pertinent to this illustration, Fig. 5(c) and (d) shows narrowing of the posterior predictive distribution at and near locations of interpolation for the respective outcome. For example, the narrowing of the posterior predictive distributions for predicted Gp95 at and near observed BIO locations is clearly seen in Fig. 5(c). Similarly, Fig. 5(d) shows the posterior predictive distributions for BIO narrow within and adjacent to the G‐LiHT transect where Gp95 is observed.

image
Penobscot Experimental Forest misalignment model posterior predictive distribution summary surfaces. Posterior median for Gp95 and BIO given in (a) and (b), respectively. Range between the lower and upper 95% credible intervals for Gp95 and BIO given in (c) and (d), respectively.

Discussion and summary

The multivariate model should yield improved predictive inference, over univariate models, in settings where there is moderate‐to‐strong covariance among outcomes' spatial random effects and where the spatial range of dependence is sufficiently long as to allow observations to contribute information across locations.

The development in the section ‘Multivariate spatial regression with misalignment', and subsequent analyses, assumes a constant covariance among outcomes over the domain. This assumption might be reasonable in many settings. However, a more flexible model would pursue a non‐stationary formulation of the cross‐covariance matrix, see, for example, Guhaniyogi et al. (2013). Such non‐stationary cross‐covariance models could improve inference about changing patterns in the strength and direction of the correlation between outcomes at broad spatial scales.

In addition to improving prediction and interpolation in some settings, the multivariate misalignment model could be useful in designing efficient monitoring efforts. For example, if one had an a‐priori estimate of the covariance among outcomes, or could learn about this covariance through an initial sampling effort, then resources could be used for an appropriate level of sampling of outcome subsets. This represents a very active area of work that builds upon a rich literature on sampling designs for spatiotemporal environmental data, see, for example, Mateu & Müller (2013). Further development of the multivariate misalignment model for inference about spatiotemporal processes is a logical next step and would likely find application for exploring complex and dynamic ecological processes.

Acknowledgements

This work was supported by National Science Foundation Grants DMS‐1106609, EF‐1137309, EF‐1241874 and EF‐1253225, as well as NASA Carbon Monitoring System grants.

    Data accessibility

    Data deposited in the Dryad repository: http://datadryad.org/resource/doi:10.5061dryad.3g9s2

        Number of times cited according to CrossRef: 8

        • A Bayesian framework for emergent constraints: case studies of climate sensitivity with PMIP, Climate of the Past, 10.5194/cp-16-1715-2020, 16, 5, (1715-1735), (2020).
        • The recent past and promising future for data integration methods to estimate species’ distributions, Methods in Ecology and Evolution, 10.1111/2041-210X.13110, 10, 1, (22-37), (2019).
        • Spatially-explicit modelling with support of hyperspectral data can improve prediction of plant traits, Remote Sensing of Environment, 10.1016/j.rse.2019.05.019, (111200), (2019).
        • Residential radon and cancer mortality in Galicia, Spain, Science of The Total Environment, 10.1016/j.scitotenv.2017.08.144, 610-611, (1125-1132), (2018).
        • Geostatistical estimation of forest biomass in interior Alaska combining Landsat-derived tree cover, sampled airborne lidar and field observations, Remote Sensing of Environment, 10.1016/j.rse.2018.04.044, 212, (212-230), (2018).
        • Modelling spatio-temporal patterns of disease for spatially misaligned data: An application on measles incidence data in Namibia from 2005-2014, PLOS ONE, 10.1371/journal.pone.0201700, 13, 8, (e0201700), (2018).
        • Compositional analysis of topsoil metals and its associations with cancer mortality using spatial misaligned data, Environmental Geochemistry and Health, 10.1007/s10653-016-9904-3, 40, 1, (283-294), (2017).
        • Modelling spatial patterns of misaligned disease data: An application on measles incidence in Namibia, Clinical Epidemiology and Global Health, 10.1016/j.cegh.2017.01.002, 5, 4, (190-195), (2017).