1/ 22
Introduction DIMAQ Results Conclusions
Data Integration Model for Air Quality: A Hierarchical Approach to - - PowerPoint PPT Presentation
Introduction DIMAQ Results Conclusions Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution Matthew Thomas Supervised by Prof. Gavin Shaddick In collaboration with
1/ 22
Introduction DIMAQ Results Conclusions
2/ 22
Introduction DIMAQ Results Conclusions
◮ Introduction ◮ DIMAQ ◮ Results ◮ Conclusions
3/ 22
Introduction DIMAQ Results Conclusions
◮ Air pollution has been identified as a global health priority. ◮ In 2016, the World Health Organisation (WHO) estimated that
◮ The Global Burden of Disease (GBD) project estimate that in 2015
◮ Burden of disease calculations require accurate estimates of
4/ 22
Introduction DIMAQ Results Conclusions
◮ Accurate estimates of exposure to air pollution are required
◮ at global, national and local levels ◮ with associated measures of uncertainty.
◮ While networks are expanding, ground monitoring is limited in
10 20 30 40 50 60 70 80 90+
5/ 22
Introduction DIMAQ Results Conclusions
◮ Can utilise information from other
◮ satellite remote sensing ◮ atmospheric models ◮ population estimates ◮ land use ◮ local network characteristics.
◮ Result of modelling and will be
6/ 22
Introduction DIMAQ Results Conclusions
◮ Developed to the Data Integration Model for Air Quality
◮ DIMAQ calibrates ground measurements to estimates
◮ satellite remote sensing, ◮ specific components of chemical transport models ◮ land use ◮ population.
◮ The coefficients in the calibration model are estimated by
◮ Model allows borrowing from higher aggregations and if
◮ Exploits a geographical nested hierarchy. ◮ Achieved using hierarchical random effects.
7/ 22
Introduction DIMAQ Results Conclusions
Asia Pacific, High Income Asia, Central Asia, East Asia, South Asia, Southeast Australasia Caribbean Europe, Central Europe, Eastern Europe, Western Latin America, Andean Latin America, Central Latin America, Southern Latin America, Tropical North Africa / Middle East North America, High Income Oceania Sub−Saharan Africa, Central Sub−Saharan Africa, East Sub−Saharan Africa, Southern Sub−Saharan Africa, West
Asia Pacific, High Income Asia, Central Asia, East Asia, South Asia, Southeast Australasia Caribbean Europe, Central Europe, Eastern Europe, Western Latin America, Andean Latin America, Central Latin America, Southern Latin America, Tropical North Africa / Middle East North America, High Income Oceania Sub−Saharan Africa, Central Sub−Saharan Africa, East Sub−Saharan Africa, Southern Sub−Saharan Africa, West
Figure: Map of regions.
8/ 22
Introduction DIMAQ Results Conclusions
High income North Africa / Middle East South Asia Central Europe, Eastern Europe, Central Asia Latin America and Caribbean Southeast Asia, East Asia and Oceania Sub−Saharan Africa
Figure: Map of super-regions.
9/ 22
Introduction DIMAQ Results Conclusions
◮ Ground measurements at point locations, s, within grid cell, l,
◮ The model consists of a set of fixed and random effects, for both
10/ 22
Introduction DIMAQ Results Conclusions
◮ The random effect terms have contributions from the country,
q,ijk + βR q,jk + βSR q,k ◮ The intercept also having a random effect for the cell
0,lijk + βC 0,ijk + βR 0,jk + βSR 0,k
11/ 22
Introduction DIMAQ Results Conclusions
◮ The coefficients for super-regions are distributed with mean
k
SR) ◮ The coefficients for regions are distributed with mean equal to
jk ∼ N(βSR k , σ2 R,k) ◮ The coefficients for a country is distributed with mean equal to
ijk ∼ N(βR jk, σ2 C,jk)
12/ 22
Introduction DIMAQ Results Conclusions
◮ Approximate Bayesian inference, such as Integrated Nested
◮ INLA performs numerical calculations of posterior densities
◮ Latent Gaussian models allows for sparse matrices, and therefore
13/ 22
Introduction DIMAQ Results Conclusions
◮ R-INLA was used to implement DIMAQ. ◮ Unable to run this model on standard computers (4-8GB RAM). ◮ Required the use of a High-Performance Computing (HPC)
◮ Balena cluster at University of Bath. ◮ 2 × 512GB RAM nodes (32 × 32GB RAM cores).
◮ Took an iterative approach to prediction.
14/ 22
Introduction DIMAQ Results Conclusions
20 40 60 1 2 3 4 5 6 7
Super Region Population Weighted Root Mean Square Error Model
GBD2013 DIMAQ
Figure: Summaries of predictive ability of the GBD2013 model and DIMAQ, for each of seven super–regions: 1, High income; 2, Central Europe, Eastern Europe, Central Asia; 3, Latin America and Caribbean; 4, Southeast Asia, East Asia and Oceania; 5, North Africa / Middle East; 6, Sub-Saharan Africa; 7, South Asia. For each model, population weighted root mean squared errors (µgm−3) are given with dots denoting the median of the distribution from 25 training/evaluation sets and the vertical lines the range of values.
15/ 22
Introduction DIMAQ Results Conclusions
Figure: Median estimates of annual averages of PM2.5 (µgm−3) for 2014 for each grid cell (0.1o × 0.1o resolution) using DIMAQ.
16/ 22
Introduction DIMAQ Results Conclusions
Figure: Half the width of 95% posterior credible intervals for 2014 for each grid cell (0.1o × 0.1o resolution) using DIMAQ.
17/ 22
Introduction DIMAQ Results Conclusions
Figure: Medians of posterior distributions for estimates of annual mean PM2.5 concentrations (µgm−3) for 2014, in China. Figure: Probability of exceeding 35 µgm−3 using a Bayesian hierarchical model for each grid cell (0.1o × 0.1o resolution) for 2014, in China.
18/ 22
Introduction DIMAQ Results Conclusions
500 1000 1500 50 100
µgm−3 Number of grid cells
Figure: Estimated annual average concentrations
Black crosses denote the annual averages recorded at ground monitors.
1 2 50 100
µgm−3 Percentage of total population (%)
Figure: Estimated population level exposures (blue bars) and population weighted measurements from ground monitors (black bars).
19/ 22
Introduction DIMAQ Results Conclusions
◮ DIMAQ integrates data from multiple sources with producing
◮ Estimates used by the WHO and GBD in burden of disease
◮ Future Developments
◮ Higher resolution estimates ◮ Within country variability ◮ Allowing for errors and biases in covariates ◮ Use data at native resolutions
◮ Possible approaches to address these issues
◮ Statistical downscaling ◮ Bayesian melding.
20/ 22
Introduction DIMAQ Results Conclusions
21/ 22
Introduction DIMAQ Results Conclusions
◮ DIMAQ Paper:
◮ WHO Report:
◮ GBD Paper:
◮ Interactive Map:
22/ 22
Introduction DIMAQ Results Conclusions