Example exploration Beijing R.W. Oldford Air quality in Beijing - - PowerPoint PPT Presentation

example exploration
SMART_READER_LITE
LIVE PREVIEW

Example exploration Beijing R.W. Oldford Air quality in Beijing - - PowerPoint PPT Presentation

Example exploration Beijing R.W. Oldford Air quality in Beijing Problem Interest lies in understanding the quality of air in Beijing, China. Air quality in Beijing Problem Interest lies in understanding the quality of air in Beijing, China.


slide-1
SLIDE 1

Example exploration

Beijing R.W. Oldford

slide-2
SLIDE 2

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China.

slide-3
SLIDE 3

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index.

slide-4
SLIDE 4

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm).

slide-5
SLIDE 5

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years).

slide-6
SLIDE 6

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport.

slide-7
SLIDE 7

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”).

slide-8
SLIDE 8

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).

slide-9
SLIDE 9

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).

Questions What is the target population?

slide-10
SLIDE 10

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).

Questions What is the target population? The study population?

slide-11
SLIDE 11

Air quality in Beijing

Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).

Questions What is the target population? The study population? The sample?

slide-12
SLIDE 12

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre.

slide-13
SLIDE 13

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day.

slide-14
SLIDE 14

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level.

slide-15
SLIDE 15

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy.

slide-16
SLIDE 16

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None.

slide-17
SLIDE 17

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion.

slide-18
SLIDE 18

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation

  • f heart or lung disease and premature

mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion.

slide-19
SLIDE 19

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation

  • f heart or lung disease and premature

mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion.

slide-20
SLIDE 20

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation

  • f heart or lung disease and premature

mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion. 150.5 to 250.4 Very Unhealthy: 201 to 300 Significant aggravation of heart or lung disease and premature mortality in persons with cardiopulmonary disease and the el- derly; significant increase in respiratory ef- fects in general population. People with respiratory or heart disease, the elderly and children should avoid any

  • utdoor activity;

everyone else should avoid prolonged exertion.

slide-21
SLIDE 21

Air quality in Beijing - data

PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:

PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation

  • f heart or lung disease and premature

mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion. 150.5 to 250.4 Very Unhealthy: 201 to 300 Significant aggravation of heart or lung disease and premature mortality in persons with cardiopulmonary disease and the el- derly; significant increase in respiratory ef- fects in general population. People with respiratory or heart disease, the elderly and children should avoid any

  • utdoor activity;

everyone else should avoid prolonged exertion. 250.5 to 500.4 Hazardous: 301 to 500 Serious aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; serious risk of respiratory effects in general population. Everyone should avoid any outdoor exer- tion; people with respiratory or heart dis- ease, the elderly and children should re- main indoors. Source: US EPA, taken from https://blissair.com/what-is-pm-2-5.htm

slide-22
SLIDE 22

Air quality in Beijing - the data

Time variates (for each row) are

◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row

slide-23
SLIDE 23

Air quality in Beijing - the data

Time variates (for each row) are

◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row

These variates define the primary key in the table beijing %>% count(year, month, day, hour) %>% filter(n > 1) %>% nrow()== 0 ## [1] TRUE

slide-24
SLIDE 24

Air quality in Beijing - the data

Time variates (for each row) are

◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row

These variates define the primary key in the table beijing %>% count(year, month, day, hour) %>% filter(n > 1) %>% nrow()== 0 ## [1] TRUE Although the authors add a column called No indicating the row number as their primary key to the table.

slide-25
SLIDE 25

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius

slide-26
SLIDE 26

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

slide-27
SLIDE 27

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius

slide-28
SLIDE 28

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals).

slide-29
SLIDE 29

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction.

slide-30
SLIDE 30

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:

“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating”

slide-31
SLIDE 31

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:

“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” In the data set we have summary(as.factor(beijing$cbwd)) ## cv NE NW SE ## 9387 4997 14150 15290

slide-32
SLIDE 32

Air quality in Beijing - the data

Meterological variates (for each row) are

◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below

which water droplets begin to condense and dew can form.

◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:

“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” In the data set we have summary(as.factor(beijing$cbwd)) ## cv NE NW SE ## 9387 4997 14150 15290 Only 4 different values (of which only 3 are directions).

slide-33
SLIDE 33

Air quality in Beijing - the data

Meterological variates (continued):

◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in

the identified sustained direction (presumably by cbwd).

slide-34
SLIDE 34

Air quality in Beijing - the data

Meterological variates (continued):

◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in

the identified sustained direction (presumably by cbwd). According to the authors, this measurement “takes into account wind velocity in a particular direction. It reflects the fact that it is sustained wind from a fixed direction that reduces or increases the pollution. CWP is the cumulated wind speed from the start of the wind direction to the time of

  • interest. When the wind direction changes, the CWP under a new

direction starts to cumulate again. For calm wind, we use 0.445 m/s as the unit of cumulation.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating”

slide-35
SLIDE 35

Air quality in Beijing - the data

Meterological variates (continued):

◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in

the identified sustained direction (presumably by cbwd). According to the authors, this measurement “takes into account wind velocity in a particular direction. It reflects the fact that it is sustained wind from a fixed direction that reduces or increases the pollution. CWP is the cumulated wind speed from the start of the wind direction to the time of

  • interest. When the wind direction changes, the CWP under a new

direction starts to cumulate again. For calm wind, we use 0.445 m/s as the unit of cumulation.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” This accumulates as long as the wind direction is the same, that is over previous hours as well. (See analysis.)

slide-36
SLIDE 36

Air quality in Beijing - the data

Meterological variates (continued):

◮ Ir This is a measure of the accumulated number of hours of rain. For any

hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)

slide-37
SLIDE 37

Air quality in Beijing - the data

Meterological variates (continued):

◮ Ir This is a measure of the accumulated number of hours of rain. For any

hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)

◮ Is This is a measure of the accumulated number of hours of snow. For any

hour, this is the number of hours (including the present one) over which it has been snowing. (See analysis.)

slide-38
SLIDE 38

Air quality in Beijing - the data

Meterological variates (continued):

◮ Ir This is a measure of the accumulated number of hours of rain. For any

hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)

◮ Is This is a measure of the accumulated number of hours of snow. For any

hour, this is the number of hours (including the present one) over which it has been snowing. (See analysis.)

slide-39
SLIDE 39

Air quality in Beijing

Analysis What does the pm2.5 look like?

slide-40
SLIDE 40

Air quality in Beijing

Analysis What does the pm2.5 look like?

Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good

0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000

pm2.5 density PM2.5 measure of air quality

Beijing, China: 2010 − 2014

slide-41
SLIDE 41

Air quality in Beijing

Analysis What does the pm2.5 look like?

Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good

0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000

pm2.5 density PM2.5 measure of air quality

Beijing, China: 2010 − 2014 PM2.5 has been 30 − 239 in Beijing over the past 48 hours (March 20, 2019).

slide-42
SLIDE 42

Air quality in Beijing

Analysis What does the pm2.5 look like?

Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good

0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000

pm2.5 density PM2.5 measure of air quality

Beijing, China: 2010 − 2014 PM2.5 has been 30 − 239 in Beijing over the past 48 hours (March 20, 2019). From China’s aqicn.org: Kitchener 5 − 63

slide-43
SLIDE 43

Air quality in Beijing

Analysis What does the pm2.5 look like?

Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good

0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000

pm2.5 density PM2.5 measure of air quality

Beijing, China: 2010 − 2014 PM2.5 has been 30 − 239 in Beijing over the past 48 hours (March 20, 2019). From China’s aqicn.org: Kitchener 5 − 63; Toronto 17 − 97.