Example exploration Beijing R.W. Oldford Air quality in Beijing - - PowerPoint PPT Presentation
Example exploration Beijing R.W. Oldford Air quality in Beijing - - PowerPoint PPT Presentation
Example exploration Beijing R.W. Oldford Air quality in Beijing Problem Interest lies in understanding the quality of air in Beijing, China. Air quality in Beijing Problem Interest lies in understanding the quality of air in Beijing, China.
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China.
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index.
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm).
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years).
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport.
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”).
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).
Questions What is the target population?
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).
Questions What is the target population? The study population?
Air quality in Beijing
Problem Interest lies in understanding the quality of air in Beijing, China. A measure of that quality is PM2.5 which determines an Air Quality Index. This is a measure of atmospheric particulate matter having diameter less than 2.5 µm (diameter of a human hair is ≈ 50 − 70µm). Plan The plan is to regularly (hourly) collect data on pm2.5 and meteorological variates in Beijing over a sustained time period (a few years). Air quality can be measured at the US Embassy in Beijing and meteorological date from the Beijing Capital International Airport. Data Hourly data between January 1, 2010 and December 31, 2014 were collected (available in 2017 from the UCI Machine Learning Repository and on the course website as “BeijingAir.csv”). A single table of 43,824 rows (each hour) and 12 measurements (+ the row number).
Questions What is the target population? The study population? The sample?
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation
- f heart or lung disease and premature
mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation
- f heart or lung disease and premature
mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation
- f heart or lung disease and premature
mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion. 150.5 to 250.4 Very Unhealthy: 201 to 300 Significant aggravation of heart or lung disease and premature mortality in persons with cardiopulmonary disease and the el- derly; significant increase in respiratory ef- fects in general population. People with respiratory or heart disease, the elderly and children should avoid any
- utdoor activity;
everyone else should avoid prolonged exertion.
Air quality in Beijing - data
PM2.5 concentration is measured in µg per cubic metre. For example, the PM2.5 concentration can be as low as 5 µg/m3 or less on a very clear, non-hazy, day. It also matters how long the PM2.5 concentration stays at a particular level. For example, a 24-hour concentration of PM2.5 greater than 35.4 µg/m3 is considered unhealthy. The health effects of various ranges have been described by the US Environmental Protection Agency as follows:
PM2.5 Air Quality Index PM2.5 Health Effects Precautionary Actions 0 to 12.0 Good: 0 to 50 Little to no risk. None. 12.1 to 35.4 Moderate: 51 to 100 Unusually sensitive individuals may expe- rience respiratory symptoms. Unusually sensitive people should consider reducing prolonged or heavy exertion. 35.5 to 55.4 Unhealthy for Sensitive Groups: 101 to 150 Increasing likelihood of respiratory symp- toms in sensitive individuals, aggravation
- f heart or lung disease and premature
mortality in persons with cardiopulmonary disease and the elderly. People with respiratory or heart disease, the elderly and children should limit pro- longed exertion. 55.5 to 150.4 Unhealthy: 151 to 200 Increased aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; increased respiratory effects in gen- eral population. People with respiratory or heart disease, the elderly and children should avoid pro- longed exertion; everyone else should limit prolonged exertion. 150.5 to 250.4 Very Unhealthy: 201 to 300 Significant aggravation of heart or lung disease and premature mortality in persons with cardiopulmonary disease and the el- derly; significant increase in respiratory ef- fects in general population. People with respiratory or heart disease, the elderly and children should avoid any
- utdoor activity;
everyone else should avoid prolonged exertion. 250.5 to 500.4 Hazardous: 301 to 500 Serious aggravation of heart or lung dis- ease and premature mortality in persons with cardiopulmonary disease and the el- derly; serious risk of respiratory effects in general population. Everyone should avoid any outdoor exer- tion; people with respiratory or heart dis- ease, the elderly and children should re- main indoors. Source: US EPA, taken from https://blissair.com/what-is-pm-2-5.htm
Air quality in Beijing - the data
Time variates (for each row) are
◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row
Air quality in Beijing - the data
Time variates (for each row) are
◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row
These variates define the primary key in the table beijing %>% count(year, month, day, hour) %>% filter(n > 1) %>% nrow()== 0 ## [1] TRUE
Air quality in Beijing - the data
Time variates (for each row) are
◮ year year of the data in that row ◮ month month of the data in that row ◮ day day of the data in that row ◮ hour hour of the data in that row
These variates define the primary key in the table beijing %>% count(year, month, day, hour) %>% filter(n > 1) %>% nrow()== 0 ## [1] TRUE Although the authors add a column called No indicating the row number as their primary key to the table.
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals).
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction.
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:
“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating”
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:
“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” In the data set we have summary(as.factor(beijing$cbwd)) ## cv NE NW SE ## 9387 4997 14150 15290
Air quality in Beijing - the data
Meterological variates (for each row) are
◮ DEWP The dew point in degrees Celsius This is the atmospheric temperature below
which water droplets begin to condense and dew can form.
◮ TEMP Air temperature in degrees Celsius ◮ PRES Atmospheric pressure in hPA (hecto-pascals). ◮ cbwd The combined wind direction. According to the authors:
“The weather data had 16 wind directions. Our study shows that the directions can be grouped into five broad categories: northwest (NW), which includes W, WNW, NW, NNW and N; northeast (NE), for NNE, NE and ENE; southeast (SE), covering E, ESE, SE, SSE and S; southwest (SW), having SSW, SW and WSW; and calm and variable (CV). The decision to allocate E to SE and W to NW was based on the locations of major polluting industries around Beijing.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” In the data set we have summary(as.factor(beijing$cbwd)) ## cv NE NW SE ## 9387 4997 14150 15290 Only 4 different values (of which only 3 are directions).
Air quality in Beijing - the data
Meterological variates (continued):
◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in
the identified sustained direction (presumably by cbwd).
Air quality in Beijing - the data
Meterological variates (continued):
◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in
the identified sustained direction (presumably by cbwd). According to the authors, this measurement “takes into account wind velocity in a particular direction. It reflects the fact that it is sustained wind from a fixed direction that reduces or increases the pollution. CWP is the cumulated wind speed from the start of the wind direction to the time of
- interest. When the wind direction changes, the CWP under a new
direction starts to cumulate again. For calm wind, we use 0.445 m/s as the unit of cumulation.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating”
Air quality in Beijing - the data
Meterological variates (continued):
◮ Iws This is a measure of the accumulated wind speed (in summed m/s) in
the identified sustained direction (presumably by cbwd). According to the authors, this measurement “takes into account wind velocity in a particular direction. It reflects the fact that it is sustained wind from a fixed direction that reduces or increases the pollution. CWP is the cumulated wind speed from the start of the wind direction to the time of
- interest. When the wind direction changes, the CWP under a new
direction starts to cumulate again. For calm wind, we use 0.445 m/s as the unit of cumulation.” Source: “Assessing Beijing’s PM2.5 pollution: severity, weather impact, APEC and winter heating” This accumulates as long as the wind direction is the same, that is over previous hours as well. (See analysis.)
Air quality in Beijing - the data
Meterological variates (continued):
◮ Ir This is a measure of the accumulated number of hours of rain. For any
hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)
Air quality in Beijing - the data
Meterological variates (continued):
◮ Ir This is a measure of the accumulated number of hours of rain. For any
hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)
◮ Is This is a measure of the accumulated number of hours of snow. For any
hour, this is the number of hours (including the present one) over which it has been snowing. (See analysis.)
Air quality in Beijing - the data
Meterological variates (continued):
◮ Ir This is a measure of the accumulated number of hours of rain. For any
hour, this is the number of hours (including the present one) over which it has been raining. (See analysis.)
◮ Is This is a measure of the accumulated number of hours of snow. For any
hour, this is the number of hours (including the present one) over which it has been snowing. (See analysis.)
Air quality in Beijing
Analysis What does the pm2.5 look like?
Air quality in Beijing
Analysis What does the pm2.5 look like?
Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good
0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000
pm2.5 density PM2.5 measure of air quality
Beijing, China: 2010 − 2014
Air quality in Beijing
Analysis What does the pm2.5 look like?
Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good
0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000
pm2.5 density PM2.5 measure of air quality
Beijing, China: 2010 − 2014 PM2.5 has been 30 − 239 in Beijing over the past 48 hours (March 20, 2019).
Air quality in Beijing
Analysis What does the pm2.5 look like?
Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good
0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000
pm2.5 density PM2.5 measure of air quality
Beijing, China: 2010 − 2014 PM2.5 has been 30 − 239 in Beijing over the past 48 hours (March 20, 2019). From China’s aqicn.org: Kitchener 5 − 63
Air quality in Beijing
Analysis What does the pm2.5 look like?
Off the charts! Hazardous Very Unhealthy Unhealthy Unhealthy for Sensitive Groups Moderate Good
0.0000 0.0025 0.0050 0.0075 0.0100 250 500 750 1000