Biased Resampling Strategies for Imbalanced Spatio-temporal Forecasting
M A R I AN A O L IV EI R A¹ ⋅ ² , N U N O M O N I Z ¹ ⋅ ² , L U Í S T O R G O ¹ ⋅ ² ⋅ ³ A N D V Í T O R S A N T O S C O S TA ¹ ⋅ ² 5 - 8 O C T O B E R 2 0 1 9
1 2 3
Biased Resampling Strategies for Imbalanced Spatio-temporal - - PowerPoint PPT Presentation
Biased Resampling Strategies for Imbalanced Spatio-temporal Forecasting M A R I AN A O L IV EI R A , N U N O M O N I Z , L U S T O R G O A N D V T O R S A N T O S C O S TA 5 - 8 O C T O B
M A R I AN A O L IV EI R A¹ ⋅ ² , N U N O M O N I Z ¹ ⋅ ² , L U Í S T O R G O ¹ ⋅ ² ⋅ ³ A N D V Í T O R S A N T O S C O S TA ¹ ⋅ ² 5 - 8 O C T O B E R 2 0 1 9
1 2 3
Remote monitoring equipment (Source: NDSU) Air quality measurement station network (Source: Zheng et al., 2013) PM 2.5 pollution levels (time series)
2/28
IMBALANCED DOMAIN RELEVANCE FUNCTION
3/28
approaches are often used to tackle this problem
not i.i.d. -- there are spatial and temporal dependencies
Motivation
sampling bias that takes into account spatio-temporal dependencies improve performance?
dimensions differently?
Research Questions
4/28
5/28
Spatio-temporal Random Under-sampling (STRUS)
Spatio-temporal Random Over-sampling (STROS)
6/28
Temp mporal weight
More recent
Spatialweight (extreme cases)
Isolated rare cases
Spatialweight (normal cases)
Far away from rare cases
Which cases should have higher probability of being selected during resampling?
At each time-step
7/28
What if spatial and temporal dimensions havedifferent impacts? Add weighting parameter α 8/28
9/28
Data source ID # time IDs # loc IDs % available % extreme MESA 10 280 20 100 7.3 NCDC 20 105 72 100 6.0 30 6.3 TCE 31 330 26 100 3.8 32 2.4 Rural 40 4k 70 ~49 7.5 50 3.5 Beijing Air 51 11k 36 ~40 5.5 52 8.6 53 3.8
10/28
Calculate spatio- temporal indicators
Feature engineering
Resampling
Forest
Model
PRE-PROCESSING 11/28
Evaluation metrics Performance estimation procedure
12/28
for numeric prediction:
13/28
train test
time y x
14/28
15/28
16/28
INTERNAL ESTIMATION PROCEDURE For each training set: Temporal-block CV PARAMETER GRID SEARCH
Parameter Values u 0.2; 0.4; 0.6; 0.8; 0.95
α 0; 0.25; 0.5; 0.75; 1
time y x
17/28
18/28
For all training sets: Fixed parameters at middle of the grid. FIXED PARAMETERS
Parameter Values u 0.2; 0.4; 0.6; 0.8; 0.95
α 0; 0.25; 0.5; 0.75; 1 19/28
20/28
EXTERNAL ESTIMATION PROCEDURE For each data set: Choose parameters with best results on the external (prequential) procedure. PARAMETER GRID SEARCH
Parameter Values u 0.2; 0.4; 0.6; 0.8; 0.95
α 0; 0.25; 0.5; 0.75; 1 21/28
22/28
Parametrization None ROS STROS RUS STRUS Internal tuning 4.60 3.07 2.37 2.67 2.30 Fixed a priori 4.53 2.77 2.73 2.57 2.40 Optimal a posteriori 5.00 3.07 2.27 2.93 1.73
23/28
TWO PARAMETERS DIMENSION WEIGHTING
RUS/ROS RUS/ROS RUS/ROS
24/28
PRECISION RECALL
RUS/ROS RUS/ROS RUS/ROS RUS/ROS RUS/ROS RUS/ROS
25/28
26/28
cases that are spatially distant from extreme cases
27/28
Code available at https://github.com/mrfoliveira/STResampling-DSAA2019
28/28