First International Workshop on Learning over Multiple Contexts LMCE

C. Ferri, J. Hernández-Orallo, A. Martínez-Usó and M.J. Ramírez-Quintana
DSIC, UPV, València, Spain
{cferri, jorallo, admarus, mramirez}@dsic.upv.es
First International Workshop on Learning over Multiple Contexts, LMCE 2014
Nancy, 19 September 2014

• Motivation
• The Noise setting
• Context Plots and Dominance
• Experiments
• Conclusions and Future Work

• Very often, the operating contexts (OC) at training time and at deployment time are different.
[Figure: training uses idealistic, "perfect" features; the operating context makes the features seen at deployment "noisy" and less idealistic.]
• The error of the model depends on the level of noise introduced by the OC.

• Alarm system where two models (A and B) have been trained (ideal conditions, OC = temperature in [0, 30])
• Validation: A > B (A is better!)
• Deployment:
  • How these OC affect the sensors is also known
  • The OC in deployment are given
• Which model is better for each OC?
• For several OC, model B could now be better!

In order to answer this question we propose:
• To evaluate the models with different levels of simulated noise.
• To draw a context plot with all models, and to determine dominance regions.
• During deployment, the noise level is derived from the OC and the best model for that noise level is applied.
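The proposal above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dominance_regions` and `model_for_noise`, the noise levels and the error values are all hypothetical and are not taken from the experiments. Each model is assumed to have been evaluated at the same list of noise levels, and a deployment noise level is mapped to the nearest evaluated level.

```python
def dominance_regions(levels, errors):
    """Group consecutive noise levels by the model with the lowest error.

    levels: list of evaluated noise levels, in increasing order.
    errors: dict mapping model name -> list of errors, one per level.
    Returns a list of (first_level, last_level, model) tuples.
    """
    regions = []
    for i, nu in enumerate(levels):
        # Ties are broken alphabetically (an arbitrary choice for this sketch).
        best = min(sorted(errors), key=lambda m: errors[m][i])
        if regions and regions[-1][2] == best:
            start, _, _ = regions[-1]
            regions[-1] = (start, nu, best)  # extend the current region
        else:
            regions.append((nu, nu, best))   # a new region starts here
    return regions


def model_for_noise(levels, errors, noise):
    """Pick the best model at the evaluated level closest to `noise`."""
    nearest = min(range(len(levels)), key=lambda i: abs(levels[i] - noise))
    return min(sorted(errors), key=lambda m: errors[m][nearest])


# Hypothetical context-plot data for two models A and B.
levels = [0.0, 0.5, 1.0, 1.5]
errors = {
    "A": [0.10, 0.20, 0.35, 0.50],
    "B": [0.15, 0.18, 0.25, 0.30],
}
print(dominance_regions(levels, errors))
print(model_for_noise(levels, errors, 1.2))
```

With these made-up numbers, A dominates only without noise and B dominates for every noisy level, which mirrors the alarm-system example where the model that wins in validation is not the best one once the OC introduces noise.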

Noise is calculated by using probability distributions:
• Numerical attributes
  • we estimate the standard deviation $\sigma$ of all values of the attribute
  • for a level of noise $\nu$, we modify a value $x$ using a normal distribution: $x' \sim N(x, \sigma \cdot \nu)$
• Nominal attributes
  • we estimate the frequency of each value $v_i$: $p = (p_{v_1}, \ldots, p_{v_n})$
  • for an instance $x$ with value $v_i$ we build the vector $t = (t_1, \ldots, t_n)$, with $t_i = 1$ and $t_j = 0$ if $i \neq j$
  • for a level of noise $\nu$ we calculate $p' = \alpha \cdot p + (1 - \alpha) \cdot t$, where $\alpha = 1 - e^{-\nu}$
  • we use $p'$ to sample the new value $x'$
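The noise model above could be implemented roughly as follows. This is a minimal sketch using only the Python standard library, not the code used in the experiments. The function names are hypothetical, and two details are assumptions: $\sigma \cdot \nu$ is treated as the standard deviation of the normal distribution, and $\sigma$ and the value frequencies are passed in already estimated.

```python
import math
import random


def noisy_numeric(x, sigma, nu, rng):
    """Sample x' ~ N(x, sigma * nu).

    Assumption: sigma * nu is used as the standard deviation.
    """
    return rng.gauss(x, sigma * nu)


def nominal_distribution(value, freqs, nu):
    """Return p' = alpha * p + (1 - alpha) * t with alpha = 1 - exp(-nu).

    freqs: dict mapping each nominal value to its estimated frequency (p).
    t is the one-hot vector that puts all mass on the observed `value`.
    """
    alpha = 1.0 - math.exp(-nu)
    return {
        v: alpha * p + (1.0 - alpha) * (1.0 if v == value else 0.0)
        for v, p in freqs.items()
    }


def noisy_nominal(value, freqs, nu, rng):
    """Sample the new value x' from p'."""
    dist = nominal_distribution(value, freqs, nu)
    values = list(dist)
    return rng.choices(values, weights=[dist[v] for v in values], k=1)[0]


rng = random.Random(0)                    # fixed seed for repeatability
freqs = {"a": 0.5, "b": 0.3, "c": 0.2}    # hypothetical value frequencies
print(noisy_numeric(5.0, sigma=2.0, nu=0.5, rng=rng))
print(nominal_distribution("b", freqs, nu=1.0))
print(noisy_nominal("b", freqs, nu=1.0, rng=rng))
```

With $\nu = 0$ both functions return the original value unchanged: the normal distribution collapses onto $x$ and $\alpha = 0$ makes $p' = t$. As $\nu$ grows, $\alpha$ approaches 1 and the nominal value is sampled almost entirely from the overall frequencies $p$.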

[Figure: Vehicle dataset]

Methodology:
• 12 datasets from the UCI repository
• 50% Train, 25% Validation, 25% Test
• Classification:
  • J48, Naive Bayes, Logistic Regression and kNN.
  • Reference method: majority class.
  • Classification error
• Regression:
  • Linear Regression, M5P, kNN, SMOreg.
  • Reference method: ZeroR.
  • Relative absolute error

[Figure: Creditg dataset]

[Figure: Abalone dataset]

Methods:
• ValNoNoise: For all the estimated values of noise, we select the method that obtains the best performance without noise.
• ValBestArea: For all the estimated values of noise, we select the method that obtains the best performance in the validation dataset by averaging all noise levels (i.e., the curve with lowest area in the context plot).
• ValNoiseOpt: For each of the estimated values of noise, we select the method with best performance in the validation dataset with that value of noise.
• Idealistic: For each of the estimated values of noise, we select the model with best performance in the test dataset with that value of noise. This strategy is not realistic (it cannot be done in practice). We just include this as a reference.
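The four selection strategies can be written compactly once the error of every model at every noise level is available. The sketch below is illustrative only: the function names and the error values are hypothetical, errors are assumed to be "lower is better", index 0 is assumed to be the noise-free level, and the area under a curve is approximated by the mean error over equally spaced noise levels.

```python
def best_at(errors, i):
    """Model with the lowest error at noise index i (ties: alphabetical)."""
    return min(sorted(errors), key=lambda m: errors[m][i])


def val_no_noise(val, n_levels):
    """ValNoNoise: the best model without noise, used at every level."""
    return [best_at(val, 0)] * n_levels


def val_best_area(val, n_levels):
    """ValBestArea: the model with the lowest average validation error."""
    best = min(sorted(val), key=lambda m: sum(val[m]) / len(val[m]))
    return [best] * n_levels


def val_noise_opt(val, n_levels):
    """ValNoiseOpt: the best validation model at each noise level."""
    return [best_at(val, i) for i in range(n_levels)]


def idealistic(test, n_levels):
    """Idealistic: the best test model at each level (reference only)."""
    return [best_at(test, i) for i in range(n_levels)]


# Hypothetical errors at three noise levels (index 0 = no noise).
val = {"A": [0.10, 0.20, 0.40], "B": [0.15, 0.18, 0.25]}
test = {"A": [0.12, 0.17, 0.45], "B": [0.14, 0.19, 0.27]}

for strategy, data in [(val_no_noise, val), (val_best_area, val),
                       (val_noise_opt, val), (idealistic, test)]:
    print(strategy.__name__, strategy(data, 3))
```

The first two strategies commit to a single model for all noise levels, while ValNoiseOpt and Idealistic may switch models from one level to the next. Only Idealistic looks at the test errors, which is why it serves as a reference and not as a usable method.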

• The performance results are normalised by the Idealistic performance: (idealistic/method)*100
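As a small worked example of this normalisation, assuming error measures where lower is better, so that a method can reach at most 100: the function name, the zero-error convention and the numbers below are hypothetical.

```python
def normalised_performance(idealistic_error, method_error):
    """Return (idealistic / method) * 100.

    Assumption for this sketch: if the method error is 0 the method is
    as good as the idealistic selection, so 100 is returned.
    """
    if method_error == 0:
        return 100.0
    return idealistic_error / method_error * 100.0


# Hypothetical test errors of one strategy and of the idealistic
# selection at three noise levels.
idealistic_errors = [0.12, 0.17, 0.27]
method_errors = [0.12, 0.19, 0.27]

scores = [normalised_performance(i, m)
          for i, m in zip(idealistic_errors, method_errors)]
print(scores)
print(sum(scores) / len(scores))  # average over the noise levels
```

A score of 100 means the strategy picked a model as good as the idealistic choice at that noise level; lower scores measure how much is lost by selecting on validation data.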

1. In this paper we have analysed the case when:
   • the noise level depends on a context
   • we know the context in advance
2. The model that best behaves for each noise level situation is used.
3. CONS: It takes some time. PROS: Selection/application of the best model is straightforward, with results close to an idealistic process.
4. As future work we plan to work on different noise models, derived from scenarios with real operating conditions and with real sensors.
   • Each attribute will have a different operating range, but the context would still be given by a single parameter (e.g., temperature, or the number of measurements performed).

Thanks for your attention.
