On the Influence of Input Noise On the Influence of Input Noise on - - PowerPoint PPT Presentation

on the influence of input noise on the influence of input
SMART_READER_LITE
LIVE PREVIEW

On the Influence of Input Noise On the Influence of Input Noise on - - PowerPoint PPT Presentation

IASTED-AIA2004 Feb. 16-18, 2004 On the Influence of Input Noise On the Influence of Input Noise on a Generalization Error Estimator on a Generalization Error Estimator (1,2) Masashi Sugiyama (2) Yuta Okabe (2) Hidemitsu Ogawa (1)


slide-1
SLIDE 1
  • Feb. 16-18, 2004

IASTED-AIA2004

On the Influence of Input Noise

  • n a Generalization Error Estimator

On the Influence of Input Noise

  • n a Generalization Error Estimator

Masashi Sugiyama

(1,2)

Yuta Okabe

(2)

Hidemitsu Ogawa

(2) (1) Fraunhofer FIRST-IDA, Berlin, Germany (2) Tokyo Institute of Technology, Tokyo, Japan

slide-2
SLIDE 2

2

From , obtain a good approximation to

Regression Problem Regression Problem

L L

:Underlying function :Learned function :Training examples (noise)

slide-3
SLIDE 3

3

Typical Method of Learning Typical Method of Learning

Kernel regression model Ridge estimation

:Parameters to be learned :Kernel function (e.g., Gaussian) :Ridge parameter (model parameter)

slide-4
SLIDE 4

4

Model Selection Model Selection

is too small is appropriate is too large Underlying function Learned function

Choice of the model is crucial for obtaining good learned function !

slide-5
SLIDE 5

5

Generalization Error Generalization Error

Determine the model so that an estimator of the unknown generalization error is minimized.

For model selection, we need a criterion that measures ‘closeness’ between and : Generalization error

slide-6
SLIDE 6

6

Noise in Input Points Noise in Input Points

Previous research mainly deals with the cases where noise is included only in output values. However, noise is sometimes included also in input points, e.g.,

Input points are measured: Signal/image recognition, robot motor control, and bioinformatic data analysis. Input points are estimated: Time series prediction of multiple-step ahead.

slide-7
SLIDE 7

7

Noise in Input Points (cont.) Noise in Input Points (cont.)

We want to measure output values at But measurement is actually done at unknown Output noise is then added

Input noise Output noise

slide-8
SLIDE 8

8

Aim of Our Research Aim of Our Research

So far, it seems that model selection in the presence of input noise has not been well studied yet. We investigate the effect of input noise on a generalization error estimator called the subspace information criterion (SIC).

Sugiyama & Ogawa (Neural Computation, 2001) Sugiyama & Müller (JMLR, 2002)

slide-9
SLIDE 9

9

  • : A reproducing kernel Hilbert space

We assume We shall measure the generalization error by

Generalization Error in RKHS Generalization Error in RKHS

:Norm :Expectation over output noise

slide-10
SLIDE 10

10

Setting Setting

Kernel regression model Linear estimation

:Parameters to be learned :Kernel function (e.g., Gaussian) :Learning matrix

slide-11
SLIDE 11

11

Subspace Information Criterion Subspace Information Criterion

In the absence of input noise, SIC is an unbiased estimator of : We investigate how the unbiasedness

  • f SIC is affected by input noise.

:Pseudo inverse of :Inner product

Sugiyama & Ogawa (Neural Computation, 2001) Sugiyama & Müller (JMLR, 2002)

slide-12
SLIDE 12

12

Unbiasedness of SIC in the Presence of Input Noise Unbiasedness of SIC in the Presence of Input Noise

In the presence of input noise,

:Noiseless input points :Noisy input points

Unbiasedness of SIC does not generally hold in the presence of input noise.

slide-13
SLIDE 13

13

Effect of Small Input Noise Effect of Small Input Noise

When is continuous, small input noise varies the output value only slightly, i.e., is small. Therefore, we expect that the unbiasedness of SIC is not severely affected ( is small) by small input noise.

:Noiseless input points :Noisy input points

slide-14
SLIDE 14

14

Effect of Small Input Noise (cont.) Effect of Small Input Noise (cont.)

However, we can show that, for some learning matrix , it holds that as for all . This implies that, for some , the unbiasedness of SIC is heavily affected even when input noise is very small.

:Input noise

slide-15
SLIDE 15

15

Theorem Theorem

Let be the matrix norm defined by If the learning matrix satisfies then as for all .

:Noiseless input points :Noisy input points

slide-16
SLIDE 16

16

Ridge Estimation Ridge Estimation

Ridge estimation We can prove that ridge estimation satisfies Therefore, SIC with ridge estimation is robust against small input noise.

:Ridge parameter :Identity matrix

slide-17
SLIDE 17

17

  • :Gaussian RKHS

Learning target function : sinc function Training examples :

  • Ridge estimation is used for learning.

Simulation Simulation

slide-18
SLIDE 18

18

Result (No Input Noise) Result (No Input Noise)

:Ridge parameter

SIC is surely unbiased without input noise

slide-19
SLIDE 19

19

Result (Small Input Noise) Result (Small Input Noise)

:Ridge parameter

SIC is still almost unbiased with small input noise

slide-20
SLIDE 20

20

Result (Large Input Noise) Result (Large Input Noise)

:Ridge parameter

SIC is no longer reliable with large input noise

slide-21
SLIDE 21

21

Conclusions Conclusions

Effect of input noise on SIC. In some cases, the unbiasedness of SIC is heavily affected even by small input noise. A sufficient condition for unbiasedness. Ridge estimation satisfies this condition. Experiments: SIC is still almost unbiased for small input noise. Future work: Accurately estimate the generalization error when input noise is large.