Overfitting Can Happen Overfitting Can Happen Overfitting Can - - PowerPoint PPT Presentation

overfitting can happen overfitting can happen overfitting
SMART_READER_LITE
LIVE PREVIEW

Overfitting Can Happen Overfitting Can Happen Overfitting Can - - PowerPoint PPT Presentation

Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen 30 25 test 20 error 15 (boosting stumps on heart-disease dataset) train 10 5 0 1 10 100 1000 # rounds


slide-1
SLIDE 1

Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen Overfitting Can Happen

5 10 15 20 25 30 1 10 100 1000

test train error # rounds

(boosting “stumps” on heart-disease dataset)

slide-2
SLIDE 2

Actual Typical Run Actual Typical Run Actual Typical Run Actual Typical Run Actual Typical Run

10 100 1000 5 10 15 20

error test train ) T # of rounds ( (boosting C4.5 on “letter” dataset)

  • test error does not increase, even after 1000 rounds
  • (total size > 2,000,000 nodes)
  • test error continues to drop even after training error is zero!

# rounds 5 100 1000 train error 0.0 0.0 0.0 test error 8.4 3.3 3.1

  • Occam’s razor wrongly predicts “simpler” rule is better
slide-3
SLIDE 3

Empirical Evidence: The Margin Distribution Empirical Evidence: The Margin Distribution Empirical Evidence: The Margin Distribution Empirical Evidence: The Margin Distribution Empirical Evidence: The Margin Distribution

  • margin distribution

= cumulative distribution of margins of training examples

10 100 1000 5 10 15 20

error test train ) T # of rounds (

  • 1
  • 0.5

0.5 1 0.5 1.0

cumulative distribution 1000 100 margin 5

# rounds 5 100 1000 train error 0.0 0.0 0.0 test error 8.4 3.3 3.1 % margins ≤ 0.5 7.7 0.0 0.0 minimum margin 0.14 0.52 0.55