Diagnosing ML System
Shih-Yang Su Virginia Tech
ECE-5424G / CS-5824 Spring 2019
Diagnosing ML System Shih-Yang Su Virginia Tech ECE-5424G / - - PowerPoint PPT Presentation
Diagnosing ML System Shih-Yang Su Virginia Tech ECE-5424G / CS-5824 Spring 2019 Today's Lectures Advice on how getting learning algorithms to different applications How to fix your learning algorithm Basically ZERO MATH Debugging a
Shih-Yang Su Virginia Tech
ECE-5424G / CS-5824 Spring 2019
Source: Andrew Ng
Source: Andrew Ng
Source: Andrew Ng
Source: reddit?
Source: reddit?
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Source: Andrew Ng
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit Overfit Just right Source: Andrew Ng
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit Overfit Just right
Source: Andrew Ng
Procedure: Step 1. Train on training set Step 2. Evaluate validation error Step 3. Pick the best model based on Step 2. Step 4. Evaluate the test error
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit Overfit Just right Source: Andrew Ng
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit High bias Overfit High Variance Just right Source: Andrew Ng
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit High bias Too simple Overfit High Variance Too Complex Just right Source: Andrew Ng
Price ($) Size (ft) Price ($) Size (ft) Price ($) Size (ft) Underfit High bias Too simple Too much regularization Overfit High Variance Too Complex Too little regularization Just right Source: Andrew Ng
Loss Degree of Polynomial Source: Andrew Ng
Loss Degree of Polynomial High bias High Variance
Loss λ Source: Andrew Ng
Loss High bias High Variance Source: Andrew Ng λ
m=1 m=3 m=5 m=2 m=4 m=6
m=1 m=3 m=5 m=2 m=4 m=6
Underfit High bias Overfit High Variance
Underfit High bias
Does adding more data help?
Price ($) Size (ft)
Underfit High bias
Does adding more data help?
Price ($) Size (ft)
Underfit High bias
Does adding more data help?
Price ($) Size (ft)
Does adding more data help?
Price ($) Size (ft) Price ($) Size (ft)
More data doesn't help when your model has high bias
Does adding more data help?
Overfit High Variance Price ($) Size (ft)
Does adding more data help?
Overfit High Variance Price ($) Size (ft)
Does adding more data help? More data is likely to help when your model has high variance
Price ($) Size (ft) Price ($) Size (ft)
Analyze your model before you act