Basic recipe for machine learning: High bias? High variance?

Data Science
Author

Benjamin DK Luong

Published

September 15, 2019

How to check for High Bias, High Variance:

  • High bias (underfit) means the model doesnt get trained enough
  • High variance (overfit) means the model is overtrained
Case 1 Case 2 Case 3 Case 4
Train error 1% 15% 15% 0.5%
Test error 11% 16% 30% 1%
High variance: The model memorizes the training set. It performs badly on the test set High bias: The model doesnt lean enough High variance and High bias: The model doesnt learn enough, and performs badly on the test set Low variance and Low bias: This is where we want the model to be

Treatments

  • First, we check for high bias (underfit). If YES (high train error), we make bigger network, train more layers, etc, then retrain the model. If NO, we have low train error, we move to the next step.
  • Next, we check for high variance (overfit). If YES (high test error), we add more data, dropout, regulations, etc, then retrain the model. If NO, we are done.