Big Data

Overfitting and how to avoid it

What is overfitting?

In machine learning you’re usually trying to predict outcomes for values that you’ve never seen before based on training data that you have seen and know about. 

So overfitting is basically when your model is trained so specific on the training dataset that predictions are bad for data that the model has never seen before.