Regression: Guided Project

Regression: Guided Project


Hello My question is why are they using a 50/50 split for the the training data and not 70/30.

Hi @rm2966:

In this case the dataset is considered small <10,000 entries, thus a 50/50 split is used so that there will be sufficient data to test on . If there are >10,000, usually a 70/30 or 80/20 split is used since there is sufficient entries for testing