Regression: Guided Project
Hello My question is why are they using a 50/50 split for the the training data and not 70/30.
Regression: Guided Project
Hi @rm2966:
In this case the dataset is considered small <10,000
entries, thus a 50/50 split is used so that there will be sufficient data to test on . If there are >10,000
, usually a 70/30 or 80/20 split is used since there is sufficient entries for testing