After two days of struggle with this project I am ready to give up. I’ve come a long way, but I really do not understand these extreme values. Like how does anyone get an rmse of 10^15 big haha.
I hope there will be a great lesson here, and I hope there is someone there to understand my slightly messy code
Predicting house sale prices.ipynb (762.2 KB)
By the way I found a small mistake in the solutions:
at the end they are doing this:
shuffled_df = df.sample(frac=1, )
train = df[:1460]
test = df[1460:]
It’s like this shuffled_df
is not being used
Click here to view the jupyter notebook file in a new tab