I’m working on the Star Wars survey overview. At the step that removes rows where RespondentID is NaN, my data looked a bit odd. I checked the dataframe info before cleaning and again afterwards to compare, and the output didn’t change: the RespondentID column shows the same 1186 non-null values before and after cleaning.
However, according to the introduction, the survey received 835 responses in total.
I was using the code below:
> #checking the dataframe info
> print(star_wars.info())
>
> #removing any rows where RespondentID is NaN
> star_wars = star_wars[star_wars['RespondentID'].notnull()]
> print('\n')
> #checking the dataframe after removal
> print(star_wars.info())
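One check that might narrow this down is to compare the total row count and the number of missing IDs before and after the filter. Dropping only the rows whose RespondentID is null cannot change that column's non-null count, so that number alone does not show whether any rows were removed. Below is a minimal sketch of the check; it uses a small made-up DataFrame as a stand-in for the real file, and the `answer` column and the ID values are hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the survey data: one missing ID and three real ones.
star_wars = pd.DataFrame({
    "RespondentID": [np.nan, 3292879998.0, 3292879538.0, 3292765271.0],
    "answer": ["Response", "Yes", "No", "Yes"],  # made-up column for illustration
})

# Total rows and missing IDs before the filter.
rows_before = len(star_wars)
missing_before = int(star_wars["RespondentID"].isnull().sum())

# Same filter as in the quoted code: keep rows whose RespondentID is present.
cleaned = star_wars[star_wars["RespondentID"].notnull()]

# Total rows and missing IDs after the filter.
rows_after = len(cleaned)
missing_after = int(cleaned["RespondentID"].isnull().sum())

print(f"rows: {rows_before} -> {rows_after}")
print(f"missing RespondentID: {missing_before} -> {missing_after}")
```

If the row count drops while the non-null count stays the same, the filter is working and the difference is only in the total number of entries.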
This is my notebook file:
Star_wars_survey.ipynb (42.8 KB)
Could anyone help me understand why this happens?
Thank you in advance for your time.