Hi all,
I’m working on Star Wars survey overview. In the step of removing NaN in the RespondentID row, I found my data a bit odd. I checked the dataframe before cleaning and compare the result of cleaning by checking it again. I found that the input didn’t change. The RespondentID mentioned the same 1186 non-null value before and after cleaning.
However based on the introduction, the total 835 responses received from survey.
I was using the code below:
> #checking the dataframe info
> print(star_wars.info())
>
> #removing any rows where RespondentID is NaN
> star_wars = star_wars[star_wars['RespondentID'].notnull()]
> print('\n')
> #checking the dataframe after removal
> print(star_wars.info())
This is my notebook file:
Star_wars_survey.ipynb (42.8 KB)
Could anyone tell me the insight?
Thank you in advance for your time.
Best regards,
Click here to view the jupyter notebook file in a new tab