The workflow mentioned in the exercise tutorial is simply an approach for completing a particular task. Here, the task is to get the sorted distribution of the values, as percentages, for
ad_created , and
To explain, I'll work through only one column; you can apply the same procedure to the rest of the columns.
We know that all of these columns hold strings. So, for the
date_crawled column, we need the distribution of the dates.
First, we have to extract only the date part from the string in each row, with:
autos['date_crawled'].str[:10]  # the date occupies the first 10 characters (indices 0 to 9)
Then, you need the relative frequency of each date, which you can get with:
.value_counts(normalize=True, dropna=False)  # normalize=True returns fractions that sum to 1 (multiply by 100 for percentages); normalize=False returns raw counts. dropna=False also counts missing values.
So, now the code will be:
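As a minimal sketch of those two steps chained together (the sample rows below are made up so the snippet runs on its own, and the name crawled_share is just for illustration; in the exercise, autos is already loaded from the CSV):

```python
import pandas as pd

# Hypothetical sample rows standing in for the real autos DataFrame.
autos = pd.DataFrame({
    "date_crawled": [
        "2016-03-26 17:47:46",
        "2016-04-04 13:38:56",
        "2016-03-26 18:57:24",
        "2016-03-12 16:58:10",
    ]
})

# Keep the first 10 characters (YYYY-MM-DD), then compute relative frequencies.
crawled_share = (
    autos["date_crawled"]
    .str[:10]
    .value_counts(normalize=True, dropna=False)
)
print(crawled_share)
```

At this point the result is ordered by frequency (most common date first), not by date.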
Next, you have to sort the distribution in ascending order of date, which you can achieve with
.sort_index()  # sorts the result by its index (the dates), not by the frequencies
So, the final code will be:
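Putting all three steps together, a sketch could look like this (again with made-up sample rows in place of the real dataset; crawled_sorted is an illustrative name):

```python
import pandas as pd

# Hypothetical sample rows standing in for the real autos DataFrame.
autos = pd.DataFrame({
    "date_crawled": [
        "2016-03-26 17:47:46",
        "2016-04-04 13:38:56",
        "2016-03-26 18:57:24",
        "2016-03-12 16:58:10",
    ]
})

crawled_sorted = (
    autos["date_crawled"]
    .str[:10]                                    # keep only YYYY-MM-DD
    .value_counts(normalize=True, dropna=False)  # relative frequency per date
    .sort_index()                                # order by date, earliest first
)
print(crawled_sorted)
```

Because the dates are in YYYY-MM-DD form, sorting the strings alphabetically gives the same order as sorting them chronologically.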
Now you can repeat the same steps on the rest of the columns.
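If you'd rather not copy and paste the chain for each column, one option is to wrap it in a small helper and loop over the column names. This is only a sketch: date_distribution is a hypothetical helper name, the sample data is made up, and the column list should be adjusted to whichever date columns you are exploring.

```python
import pandas as pd

def date_distribution(df, column):
    """Relative frequency of each date in a string column, sorted by date."""
    return (
        df[column]
        .str[:10]
        .value_counts(normalize=True, dropna=False)
        .sort_index()
    )

# Hypothetical sample rows standing in for the real autos DataFrame.
autos = pd.DataFrame({
    "date_crawled": [
        "2016-03-26 17:47:46",
        "2016-04-04 13:38:56",
        "2016-03-26 18:57:24",
        "2016-03-12 16:58:10",
    ],
    "ad_created": [
        "2016-03-26 00:00:00",
        "2016-04-04 00:00:00",
        "2016-03-25 00:00:00",
        "2016-03-12 00:00:00",
    ],
})

for column in ["date_crawled", "ad_created"]:
    print(column)
    print(date_distribution(autos, column))
```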
Hope this helps!