I am having some conceptual issues regarding slide 5 of this guided project:
I believe the purpose of the exercise is to collect all re-used words in a set, and determine how many times words are repeated (to ultimately determine how many times similar questions are asked).
I seem to be able to grasp everything up until the point where I am asked to do the following:
" If the length of
split_question is greater than
0 , divide
match_count by the length of
Conceptually I am struggling to understand the purpose of this code. I see that we determining whether a word in a question is in our set, if it is we add 1 to our match_count. If not the word goes on to get added to the set and will be matched (with the relative increase in the match_count) if this word is in the next question and so on.
I am struggling to wrap my head around the code that follows from the above instruction:
'if len(split_question) > 0:
match_count /= len(split_question)
If someone could please explain this to me conceptually: what is the purpose of doing this? What does it achieve etc?
I would be forever grateful for any light that is shed on this predicament I find myself in!