This is in reference to page 8: Relations Between Tags of the Popular Data Science Questions Guided Project. https://app.dataquest.io/m/469/guided-project%3A-popular-data-science-questions/8/relations-between-tags
Here goes my question:
For finding tag relationships, does it make sense to calculate and visualize a Pearson correlation coefficient for the relations_most_used dataframe found in the solutions. Why or why not?
# Calculate the Pearson correlation coefficients between tags
most_used_correlate = relations_most_used.corr()
# Delete all values where the tag relates to itself
for i in range(most_used_correlate.shape[0]):
correlate.iloc[i,i] = pd.np.nan
# Plot the heatmap
sns.heatmap(correlate, cmap='Greens')