While working on a next ‘guided project’ (Data Analyst in Python), I was wondering whether there is any standard or best practice for showing a data dictionary for data that you include in your Jupyter notebook?
Typically, after importing some data (e.g. into a pandas dataframe), you’ll do some initial exploration e.g. by showing the first 5 or 10 lines. To understand what I see though, and also to be able to later refer back to it, around this time I would also want to see a description of the meaning of all columns.
I can think of multiple ways how to show such data dictionary:
- Add a markdown cell with this info
- Create the data dictionary itself in a file, read it in as another dataframe, then display that dataframe
- Refer to the data dictionary (if there is one) on the website where you got the data from in the first place
I was wondering whether there is any standard or best practice for doing this?