Now that all the technical issues are resolved, I’m back with my review
Your project looks very nice, well-organized and with good markdown explanations. The subheadings are descriptive, all the links to the original data are present. Also, I didn’t even encounter any typos, which rarely happens with the projects Congratulations on having done a good job!
Below are some suggestions for your consideration:
- When you place a link, it’s better to make its text not very long. For example, in case of your 2nd link in the introduction, it’s enough to leave “raw data” as the text of the link, not the whole sentence.
- A good practice is to use a uniform style of quote marks for the string data in the code cells throughout the project: or only single, or only double quote marks.
- For this piece of code, repeated several times in your project:
you can consider using a for-loop.
- I would recommend you to do all your guided projects on your local computer, with Anaconda with the latest version of Python, rather than on DQ. The DQ platform is now in process of converting all the missions into new Python, but it’s still ongoing. In this case, for example, for the piece of code below:
ax.tick_params(right='off', left='off', bottom='off', top='off')
you can use the new syntax:
False instead of
off. It’s always better to learn and apply the newest possible things.
- For very long code lines, it’s always better to divide them into several code lines, to improve their readability. For example, instead of:
stem_cats = ['Engineering', 'Computer Science', 'Psychology', 'Biology', 'Physical Sciences', 'Math and Statistics']
you can use:
stem_cats = ['Engineering', 'Computer Science',
'Psychology', 'Biology', 'Physical Sciences',
'Math and Statistics']
It’s an especially good idea for graphs, where you can put each argument on a new row. Like this:
- The code cell : you might think of one giant for-loop for all the 3 for-loops here (those for each column).
- In general, you can also consider combining all the code cells from  to  into one unique cell. Practically it means using only the last of them (the code cell ), with all the modifications applied, with all the intermediate technical explanations (like setting x-axis, adding a horozontal line, etc.) added inside this giant cell as comments. I know it’s a project for learning and practicing all these things, and my own same project here is a disaster But it’s always a good idea to optimize your code as much as possible (and of course, I’m going to return to my project as well, and to introduce all these improvements). Especially if you’re planning to use this project in your portfolio.
- In the conclusion section, it’s better to add more specific conclusions about the gender gap in various spheres (or categories of spheres).
I hope my feedback was helpful. Good luck with your future projects, and happy learning!