Hi @zackthemonk!
Thanks for sharing your first project here, and welcome to the community! With regards to your bold question, the links that you click on to open the notebook in your browser are automatically added by the forum shortly after you upload an ‘.ipynb’ file.
I’m unfamiliar with the helper
package that you import at the beginning of your project. @Elena_Kosourova, is that something new that DQ rolled out for early projects? Could either of you point me towards the documentation?
To me, it looks like you have got a handle on the basics of data manipulation using that helper package. It is always helpful to include a few more comments about what parts of the data you are accessing or manipulating in each set of cells. For example, I think that In [72]
's purpose is to drop the longer expository paragraph from the rest of the data generated by data_from_url()
, but to reach that conclusion, I had to work through the code myself.
Depending on where you are on your data science journey, you will find that there are a number of other functions that can help you present your notebook more crisply. For example, if your data
variable is a DataFrame
, you could use the function data.head(3)
to perform the same function as your for
loop in In [71]
(it also nicely formats the output).
In terms of data presentation, consider the difference between the two histograms of the same data I have pasted below (image source).
In the left histogram, the bins are small enough that we start to visualize noise in the data. In the right one, the bins are larger and help convey a clearer picture (there is a lot to say about how to decide the appropriate bin size… that is another discussion entirely. The histograms come from an article that discusses this topic).
Now that you have the data read into your notebook, take some time to think about other interesting ideas you might like to explore. One that comes to mind is what does this data look like "in context?"
- How many helicopter escapes per 1,000 prisoners in each country?
- Are helicopter escapes more common than other escapes (i.e. no vehicle, automobile, etc.)?
- Are helicopter escapes more common in urban or rural areas?
Then you could start hypothesizing why you see certain patterns in the data. For example:
- “I believe Parisian prisons are more likely to have helicopter escapes because most buildings must be shorter than the Eiffel Tower.”
- “I believe helicopter escapes will be more common in urban settings because helicopters are more common in the city.”
Hope these questions help get you thinking about where to take your project next. As you work through more lessons and guided projects, these sorts of questions will naturally come to your mind!
JKE