Project 1
Overview
The goal of Project 1 is for you to demonstrate your ability to analyze a dataset and tell an interesting, evidence-based story about it. You will perform exploratory analysis on a dataset, establish some interesting questions about the data, use data analysis techniques to answer those questions, and communicate the results in a coherent narrative.
Datasets
Below are the datasets you may choose from; however, each dataset may be used by only one student. To claim a dataset, send me your order of preference for all of the datasets. Datasets will be assigned on a first-come, first-served basis.
- movies (ratings, tags)
- commutes in cities in the US
- Kickstarter project data
- Hillary Clinton's email
- AirBnB data
- online retail
- occupancy data
- activities of daily living
- employment data
- crime data
You will only be given a subset of the dataset for reasons that will become apparent in Project 2.
Project constraints
Your project will be evaluated on the following criteria:
- You've asked and answered five interesting questions about the data using data analysis techniques (or extensions of them) that we covered in class or in the readings
- You tell an interesting and compelling story about the dataset using the results of your analyses
- All processing steps are contained within the submitted Jupyter notebook and are well documented; I must be able to reproduce your steps
What constitutes "interesting" is subjective. In general, consider it something that is non-obvious, unexpected, or not previously well supported. We considered interesting and uninteresting questions/analyses in class, so think back to that.
See the rubric on the submission page for a breakdown of how these constraints will be graded.
Submissions
See this Canvas page to submit and see the rubric.