Project Definiton
The final definition of my project:
I have 2 datasets :
1- Police stops in Los Angeles City, CA between 2013 and 2015 (350K+ rows)
1- Police stops in Columbus City, OH between 2013 and 2015 (150K+ rows)
I aim to make a comparative analysis between two cities on below:
- A significant correlation between driver/police behavior & race/gender.
- A significant correlation between driver/police behavior & race/gender.
Early Attempts
I have 2 datasets :
1- Police stops in Los Angeles City between 2013 and 2015 (350K+ rows)
The data has the information of driver race, driver, age, driver gender; while no info regarding the police officer.
Also, it has the information of violation, stop outcome, search outcome, contraband found or not, arrested or not.
2- Weather dataset of Las Angeles City between 2013 and 2015
The data has the information of temperature, humidity, precipitation, wind speed, wind direction info.
At the beginning, I aimed to elaborate on this datasets to find below:
- A significant correlation between driver/police behavior & race/gender.
- A significant correlation between driver/police behavior & weather conditions.
Here are my exploratory work & findings:
- Visually, I could not find any correlation between driver/police behavior & weather conditions.
- Even though there is a correlation between driver/police behavior & age, it is not enough. My purpose it to capture the racism or sexism!
- And, in Los Angeles, I can say that there is no significant evidence that race has an impact on driver behaviors and/or police behaviors.
You can find the correlation maps of my all data. I also circled the non-correlated pairs which I was expecting some correlation to dig in!
Hiç yorum yok:
Yorum Gönder