The GitHub repository for my capstone project can be found here.
Make Effective Data Visualization
The gist for my final project submission can be found here.
Identify Fraud from Enron Email
Identify Fraud from Enron Email D. Chris Young Summarize for us the goal of this project and how machine learning is useful in trying to accomplish it. As part of your answer, give some background on the dataset and how it can be used to answer the project question. Were there any outliers in the […]
Analyzing the NYC Subway Dataset
Section 1. Statistical Test 1.1 Which statistical test did you use to analyze the NYC subway data? Did you use a one-tail or a two-tail P value? What is the null hypothesis? What is your p-critical value? The Mann-Whitney U test was utilized to analyze the NYC subway data. A two-tailed P value was used […]
Data Wrangle OpenStreetMap Data
OpenStreetMap Area Phoenix, AZ United States Download source: https://mapzen.com/data/metro-extracts Problems encountered in the map The first step was to explore the structure of the OSM file. The tags.py script iterates through the file and counts the items for each start tag, enumerates the keys for the element attributes and iterates the âkâ attribute keys for […]