Identify Fraud from Enron Email

D. Chris Young. Summarize for us the goal of this project and how machine learning is useful in trying to accomplish it. As part of your answer, give some background on the dataset and how it can be used to answer the project question. Were there any outliers in the […]

Analyzing the NYC Subway Dataset

Section 1. Statistical Test 1.1 Which statistical test did you use to analyze the NYC subway data? Did you use a one-tail or a two-tail P value? What is the null hypothesis? What is your p-critical value? The Mann-Whitney U test was utilized to analyze the NYC subway data. A two-tailed P value was used […]
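As a rough illustration of the test named in this excerpt, here is a minimal pure-Python sketch of a two-tailed Mann-Whitney U test. It is not the post's own code. The function names, the sample values, and the 0.05 threshold are assumptions made for the example, and the p-value uses the plain normal approximation with no tie or continuity correction, so a statistics library may report slightly different numbers.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their rank positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_u(sample_a, sample_b):
    """Return (U, two-tailed p) using the normal approximation.

    Assumption: no tie correction and no continuity correction are applied.
    """
    n1, n2 = len(sample_a), len(sample_b)
    ranks = average_ranks(list(sample_a) + list(sample_b))
    rank_sum_a = sum(ranks[:n1])
    u_a = rank_sum_a - n1 * (n1 + 1) / 2
    u_b = n1 * n2 - u_a
    u = min(u_a, u_b)
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean_u) / sd_u
    # Two-tailed p-value: 2 * (1 - Phi(|z|)) == erfc(|z| / sqrt(2)).
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2))
    return u, p_two_tailed


if __name__ == "__main__":
    # Hypothetical samples, not data from the NYC subway dataset.
    u, p = mann_whitney_u([1, 2, 3], [4, 5, 6])
    alpha = 0.05  # assumed p-critical value for illustration
    print(u, p, p < alpha)
```

The null hypothesis for this test is that the two samples come from the same distribution; a two-tailed p-value below the chosen p-critical value would lead to rejecting it.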

Data Wrangle OpenStreetMap Data

OpenStreetMap Area: Phoenix, AZ, United States (download source: https://mapzen.com/data/metro-extracts ). Problems encountered in the map: The first step was to explore the structure of the OSM file. The tags.py script iterates through the file, counts the items for each start tag, enumerates the keys of the element attributes, and iterates over the “k” attribute keys for […]
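The exploration step described in this excerpt could be sketched roughly as follows. This is not the actual tags.py, which is not shown here. The function name `count_tags` and the sample XML are hypothetical, and the sketch only assumes the usual OSM layout in which `<tag>` elements carry `k` and `v` attributes.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter, defaultdict


def count_tags(source):
    """Stream an OSM XML file and summarize its structure.

    Returns three things: a count per element name, the set of attribute
    names seen per element name, and a count per "k" value on <tag> elements.
    """
    tag_counts = Counter()
    attribute_keys = defaultdict(set)
    k_counts = Counter()
    # "start" events fire as each opening tag is read; attributes are
    # already available at that point.
    for _event, elem in ET.iterparse(source, events=("start",)):
        tag_counts[elem.tag] += 1
        attribute_keys[elem.tag].update(elem.attrib.keys())
        if elem.tag == "tag" and "k" in elem.attrib:
            k_counts[elem.attrib["k"]] += 1
    return tag_counts, attribute_keys, k_counts


if __name__ == "__main__":
    # Tiny hypothetical sample standing in for a real metro extract.
    sample = (
        b'<osm><node id="1" lat="33.4" lon="-112.0">'
        b'<tag k="addr:street" v="Main St"/><tag k="amenity" v="cafe"/></node>'
        b'<way id="2"><nd ref="1"/><tag k="amenity" v="school"/></way></osm>'
    )
    tags, attrs, keys = count_tags(io.BytesIO(sample))
    print(dict(tags))
    print({name: sorted(names) for name, names in attrs.items()})
    print(dict(keys))
```

For a full metro extract, a real script would likely also listen for "end" events and clear processed elements to keep memory use bounded; that is omitted here to keep the sketch short.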