This repo contains all data, code and reports for Udacity's data analyst nanodegree.
It is structured around the following projects:
- Testing a perceptual phenomenon using statistics
- Investigating a data set using Python, NumPy and pandas
- Wrangling OpenStreet Map data using Python, SQL and MongoDB
- Exploring and summarizing data using R
- Identiying fraud from Enron email using scikit-learn and NLTK
- Data Visualization in Tableau
Effective data visualization using data driven documents a.k.a. d3.js and dimple.jsA/B testing