Pinned Repositories
berkeley-jupyter-notebook
Jupyter Notebook tips and tricks for the Berkeley Institute for Data Science lecture. http://bids.berkeley.edu/
courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
data-science
Code snippets for data acquisition and organization in data science.
datasciencecoursera
Data scientist toolbox repo
datasharing
The Leek group guide to data sharing
ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
GCD_Assignment
Getting and Cleaning Data - Course assignment repository
Million-Song-Dataset-Graph
Graph Model of the Million Song Dataset
NYT-Elasticsearch
Connect to the NYT Newswire API to get data of published articles. Crawl the article's text and store the results in Elasticsearch for querying.
TwitterStreamingApp
Twitter Streaming Application
andrea-soto's Repositories
andrea-soto/Million-Song-Dataset-Graph
Graph Model of the Million Song Dataset
andrea-soto/TwitterStreamingApp
Twitter Streaming Application
andrea-soto/berkeley-jupyter-notebook
Jupyter Notebook tips and tricks for the Berkeley Institute for Data Science lecture. http://bids.berkeley.edu/
andrea-soto/courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
andrea-soto/data-science
Code snippets for data acquisition and organization in data science.
andrea-soto/datasciencecoursera
Data scientist toolbox repo
andrea-soto/datasharing
The Leek group guide to data sharing
andrea-soto/ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
andrea-soto/GCD_Assignment
Getting and Cleaning Data - Course assignment repository
andrea-soto/NYT-Elasticsearch
Connect to the NYT Newswire API to get data of published articles. Crawl the article's text and store the results in Elasticsearch for querying.
andrea-soto/RepData_PeerAssessment1
Peer Assessment 1 for Reproducible Research
andrea-soto/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
andrea-soto/Twitter-Ingest-Template
Template code to ingest tweets from the API and write to a local directory or to an S3 bucket
andrea-soto/twitter_popularity
Stream tweets to find hashtag pupularity
andrea-soto/W205_Lab6
Lab 6 Submission