My solutions for the Introduction to Data Science Coursera course.
Complete
Twitter sentiment analysis that involves collecting data from the Twitter API and computing sentiment or "mood" scores from the tweets.
Complete
Implementing different relational algebra operations, keyword search using a term document matrix and matrix multiplication all using SQL.
Complete
Using a mock implementation of MapReduce (written in Python) to complete various tasks that are good use-cases for this large-scale data processing programming model. Includes joins, DNA sequence trimming, and matrix multiplication.