A Network analysis of Game of Thrones |
Analyze the network of characters in Game of Thrones and how it changes over the course of the books. |
A New Era of Data Analysis in Baseball |
Use MLB's Statcast data to compare New York Yankees sluggers Aaron Judge and Giancarlo Stanton. |
A Visual History of Nobel Prize Winners |
Explore a dataset from Kaggle containing a century's worth of Nobel Laureates. Who won? Who got snubbed? |
ASL Recognition with Deep Learning |
Build a convolutional neural network to classify images of letters from American Sign Language. |
Analyze Your Runkeeper Fitness Data |
Import, clean, and analyze seven years' worth of training data tracked on the Runkeeper app. |
Bad passwords and the NIST guidelines |
Check which passwords fail to conform to the National Institute of Standards and Technology password guidelines. |
Book Recommendations from Charles Darwin |
Build a book recommendation system using NLP and the text of books like "On the Origin of Species." |
Classify Song Genres from Audio Data |
Rock or rap? Apply machine learning methods in Python to classify songs into genres. |
Comparing Cosmetics by Ingredients |
Process ingredient lists for cosmetics on Sephora then visualize similarity using t-SNE and Bokeh. |
Disney Movies and Box Office Success |
Explore Disney movie data, then build a linear regression model to predict box office success. |
Do Left-handed People Really Die Young? |
Use pandas and Bayesian statistics to see if left-handed people actually die earlier than righties. |
Dr. Semmelweis and the discovery of handwashing |
Reanalyse the data behind one of the most important discoveries of modern medicine: Handwashing. |
Exploring 67 years of LEGO |
In this project we will explore a database of every LEGO set ever built. |
Exploring the Bitcoin cryptocurrency market |
You will explore the market capitalization of Bitcoin and other cryptocurrencies. |
Exploring the evolution of Linux |
Find out about the development of the Linux operating system by exploring its Git repository history. |
Find Movie Similarity from Plot Summaries |
Use NLP and clustering on movie plot summaries from IMDb and Wikipedia to quantify movie similarity. |
Generating Keywords for Google AdWords |
Automatically generate keywords for a search engine marketing campaign using Python. |
Give Life: Predict Blood Donations |
Build a binary classifier to predict if a blood donor is likely to donate again. |
Mobile Games A/B Testing with Cookie Cats |
Analyze an A/B test from the popular mobile puzzle game, Cookie Cats. |
Name Game: Gender Prediction using Sound |
Analyze the gender distribution of children's book writers and use sound to match names to gender. |
Naïve Bees: Deep Learning with Images |
Build a deep learning model that can automatically detect honey bees and bumble bees in images. |
Naïve Bees: Image Loading and Processing |
Load, transform, and understand images of honey bees and bumble bees in Python. |
Naïve Bees: Predict Species from Images |
Build a model that can automatically detect honey bees and bumble bees in images. |
Predict Stock Trends from News Headlines |
Scrape news headlines for FB and TSLA then apply sentiment analysis to generate investment insight. |
Predicting Credit Card Approvals |
Build a machine learning model to predict if a credit card application will get approved. |
Recreating John Snow's Ghost Map |
Recreate John Snow's famous map of the 1854 cholera outbreak in London. |
Reducing Traffic Mortality in the USA |
How can we find a good strategy for reducing traffic-related deaths? |
Risk and Returns: The Sharpe Ratio |
Use pandas to calculate and compare profitability and risk of different investments using the Sharpe Ratio. |
The Android App Market on Google Play |
Load, clean, and visualize scraped Google Play Store data to understand the Android app market. |
The GitHub History of the Scala Language |
Find the true Scala experts by exploring its development history in Git and GitHub. |
The Hottest Topics in Machine Learning |
Use Natural Language Processing on NIPS papers to uncover the trendiest topics in machine learning research. |
TV, Halftime Shows, and the Big Game |
Load, clean, and explore Super Bowl data in the age of soaring ad costs and flashy halftime shows. |
Up and Down With the Kardashians |
Plot Google Trends data to find the most famous Kardashian/Jenner sister. Is it Kim? Kendall? Kylie? |
Which Debts Are Worth the Bank's Effort? |
Play bank data scientist and use regression discontinuity to see which debts are worth collecting. |
Who Is Drunk and When in Ames, Iowa? |
Flex your pandas muscles on breath alcohol test data from Ames, Iowa, USA. |
Who's Tweeting? Trump or Trudeau? |
Build a machine learning classifier that knows whether President Trump or Prime Minister Trudeau is tweeting! |
Word Frequency in Moby Dick |
Use web scraping and NLP to find the most frequent words in Herman Melville's novel, Moby Dick. |