LeondraJames's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
sibylhe/mmm_stan
Python/STAN Implementation of Multiplicative Marketing Mix Model, with deep dive into Adstock (carry-over effect), ROAS, and mROAS
shawlu95/Lookalike-Model
Finding similar, high-valued users based on seed users. The model includes 1805 features using Hive HQL and AWS Redshift.
Xueping/MusaNet
LeondraJames/AdClick_Fraud
Capstone project #2 for the Harvard University Professional Certificate in Data Science
LeondraJames/Customer-Churn-w-Logistic-Regression
Logistic regression built with Spark, Python (PySpark), SQL, and Databricks to predict which customers are at higher risk of churning; the model is then applied to an unseen "new customers" data set.
LeondraJames/Disney-Movies-Box-Office-Hits
Analysis of Disney's top-grossing films (adjusted for inflation) in Python, using regression to attribute film genre to success. The project also uses bootstrap regression to determine confidence intervals for the intercept and coefficients.
LeondraJames/AWSSageMaker_PythonXGBoostTutorial
Python XGBoost model built with Amazon SageMaker, EC2 instances, and S3 buckets, covering data preparation, partitioning, training, tuning, prediction, and evaluation. The project predicts which customers sign up for a financial product at a bank.
LeondraJames/Bikeshare-Exploratory-Analysis
An exploratory analysis of the Kaggle bikeshare data set using linear regression models, which turn out not to be optimal for predicting the number of bikes rented.
LeondraJames/BostonHousingPrices_NeuralNet
My first attempt at implementing a neural network using the Boston housing data set from the MASS library.
LeondraJames/CandyCrushProj
Candy Crush Level Difficulty Analysis
LeondraJames/ChipotleLocations
This is a descriptive and exploratory data analysis project from DataCamp which aims to explore real data on every Chipotle location to identify franchising opportunities. The goal is to scout out the next Chipotle location using interactive maps (e.g., leaflet) and external data to compare proposed locations on several important factors, such as proximity to current Chipotle locations, the distribution of the state's population, and the distance from interstates and tourist attractions.
LeondraJames/DataKind-Project-7.21
Data Visualizations
LeondraJames/Degrees-That-Pay-You-Back
A cluster analysis using the k-means algorithm to determine which degrees are likely to yield which levels of income, based on historical data.
LeondraJames/DS-Bootcamp-Capstone-Mondayball
Data Science & Machine Learning Data Capstone based on Moneyball dataset
LeondraJames/Film-Similarity-NLP-with-KMeans-Hierarchical-Clustering
Used NLP techniques (tokenization, stemming, vectorization for TF-IDF) and clustering algorithms (k-means and hierarchical clustering) to mine the "similarities" between films based on their plots provided by IMDb and Wikipedia. The dataset contains the titles of the top 100 movies on IMDb.
LeondraJames/First-KNN-Attempt---ISLR-Caravan-Dataset
This is my first attempt at a KNN model, where I attempt to classify the purchase of caravan insurance in the Caravan data set (ISLR package).
LeondraJames/HarvardXCapstone---Film-Recommender-System
Capstone Submission #1 for the Harvard University Professional Certificate in Data Science.
LeondraJames/HeartDisease
"What Your Heart Is Telling You" Logit Model
LeondraJames/Hyundai-Cruise-Ship-Crew-Prediction
Predicting the number of crew needed to man a Hyundai cruise ship from information such as the number of cabins and passengers, using linear regression. Leveraged SQL and PySpark.
LeondraJames/LoanPaymentPrediction_SVM
My first attempt at building an SVM model, optimizing the cost and gamma parameters of a Gaussian kernel using grid search.
LeondraJames/MarketBasketAnalysis-MBA-
Association rule mining using the Apriori algorithm
LeondraJames/MarkovChains_MultiTouchAttribution
Multi-touch attribution models, including Markov chains
LeondraJames/MobileGameABTest
Two A/B tests comparing (1) average 1-day and (2) average 7-day player retention between the control (old player level) and the new version (new player level)
LeondraJames/Private_Public_Colleges
Predicting whether a university is private or public with tree-based models (i.e., decision tree classifier, random forest classifier, and gradient boosted tree classifier) using PySpark and Databricks.
LeondraJames/TheEconomistDataViz
Re-Imagination of The Economist: Corruption v. Development
LeondraJames/Titanic_Attempt-1
Kaggle Titanic Data Set Using Logit Model
LeondraJames/TV-HALFTIME-SHOWS-AND-THE-BIG-GAME
EDA project using SQL in Jupyter Notebooks, focusing on the history of games, broadcasts and performances for the National Football League
LeondraJames/WalmartStockEDA
An EDA of Walmart stock data using Databricks, Spark and PySpark.
LeondraJames/Whale-Image-Classification-
Computer Vision project