cas2247
NYC based Data Scientist with experience in the financial services and consumer products sectors 📈
Massachusetts Institute of TechnologyNew York, NY
Pinned Repositories
01
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
2015
Public material for CS109
Algorithmic-Fairness-Capstone
This project analyzes an Automated Decision System predicting whether data scientists seek new jobs. We assess biases in the data, processing steps, and model outputs to highlight fairness and transparency issues in HR-focused machine learning.
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
babel
The official repository for Babel, the Python Internationalization Library
covid19_analysis
Exploratory data analysis on the global covid-19 outbreak
Modern-Analysis-Textbook
As an undergraduate, I wrote a textbook for Real/Modern Analysis to complement my studying of the subject. My goal with the text was to present the subject more intuitively and more visually than is usually embraced by the formal texts.
Speed-Data-ers-Masters-Capstone
This project explores the Speed Dating Experiment dataset to uncover patterns in romantic match outcomes using statistical analysis and machine learning techniques. Through hypothesis testing and predictive modeling, we examined gender differences in match likelihood and the influence of shared interests on sucessful connections.
stan
Stan development repository (home page is linked below). The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.
Statistics-Cheat-Sheet
This is a statistics cheatsheet complied from my Introductory Statistics with Calculus course which I took as an undergraduate. The probability formula sheet summarizes important probability concepts, formulas, and distributions, with figures, examples, and stories.
cas2247's Repositories
cas2247/Statistics-Cheat-Sheet
This is a statistics cheatsheet complied from my Introductory Statistics with Calculus course which I took as an undergraduate. The probability formula sheet summarizes important probability concepts, formulas, and distributions, with figures, examples, and stories.
cas2247/Speed-Data-ers-Masters-Capstone
This project explores the Speed Dating Experiment dataset to uncover patterns in romantic match outcomes using statistical analysis and machine learning techniques. Through hypothesis testing and predictive modeling, we examined gender differences in match likelihood and the influence of shared interests on sucessful connections.
cas2247/Algorithmic-Fairness-Capstone
This project analyzes an Automated Decision System predicting whether data scientists seek new jobs. We assess biases in the data, processing steps, and model outputs to highlight fairness and transparency issues in HR-focused machine learning.
cas2247/Modern-Analysis-Textbook
As an undergraduate, I wrote a textbook for Real/Modern Analysis to complement my studying of the subject. My goal with the text was to present the subject more intuitively and more visually than is usually embraced by the formal texts.
cas2247/stan
Stan development repository (home page is linked below). The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.
cas2247/01
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
cas2247/2015
Public material for CS109
cas2247/awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
cas2247/babel
The official repository for Babel, the Python Internationalization Library
cas2247/covid19_analysis
Exploratory data analysis on the global covid-19 outbreak
cas2247/datasets
A collection of datasets of ML problem solving
cas2247/Fashion_MNIST
Tensorflow 2.0 tutorial using the Fashion MNIST data set
cas2247/geocoder
:earth_asia: Python Geocoder
cas2247/go
The Open Source Data Science Masters
cas2247/HackerRank
HackerRank solutions in Java/JS/Python/C++/C#
cas2247/LendingClubAnalysis
cas2247/llama-stack
Composable building blocks to build Llama Apps
cas2247/markdown-cheatsheet
Markdown Cheatsheet for Github Readme.md
cas2247/ML_TensorflowAPIs
Notes and Code from Google's Machine Learning Crash Course
cas2247/monopoly
cas2247/natural_language_weekend
Weekend of natural language projects
cas2247/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming in data analysis with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
cas2247/prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
cas2247/pymc3_vs_pystan
Personal project to compare hierarchical linear regression in PyMC3 and PyStan, as presented at http://pydata.org/london2016/schedule/presentation/30/ video: https://www.youtube.com/watch?v=Jb9eklfbDyg
cas2247/Python
My Python Examples
cas2247/vimrc
The ultimate Vim configuration: vimrc