Pinned Repositories
15.071x-The-Analytics-Edge
MIT Coursework
Ames-Housing-Regression
Using Lasso regression to predict housing prices in Ames, Iowa.
arroyo
Distributed stream processing engine in Rust
hdb-kaki
Visualizing Singapore's HDB Resale Prices using Streamlit
monopoly
Monopoly is a Python library & CLI that converts bank statement PDFs to CSV.
NYT-Article-Popularity
Predicting online news popularity using New York Times articles in 2020
OutlookParser50
A work automation tool that includes an email parser and report writer
pdf2john
A modern refactoring of the legacy pdf2john.py library
StatementSensei
PDF to CSV conversion for your bank statements
WNV-Prediction
Predicting the West Nile Virus with Logistic Regression
benjamin-awd's Repositories
benjamin-awd/StatementSensei
PDF to CSV conversion for your bank statements
benjamin-awd/monopoly
Monopoly is a Python library & CLI that converts bank statement PDFs to CSV.
benjamin-awd/OutlookParser50
A work automation tool that includes an email parser and report writer
benjamin-awd/hdb-kaki
Visualizing Singapore's HDB Resale Prices using Streamlit
benjamin-awd/pdf2john
A modern refactoring of the legacy pdf2john.py library
benjamin-awd/NYT-Article-Popularity
Predicting online news popularity using New York Times articles in 2020
benjamin-awd/WNV-Prediction
Predicting the West Nile Virus with Logistic Regression
benjamin-awd/15.071x-The-Analytics-Edge
MIT Coursework
benjamin-awd/Ames-Housing-Regression
Using Lasso regression to predict housing prices in Ames, Iowa.
benjamin-awd/arroyo
Distributed stream processing engine in Rust
benjamin-awd/astronomer-cosmos
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
benjamin-awd/dotfiles
benjamin-awd/ga-dataset
benjamin-awd/glints-etl
benjamin-awd/SAT-ACT-Analysis
An analysis of SAT and ACT results with a focus on socioeconomic inequality.
benjamin-awd/Subreddit-Classification-with-NLP
Comparing and classifying r/MensLib and r/MensRights with NLP
benjamin-awd/citibank-statement-downloader
benjamin-awd/datahub
The Metadata Platform for the Modern Data Stack
benjamin-awd/dbt-athena
The Athena adapter plugin for dbt (https://getdbt.com)
benjamin-awd/dbt-clickhouse
The ClickHouse plugin for dbt (data build tool)
benjamin-awd/john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
benjamin-awd/nytimes-scraper
benjamin-awd/pybadges
A Python library for creating GitHub-style badges
benjamin-awd/pyHanko
pyHanko: sign and stamp PDF files
benjamin-awd/PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
benjamin-awd/risingwave
SQL stream processing, analytics, and management. We decouple storage and compute to offer efficient joins, instant failover, dynamic scaling, speedy bootstrapping, and concurrent query serving.
benjamin-awd/sqlfmt
sqlfmt formats your dbt SQL files so you don't have to
benjamin-awd/streamlit
Streamlit — A faster way to build and share data apps.
benjamin-awd/vector
A high-performance observability data pipeline.