Pinned Repositories
azure-terraform-up-and-running-code
Code samples for the book "Terraform: Up & Running" by Yevgeniy Brikman - translated to Azure
Carbon-benefit-IT-Refurbishment
Returns the carbon benefit of reuse of an IT asset compared to recycling.
ChemBL-ADME
The publicly curated ChemBL database, merged with the Absorption, Distribution, Metabolism, and Excretion (ADME) datbase
faux_lars
Nose-bleedingly fast fake data generation
harley
Polars helper methods to enhance developer productivity
islr-v2
Exercises and labs in python, for the book Introduction to Statistical Learning Second Edition.
node-agnostic-spark
ETL worker which uses PySpark API but is node agnostic.
Out-of-the-Box-Sales-Analysis
A python app which provides analysis and visualisation for any sales file.
polari
Sentiment and language detection for text analytics.
tipitaka
RE: WIP Project to perform clustering on the Pali Text Society's Tipitaka
TomBurdge's Repositories
TomBurdge/polari
Sentiment and language detection for text analytics.
TomBurdge/harley
Polars helper methods to enhance developer productivity
TomBurdge/faux_lars
Nose-bleedingly fast fake data generation
TomBurdge/islr-v2
Exercises and labs in python, for the book Introduction to Statistical Learning Second Edition.
TomBurdge/azure-terraform-up-and-running-code
Code samples for the book "Terraform: Up & Running" by Yevgeniy Brikman - translated to Azure
TomBurdge/Carbon-benefit-IT-Refurbishment
Returns the carbon benefit of reuse of an IT asset compared to recycling.
TomBurdge/ChemBL-ADME
The publicly curated ChemBL database, merged with the Absorption, Distribution, Metabolism, and Excretion (ADME) datbase
TomBurdge/data-science-repo-template
A repository template using Poetry, Makefile, and pre-commit-hooks - fork from Gabriel Harris for use with WSL/Linux
TomBurdge/dbt-police-data
A dbt-duckdb pipeline which downloads stop and search data in the UK
TomBurdge/node-agnostic-spark
ETL worker which uses PySpark API but is node agnostic.
TomBurdge/Out-of-the-Box-Sales-Analysis
A python app which provides analysis and visualisation for any sales file.
TomBurdge/Parent-Brands-Open-AI-Streamlit-Demo
A Streamlit app which uses ChatGPT to find parent brands.
TomBurdge/tipitaka
RE: WIP Project to perform clustering on the Pali Text Society's Tipitaka
TomBurdge/duckdb-fork
DuckDB is an in-process SQL OLAP Database Management System
TomBurdge/functime-fork
Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
TomBurdge/graph-algorithms
TomBurdge/learning-pyspark
Learning pyspark with the https://www.youtube.com/watch?v=_C8kWso4ne4 FreeCodeCamp tutorial
TomBurdge/logic-through-python
TomBurdge/pyspark-mastery
The repo name is over the top :) - I am practicing some PySpark. I am aiming to become very familiar with the PySpark API for DuckDB open source contributions and professional use.
TomBurdge/python-email-scraper
A python file which uses selenium to scrape email adresses from sites in a list. Saves the scraped emails to a CSV called "emails"
TomBurdge/TableauProject
A project to make a visualisation of a public dataset about the life-cycle carbon emissions of IT products.
TomBurdge/TomBurdge