Pinned Repositories
2019-in-demand-ds-tech-skills
Jupyter notebook for scraping and analyzing the most in-demand technology skills for data scientists.
causal-inference-python-packages
List of Python packages for causal inference
data-viz-streamlit
deep-learning-cloud-providers
A list of all deep learning cloud providers
list-of-python-api-wrappers
List of Python API Wrappers and Libraries
pacc-2023
prefect-mlops-zoomcamp
MLOps Zoomcamp 2023 repository for Module 3
prefect-zoomcamp
Code for the Data Engineering Zoomcamp
discdiver's Repositories
discdiver/pandas-iterator-timings
Exploring the fastest ways to iterate over rows of data in Pandas.
discdiver/categorical-encoding
A library of sklearn-compatible categorical variable encoders
discdiver/categorical_encoders_benchmark_kernel
More benchmarking for categorical encoders
discdiver/course-resources-ml-with-experts-budgets
Further student resources for DrivenData's 'Machine Learning with the Experts: School Budgets' DataCamp course.
discdiver/diet
Diet data analysis project
discdiver/dsworkflow
Data Science Workflow
discdiver/fluentopt
A flexible hyper-parameter optimization library for machine learning
discdiver/gitignore
A collection of useful .gitignore templates
discdiver/imputing
Discussion of imputing options and workflows for machine learning. Also looks at data measurement scales.
discdiver/intro-data-capstone-musclehub
discdiver/measurement-scales
Kernel that discusses how to clarify ordinal and nominal data and options for encoding.
discdiver/opendatadc
discdiver/regression-problem-workflow
Jupyter notebook of a machine learning regression problem workflow.
discdiver/tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.