/e4ds-snippets

Code examples and experiments from https://engineeringfordatascience.com

Primary LanguageJupyter NotebookMIT LicenseMIT

View on GitHub Engineering for Data Science

Engineering for Data Science (e4ds) - Snippets

Collection of useful data science, machine learning and data engineering topics with code examples linked to posts in my blog engineeringfordatascience.com

Contents

Best Practices

Articles about Python and data science best practices

Title Article Repository
Which Python string formatting method should you be using in your data science project? Blog Link
Reproducible ML: Maybe you shouldn't be using Sklearn's train_test_split Blog Link
Setting up logging for Python projects Blog Link

Matplotlib

Title Article Repository
Plotting subplots in a loop Blog Link
Making impactful charts using plt.suptitle Blog Link

Jupyter

Title Article Repository
(Part 1) Voilà! Interactive Python Dashboards Straight from your Jupyter Notebook Blog Link
(Part 2) Voilà! Optimising Python Dashboard Performance Blog Link
(Part 3) Voilà! Deploy your Jupyter Notebook Based Python Dashboard on Heroku Blog Link

Pandas

Title Article Repository
SQL-like Window Functions in Pandas Blog Link

PySpark

Title Article Repository
Unit testing in PySpark Blog Link
How to save the output of PySpark DataFrame 'show' to a variable Blog Link
How to save a PySpark dataframe to a single output file Blog Link

PyTest

Title Article Repository
How to use fixtures as arguments in parametrize Blog Link
Tips for using PyTest Blog Link
How to mock sending SMTP emails using PyTest Blog Link
How to use allure-pytest and allure-pytest-bdd plugins in the same project Blog Link

Google Cloud Platform (GCP)

Title Article Repository
Event Driven Data Validation with Google Cloud Functions and Great Expectations Blog Link
Google Search Console API with Python Blog Link

Web Scraping & Automation

Title Article Repository
Export Spotify Playlist to CSV File with Python Blog Link