ploomber: A Python repository from EddieCornelious

Ploomber is the fastest way to build data pipelines ⚡️. Develop interactively in your favorite editor (Jupyter, VSCode, PyCharm), then deploy ☁️ to Kubernetes, Airflow, AWS Batch, or SLURM without code changes. Do you have legacy notebooks? Refactor them into modular pipelines with a single command.

Installation

Compatible with Python 3.6 and higher.

Install with pip:

pip install ploomber

Or with conda:

conda install ploomber -c conda-forge

Getting started

Run an example locally:

# ML pipeline example
ploomber examples -n templates/ml-basic -o ml-basic
cd ml-basic

# install dependencies
pip install -r requirements.txt

# run pipeline
ploomber build

You just ran a Ploomber pipeline! 🎉

Check out the output/nb.html report with model results!

The pipeline.yaml file contains the pipeline declaration. Feel free to modify any of the tasks, then run ploomber build again to update the results. Note: if you use VSCode or PyCharm, run ploomber nb -i before editing the files.
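
To give a rough idea of what a pipeline declaration looks like, here is a minimal sketch of a pipeline.yaml. The script names and most product paths are hypothetical and will not match the ml-basic example exactly; only output/nb.html is taken from the text above. Each task lists a source and the product(s) it generates.

```yaml
# pipeline.yaml (illustrative sketch, not the actual ml-basic file)
tasks:
  # hypothetical script that downloads the raw data
  - source: scripts/get.py
    product:
      nb: output/get.ipynb      # executed copy of the script, saved as a notebook
      data: output/raw.csv      # hypothetical data file written by the script

  # hypothetical script that trains a model and writes the report
  - source: scripts/fit.py
    product:
      nb: output/nb.html        # the HTML report mentioned above
      model: output/model.pickle
```

After editing a task's source or its entry in this file, running ploomber build again regenerates the affected products.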

What's next?

Ready to migrate your project? Click here.

Do you want to learn more? Check out the introductory tutorial.

Run more examples.