/conveyor-samples

Samples on how to use Conveyor.

Primary LanguageJupyter Notebook

Open in Gitpod

Conveyor Samples

This repository contains a number of sample projects for Conveyor

Getting started

  1. Clone this repository
  2. Verify you have the Conveyor CLI installed by executing conveyor doctor
  3. Authenticate the Conveyor CLI conveyor auth login
  4. Create a new conveyor environment conveyor environment create --name samples
  5. Execute the additional steps listed in the prerequisites folder

Samples

Basic

  • pi_spark: use Apache Spark to calculate pi.
  • first_project_dbt: use dbt and DuckDB for the first time. Using this project is described in the Conveyor getting started guide.
  • coffee_shop_dbt: use dbt and DuckDB for cleaning and transforming the coffee shop input data and writing the results to S3.

MLOps

  • titanic: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict survival.
  • housing: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict the housing prices.

Various

  • alerting: use native Apache Airflow features to send notifications to Slack.