/feast_workshops

A series of workshop modules introducing Feast feature store.

Primary LanguageJupyter Notebook

What is Feast?

Feast is an operational data system for managing and serving machine learning features to models in production. It can serve features from a low-latency offline store (for real-time prediction) or from an off-line store (for scale-out batch scoring or training models).

Feature stores have emerged as a pivotal component in the modern machine learning stack, as more data scientists and engineers work together to operationalize ML. Associated with this task are some operational challenges. The toughest challenges for operationalizing ML is data: how to compute and select features, store, validate serve, discover and share them.

Goal and Objective

In this series of instructor led lectures and hands-on workshops, divided them into distinct modules, you will learn:

  • what key problems feature stores solve to operationalize ML
  • why features stores are a pivotal components in the model machine learning stack
  • common key use cases and deployment patterns for feature stores observed by the MLOps and ML practitioners
  • how feature stores are playing a transformational role with the rise of modern data platforms
  • get a hands-on experience with the popular open source feature store Feast

The workshop, all done on your laptop, is divided into four modules:

Time (mins) Description Module
45-60 Feast Concepts, Declarative & Consumption APIs, and Creating Features for offline/online access Module 1
45 Quick intro to MLflow and Feast & MLflow integration Module 2
45 A credit scoring & loan application use case with XGBoost and Ray for Distributed Training Module 3
45 How to use Feast for on-demand transformations (WIP) Module 4

Who Should Take This Workshop

A data scientist or data engineer who wants to learn and understand:

  • how Feature Store builds a bridge between your feature data and machine learning models
  • how to serve consistent features for both training and low-latency inference
  • how to build feature engineering pipelines to build features

Skill Level

Beginner to Intermediate

Instructor

Prerequisites

  • Knowledge of Python 3 and programming in general
  • Preferably a UNIX-based, fully-charged laptop with 8-16 GB, with a Chrome or Firefox browser
  • Some knowledge of Machine Learning concepts, libraries, and frameworks:
    • scikit-learn
    • pandas and Numpy
    • PyCharm/IntelliJ or choice of syntax-based Python editor
  • pip/pip3 or conda and Python 3 installed
  • Loads of virtual laughter, curiosity, and a sense of humor ... :-)

Obtaining the Workshop materials

Familiarity with git is important so that you can get all the material easily during the tutorial and workshop as well as continue to work in your free time, after the session is over.

git clone git@github.com:dmatrix/feast_workshops.git

git clone https://github.com/dmatrix/feast_workshops.git

Documentation Resources

This workshop will refer to documentation: