Federal University of Rio Grande do Norte

Technology Center

📚 Noah Gift, Alfredo Deza. Practical MLOps: Operationalizing Machine Learning Models. [Link]
📚 Chip Huyen. Designing Machine Learning Systems: An Iterative Process for Production-Ready Applications. [Link]
📚 Hannes Hapke, Catherine Nelson. Building Machine Learning Pipelines. [Link]
📚 Mariano Anaya. Clean Code in Python. [Link]
📚 Aurélien Géron. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow. [Link]
🤜 Dataquest Academic Program [Link]
😃 CS329S - ML Systems Design [Link]
🎯 Machine Learning Operations [Link]

Week 01: Course Outline

Git and Version Control
- You'll learn how to: a) organize your code using version control, b) resolve conflicts in version control, c) employ Git and GitHub to collaborate with others.
- 👊 U1T1: guided project + getting a Git repository.

Week 02: CLI Fundamentals

Elements of the Command Line
- You'll learn how to: a) employ the command line for Data Science, b) modify the behavior of commands with options, c) employ glob patterns and wildcards, d) define important command line concepts, e) navigate the filesystem, f) manage users and permissions.
Text Processing in the Command Line
- You'll learn how to: a) read and explore documentation, b) perform basic text processing, c) redirect and pipe output, d) inspect files, e) define different kinds of output, f) employ streams and file descriptors.
- 🔠 U1T2: working with the command line.

Week 03: Clean Code Principles for Data Science and Machine Learning

Week 04: Production-Ready Code

Week 05: Building a Data Pipeline

Week 06: Building a Reproducible Model Workflow

Outline
Business Reflections
Introduction to MLOps
A brief history of MLOps and Tools
Tools and environment installation
Tools and environment installation cont.
Machine Learning Pipelines
Machine Learning Pipelines - Command Line Interface
Versioning Data and Artifacts
Guided Exercise - CLI + Weights and Biases
MLflow Projects
Introduction to YAML
Guided Exercise - Build an MLflow component
Linking the components together: MLflow + Hydra
Guided Exercise - Multiple MLflow components + Hydra
Additional Material
- Context Managers
- Introduction to Decorators
- Decorators: advanced

Week 07: Building a Reproducible Model Workflow (cont.) - Introduction to Machine Learning

Week 08: Building a Reproducible Model Workflow (cont.) - ETL, Data Checks, Data Segregation

Week 09: Building a Reproducible Model Workflow (cont.) - Training, Validation, and Experiment Tracking

Outline
A brief review
Decision Trees
- Introduction
- Mathematical Foundations
Evaluation Metrics
- How to choose an evaluation metric?
- Threshold metrics
- Ranking metrics
Implementing Pipelines
- MLOps Level 0 with Pipeline incorporating train
  - Part I
  - Part II
  - Part III
  - Source-Code
- MLOps Level 0 with Pipeline incorporating train and preprocessing
  - Part I
  - Part II
  - Part III
  - Source-Code
- MLOps Level 1 with Pipeline incorporating train and preprocessing
  - Part I
  - Part II
  - Source-Code
- MLOps Level 1 with Pipeline and Hyper-parameter Tuning
  - Part I
  - Part II
  - Part III
  - Part IV
  - Source-Code
- Test evaluation
  - Part I
  - Source-Code

Week 10: Building a Reproducible Model Workflow (cont.) - Final Pipeline, Release, and Deploy

Outline
Final Pipeline
- Big picture of the final pipeline
- All together
  - Part I
  - Part II
  - Source-Code
Release for reproducibility
- Create a GitHub repository for the final pipeline
- Semantic versioning and remote execution
Deployment
- Deploy with MLflow