/nova-mlops

Material for MLOps summer course in Nova

Primary LanguagePerl

MLOps tutorial for NOVA

Lint Markdown Check README and text Lint Deno Tests perl with system one

Course material for MLOps summer course in Nova. Also available at this URL.

Learning outcomes of the curricular unit

  1. Understand the fundamental principles of agile development
  2. Understand how to integrate design, model development, training and operations in a continuous cycle.
  3. Apply quality assurance concepts at every step of the process.
  4. Create a complete model as well as a report in a software artifact that changes continuously.
  5. Applying fundamental machine learning and scientific principles to the above design and reports.

Syllabus

Before the syllabus, we need to understand the principles | Web and read the introduction to the course | Web.

  1. Design thinking for solving real-life problems | Web
  2. Agile development | Web
  3. Test driven development for scraping workflows | Web.
  4. Automation infrastructure using GitHub actions | Web.
  5. Containers for reproducible science.
  6. MLOps tools.
  7. Open science: creation and deployment of machine learning projects.

Sessions

The course is organized as a bootcamp, with different sessions over which a project, and eventually a coauthored preprint, will be developed. Students can use this repository template to kick-start their projects with spell-checking workflows.

Session objectives and material are as follows:

  1. First session, meet and greet, organization and setup.
  2. Second session, first workflow stages.
  3. Third session, extracting and storing information.

Some links

How to run this

Most tasks that are included in workflows are also in the top-level Makefile

Should be all of them, although at a certain point they can be out of sync.

But there are a few tools that you should have installed in your system

  • Perl, preferably installed via perlbrew, including cpanm
  • R

LICENSE

All text here is released under a cc-by-sa license, all code under the GPL.