/iml-2022

UVa CS 4501/6501 Interpretable Machine Learning

1. Course Information

  • Instructors: Hanjie Chen, Yangfeng Ji
  • Location: Rice 340
  • Time: Tuesday and Thursday 11 AM - 12:15 PM
  • Office hour: TBA

2. Course Description

Machine learning models have achieved remarkable performance in a wide range of AI fields, such as Natural Language Processing and Computer Vision. However, the lack of interpretability of machine learning models raises concerns regarding the trustworthiness and reliability of their predictions. This problem blocks their applications in the real world, especially in high-stake scenarios, such as healthcare, economy and criminal justice. The goal of this course is to let students get familiar with the emerging problem in machine learning and recent advances in interpretable and explainable AI.

2.1 Topics

This course will include but not limit to the following contents:

  • Background of interpretable machine learning
    • Interpretability in machine learning
    • Brief introduction of deep learning
  • Techniques in exploring the interpretability of machine learning models
    • Different classes of interpretable models (e.g., prototype based approaches, sparse linear models, rule based techniques, generalized additive models)
    • Post-hoc explanations (e.g., white-box explanations, black-box explanations, counterfactual explanations, saliency maps)
    • Connections between model interpretability and other properties, such as robustness, uncertainty, and fairness
  • Implementation of model interpretability in real-world applications, including natural language processing, computer vision, healthcare, etc.

2.2 Format

For each week, we have one lecture (given by the instructors) and one class for paper discussion (led by the instructors, start from the 2nd week)

2.3 Prerequisites

  • Machine Learning: Students are expected to have machine learning background, for example, by taking one of our machine learning classes (CS 4774 or CS 6316).
  • Programming: Students are also expected to have programming and software engineering skills to work with machine packages using Python (e.g., Sklearn, PyTorch, Tensorflow).

2.4 Textbook/Materials

Acknowledgement

Hanjie Chen is supported by the UVa Engineering Graduate Teaching Internship Program (GTI) for designing and teaching this course.