Russian NLU Transformer Explainability Research

Objectives

The study aims to apply several methods towards machine learning interpretation to study the representation and inference in NLU Transformer-based models fine-tuned for Russian language.

Methods

  1. Structural Probing
  2. Feature Attribution
  3. Intervention-Based Analysis

Application

  • Visualization
  • Language theory