interpretability-and-explainability

There are 19 repositories under the interpretability-and-explainability topic.

ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models
This repository collects all relevant resources about interpretability in LLMs
297 5 419
HennyJie/IBGNN
MICCAI 2022 (Oral): Interpretable Graph Neural Networks for Connectome-Based Brain Disorder Analysis
Language: Python
57 4 68
Wuyxin/DISC
Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)
Language: Python
40 3 35
liugangcode/GREA
[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"
Language: Python
36 3 46
WanyuGroup/CVPR2022-OrphicX
Official code for the CVPR 2022 (oral) paper "OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks."
Language: Python
34 1 314
cwangrun/ST-ProtoPNet
[ICCV 2023] Learning Support and Trivial Prototypes for Interpretable Image Classification
Language: Python
21 2 12
vdlad/Remarkable-Robustness-of-LLMs
Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
Language: Jupyter Notebook
16 1 00
interpretable-ml-class/interpretable-ml-class.github.io
Explainable AI: From Simple Rules to Complex Generative Models
Language: HTML
9 2 04
warisgill/TraceFL
TraceFL is a novel mechanism for Federated Learning that achieves interpretability by tracking neuron provenance. It identifies clients responsible for global model predictions, achieving 99% accuracy across diverse datasets (e.g., medical imaging) and neural networks (e.g., GPT).
Language: Python
7 2 00
Imenbaa/BA-LR
Explainable Speaker Recognition
Language: Python
2 1 00
VictorNico/NNs_from_scratch
Build a neural net from scratch without Keras or PyTorch, using only NumPy for the computations and pandas for data loading.
Language: Jupyter Notebook
2 2 00
DimitrisReppas/On_visual_explanation_of_supervised_and_self-supervised_learning
Visualization methods to interpret CNNs and Vision Transformers, trained in a supervised or self-supervised way. The methods are based on CAM or on the attention mechanism of Transformers. The results are evaluated qualitatively and quantitatively.
Language: Python
1 1 00
Skyyyy0920/SSCBM
Semi-supervised Concept Bottleneck Models (SSCBM)
Language: Python
1 2 02
bishwamittra/nus_thesis
My PhD thesis at NUS. Making it public so that future graduate students may benefit.
Language: TeX
0 1 00
fguzman82/PhD-Thesis
Interpretability: Methods for Identification and Retrieval of Concepts in CNN Networks
Language: Jupyter Notebook
0 1 00
goz1985/RST-ARM-GLM_-Research
Work on combining a logit model with an information granulation method for better interpretability
Language: R
0 1 00
MattScicluna/interpretable_tsne
Implementation of the gradient-based t-SNE attribution method described in our GLBIO oral presentation: 'Towards Computing Attributions for Dimensionality Reduction Techniques'
Language: Python
0 1 01
Skyyyy0920/FGAI
Language: Python
0 1 00
swardiantara/DroneLog
Interpretable Anomaly Severity Detection on UAV Flight Log Messages
Language: HTML
0 1 00
