sparse-autoencoder
There are 27 repositories under sparse-autoencoder topic.
PaulPauls/llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
vgel/repeng
A library for making RepE control vectors
ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models
This repository collects all relevant resources about interpretability in LLMs
wblgers/tensorflow_stacked_denoising_autoencoder
Implementation of the stacked denoising autoencoder in Tensorflow
syorami/Autoencoders-Variants
Pytorch implementations of various types of autoencoders
glami/sansa
SANSA - sparse EASE for millions of items
explanare/ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
khoink94/tensorflow-Deep-learning
Tensorflow Examples
mrquincle/keras-adversarial-autoencoders
Experiments with Adversarial Autoencoders using Keras
snooky23/K-Sparse-AutoEncoder
Sparse Auto Encoder and regular MNIST classification with mini batch's
tim-lawson/mlsae
Multi-Layer Sparse Autoencoders
shantanu-ai/DPN-SA
Repository of Deep Propensity Network - Sparse Autoencoder(DPN-SA) to calculate propensity score using sparse autoencoder
zer0int/CLIP-SAE-finetune
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
MaheepChaudhary/SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the paper "Evaluating Open-Source Sparse Autoencoders on Disentangling Factual Knowledge in GPT-2 Small"
Ki-Seki/Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
ALPHAYA-Japan/autoencoders
Collection of autoencoder models in Tensorflow
Butanium/tiny-activation-dashboard
A tiny easily hackable implementation of a feature dashboard.
chrisliu298/awesome-sparse-autoencoders
A resource repository of sparse autoencoders for large language models
vivekamin/semi-supervised-learning
Implemented semi-supervised learning for digit recognition using Sparse Autoencoder
aarnphm/tinymorph
exploration WYSIWYG editor
seonglae/emgsd-hermes
Steering GPT2-EMGSD less biased & Generating stereotyped text with vanilla GPT2 without fine tuning or prompt engineering
sushantMoon/isi-nna
Neural Network Architcture | ISI Kolkata
cxcscmu/embedding-scope
Interpret and control dense embedding via sparse autoencoder.
SayanChakraborty126/ML-CODES
This repository contains Python codes for Autoenncoder, Sparse-autoencoder, HMM, Expectation-Maximization, Sum-product Algorithm, ANN, Disparity map, PCA.
Specoptor/bot-iot
Implement a sparse autoencoder on the bot-iot dataset for dimensionality reduction followed by computation of reconstruction error, F1 score, recall, accuracy, weights, and threshold amongst other metrics
sskrishn/Neural-Networks
Folder contains implementation of Multi layer feed forward networks, Autoencoders, Sparse Autoencoders and many..
mcanalesmayo/SparseAutoencoder
Sparse Autoencoder based on the Unsupervised Feature Learning and Deep Learning tutorial from the Stanford University