firstuserhere
Taking apart neural networks and putting them back together for a living. Personal website: https://kunvarthaman.com
firstuserhere's Stars
Jazhyc/llm-sandbag-activation-steering
roedoejet/AnyLanguage-Word-Guessing-Game
A word guessing game that can be modified and translated to your language!
SambhavG/dine
OlineRanum/Ponita_SLR
Fast, Expressive SE(n) Equivariant Networks through Weight-Sharing in Position-Orientation Space.
sfcompute/tinynarrations
A synthetic story narration dataset to study small audio LMs.
michaelneuper/hugo-texify3
A LaTeX-style Hugo theme with the Gruvbox color scheme for personal blogging
randomaccess2023/MG2023
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
facebookresearch/llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. Check out the demo at https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
YannDubs/disentangling-vae
Experiments for understanding disentanglement in VAE latent representations
1Konny/Beta-VAE
PyTorch implementation of β-VAE
neverix/saex
SAEs in Jax
kronusaturn/lw2-viewer
An alternative frontend for LessWrong 2.0
PAIR-code/lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible, framework-agnostic interface.
JohnVinyard/matching-pursuit
This repository contains research and experiments aimed at producing sparse, interpretable representations of audio.
evanhanders/superposition-geometry-toys
Experiments for running toy models of superposition as in Anthropic's 2022 paper. These experiments focus on superposition of composed features.
shacharKZ/VISIT-Visualizing-Transformers
saprmarks/feature-circuits
Nix07/finetuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking".
callummcdougall/sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
fletchel/aisc_oocl_experiments
Experiments trying to elicit out-of-context learning when training a transformer on a simple task
thestephencasper/everything-you-need
we got you bro
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
amlweems/xzbot
notes, honeypot, and exploit demo for the xz backdoor (CVE-2024-3094)
Baidicoot/sae_alternatives
openai/grok
StavC/ComPromptMized
ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications
openai/transformer-debugger
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
typeling1578/Year-Progress-Bar