dtch1997

Mechanistic interpretability researcher. Interested in interpreting multimodal foundation models

dtch1997's Stars

ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language:Go91.4k 543 4.5k7.2k
pdm-project/pdm
A modern Python package and dependency manager supporting the latest PEP standards
Language:Python7.8k 37 1.7k391
PAIR-code/lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
Language:TypeScript3.5k 68 134353
explosion/sense2vec
🦆 Contextually-keyed word vectors
Language:Python1.6k 49 114238
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Language:Python1.6k 40 37143
pfnet/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language:Python1.2k 91 75157
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Language:Python785 20 97152
vikashplus/robohive
A unified framework for robot learning
Language:Python500 11 4684
danijar/crafter
Benchmarking the Spectrum of Agent Capabilities
Language:Python375 9 2163
mees/calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Language:Python366 6 7955
facebookresearch/r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
Language:Python286 13 3045
davidbau/baukit
Language:Python162 11 411
p-lambda/incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"
Language:Python94 12 512
conglu1997/v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Language:Python89 4 179
nrimsky/CAA
Steering Llama 2 with Contrastive Activation Addition
Language:Jupyter Notebook85 1 628
KihoPark/linear_rep_geometry
Language:Jupyter Notebook69 4 38
roeehendel/icl_task_vectors
Language:Python69 1 618
younggyoseo/apv
Language:Python69 4 57
FLAIROx/jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
Language:Python59 5 02
steering-vectors/steering-vectors
Steering vectors for transformer language models in Pytorch / Huggingface
Language:Python53 2 275
RLAgent/factor-world
Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation (2023)
Language:Python28 1 23
EleutherAI/elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard
Language:Python24 2 14
tdmpc2/tdmpc2-eval
Evaluation of TD-MPC2.
Language:Python22 1 01
kevinzakka/dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
Language:Python18 7 12
etaoxing/kitchen-shift
KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts
Language:Python13 2 12
wusche1/CAA_hallucination
Public reposetory for code and results of parts of "Steering Llama 2 via Contrastive Activation Addition" by Rimsky, Gabrieli, Schulz et al.
Language:Python7 2 02
etaoxing/domain-shift-benchmark
Language:Python3 1 12
Joshuaclymer/GENIES
Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains
Language:Python31
oelin/context-free-planning
Finding feasible solutions to planning problems using generative context-free grammars.
Language:Python3 2 00
ethanluoyc/jam
Jam - JAX models
Language:Python1 2 21

dtch1997

dtch1997's Stars

ollama/ollama

pdm-project/pdm

PAIR-code/lit

explosion/sense2vec

kyegomez/BitNet

pfnet/pfrl

octo-models/octo

vikashplus/robohive

danijar/crafter

mees/calvin

facebookresearch/r3m

davidbau/baukit

p-lambda/incontext-learning

conglu1997/v-d4rl

nrimsky/CAA

KihoPark/linear_rep_geometry

roeehendel/icl_task_vectors

younggyoseo/apv

FLAIROx/jaxirl

steering-vectors/steering-vectors

RLAgent/factor-world

EleutherAI/elk-generalization

tdmpc2/tdmpc2-eval

kevinzakka/dm_env_wrappers

etaoxing/kitchen-shift

wusche1/CAA_hallucination

etaoxing/domain-shift-benchmark

Joshuaclymer/GENIES

oelin/context-free-planning

ethanluoyc/jam