dtch1997

Mechanistic interpretability researcher. Interested in interpreting multimodal foundation models

Pinned Repositories

ai-short-stories
A collection of short stories about AI
00
ASE
Language:Python4 0 00
corl2023_rl_cbf
Code accompanying the submission: "Your Value Function is a Control Barrier Function: Verication of Learned Policies using Control Theory"
Language:Python4 1 00
CrowdHuman-dataset-prep
A repository to download and prepare CrowdHuman dataset for training in PyTorch
Language:Python5 1 01
IsaacGymEnvs
AMP implementation for quadruped legged robot in IsaacGymEnvs
Language:Python13 1 21
quadruped-gym
An OpenAI gym environment for the training of legged robots
Language:Jupyter Notebook9 2 00
reasoning-bench
A collection of reasoning benchmarks for LLMs
Language:Python10
rl_cbf
Code accompanying "Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory"
Language:Python21 4 20
steering-bench
Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"
Language:Python5 1 00
tms-kit
Toy models of superposition
Language:HTML3 2 70

dtch1997's Repositories

dtch1997/ASE
Language:Python4 0 00
dtch1997/corl2023_rl_cbf
Code accompanying the submission: "Your Value Function is a Control Barrier Function: Verication of Learned Policies using Control Theory"
Language:Python4 1 00
dtch1997/awesome-ml-dev-tools
Collection of development tools for ML engineering or research
3 1 00
dtch1997/awesome-skill-learning
Collection of resources on skill learning methods
2 1 00
dtch1997/fractal-fft
Open-source implementation of "A Fast Fourier Transform for Fractal Approximations"
Language:Python2 2 00
dtch1997/gymnasium-quadruped
Gymnasium environment for training quadruped legged robots.
Language:Python2 1 0
dtch1997/quadruped-nn-control-stack
A framework for deploying NN policies to quadruped robots.
Language:C++2 1 0
dtch1997/TorchFlowMicro
A pipeline for converting PyTorch models to TFLiteMicro models suitable for deployment to edge devices.
Language:C++2 1 00
dtch1997/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Language:C++1 0 0
dtch1997/allenact
An open source framework for research in Embodied-AI from AI2.
Language:Python0 0
dtch1997/awesome-graphic-design
Collection of resources on designing aesthetic graphics for presentations, research papers, etc.
1 0
dtch1997/awesome-remote-computing
A collection of resources / tips on working on remote workstations / PCs
1 0
dtch1997/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python
dtch1997/cs224n-assignments
Repository for my submission of Stanford CS224N 2023 assignments
Language:Python1 0
dtch1997/Deep-Representations-and-Learning
This is a repo for Labs prepared for COMP0188 taught at UCL.
Language:Jupyter Notebook
dtch1997/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
dtch1997/disentangle-gen
Implementing regularization for training disentangled generative models.
Language:Python2 1
dtch1997/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python0 0
dtch1997/gpt-text-gym
Experiments with GPT on text-based gym environments, such as text-wrapped Minigrid and NetHack
Language:Python1 2
dtch1997/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python0 0
dtch1997/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Language:Python0 0
dtch1997/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python0 0
dtch1997/minigrid-experiments
Experiments in Minigrid
Language:Python1 0
dtch1997/minigrid-language-wrapper
Language wrapper for Minigrid environments
Language:Python1 0
dtch1997/python-template
dtch1997/ql_clbf
Language:Python1 0
dtch1997/quadruped-bc
Behaviour cloning experiments on quadruped
Language:Python1 0
dtch1997/rl_games
RL implementations
dtch1997/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python1 0
dtch1997/transformer-agents
Language:Makefile1 0