dtch1997
Mechanistic interpretability researcher. Interested in interpreting multimodal foundation models
Pinned Repositories
ai-short-stories
A collection of short stories about AI
ASE
corl2023_rl_cbf
Code accompanying the submission: "Your Value Function is a Control Barrier Function: Verication of Learned Policies using Control Theory"
CrowdHuman-dataset-prep
A repository to download and prepare CrowdHuman dataset for training in PyTorch
IsaacGymEnvs
AMP implementation for quadruped legged robot in IsaacGymEnvs
quadruped-gym
An OpenAI gym environment for the training of legged robots
reasoning-bench
A collection of reasoning benchmarks for LLMs
rl_cbf
Code accompanying "Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory"
steering-bench
Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"
tms-kit
Toy models of superposition
dtch1997's Repositories
dtch1997/ASE
dtch1997/corl2023_rl_cbf
Code accompanying the submission: "Your Value Function is a Control Barrier Function: Verication of Learned Policies using Control Theory"
dtch1997/awesome-ml-dev-tools
Collection of development tools for ML engineering or research
dtch1997/awesome-skill-learning
Collection of resources on skill learning methods
dtch1997/fractal-fft
Open-source implementation of "A Fast Fourier Transform for Fractal Approximations"
dtch1997/gymnasium-quadruped
Gymnasium environment for training quadruped legged robots.
dtch1997/quadruped-nn-control-stack
A framework for deploying NN policies to quadruped robots.
dtch1997/TorchFlowMicro
A pipeline for converting PyTorch models to TFLiteMicro models suitable for deployment to edge devices.
dtch1997/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
dtch1997/allenact
An open source framework for research in Embodied-AI from AI2.
dtch1997/awesome-graphic-design
Collection of resources on designing aesthetic graphics for presentations, research papers, etc.
dtch1997/awesome-remote-computing
A collection of resources / tips on working on remote workstations / PCs
dtch1997/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
dtch1997/cs224n-assignments
Repository for my submission of Stanford CS224N 2023 assignments
dtch1997/Deep-Representations-and-Learning
This is a repo for Labs prepared for COMP0188 taught at UCL.
dtch1997/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
dtch1997/disentangle-gen
Implementing regularization for training disentangled generative models.
dtch1997/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
dtch1997/gpt-text-gym
Experiments with GPT on text-based gym environments, such as text-wrapped Minigrid and NetHack
dtch1997/gym
A toolkit for developing and comparing reinforcement learning algorithms.
dtch1997/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
dtch1997/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
dtch1997/minigrid-experiments
Experiments in Minigrid
dtch1997/minigrid-language-wrapper
Language wrapper for Minigrid environments
dtch1997/python-template
dtch1997/ql_clbf
dtch1997/quadruped-bc
Behaviour cloning experiments on quadruped
dtch1997/rl_games
RL implementations
dtch1997/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
dtch1997/transformer-agents