kclary's Stars
karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
kornia/kornia
🐍 Geometric Computer Vision Library for Spatial AI
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
dimforge/rapier
2D and 3D physics engines focused on performance.
pfnet/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
denisyarats/drq
DrQ: Data regularized Q
princeton-vl/CoqGym
A Learning Environment for Theorem Proving with the Coq proof assistant
WilsonWangTHU/mbbl
usnistgov/dioptra
Test Software for the Characterization of AI Technologies
IntelLabs/causality-lab
Causal discovery algorithms and tools for implementing new ones
johnidm/asm-atari-2600
Sample source code games Atari 2600
bark-simulator/bark-ml
Gym environments and agents for autonomous driving.
cage-challenge/CybORG
Cyber Operations Research Gym
microsoft/IBAC-SNI
Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck" by Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin and Katja Hofmann.
modestyachts/robust-adaptive-lqr
Implementation of robust adaptive control methods for the linear quadratic regulator
jpwahle/lrec22-d3-dataset
The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
nv-research-israel/causal_comp
This repository hosts the dataset and source code for "A causal view of compositional zero-shot recognition". Yuval Atzmon, Felix Kreuk, Uri Shalit, Gal Chechik (Spotlight)
toybox-rs/Toybox
The Machine Learning Toybox for testing the behavior of autonomous agents.
csxeba/trickster
Reinforcement learning in TensorFlow 2
pokaxpoka/rad_procgen
RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)
borea17/efficient_rl
Reimplementation of "An Object-Oriented Representation for Efficient RL"
jesbu1/carl
Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
gursky1/cygym
Cythonized versions of the OpenAI Gym classic control environments.
lenskit/seedbank
Manage seeds across multiple Python RNGs.
fmaxgarcia/Meta-MDP
KDL-umass/saliency_maps
Code for building and experimenting on saliency maps for RL agents.
thomason-jesse/pokemon_emerald_shuffle
Scripts to push around 'mon.
aivaslab/standoff
Gridworld environment for competitive-feeding-like theory of mind experiments
holderlb/WSU-SAILON-NG