dranaju's Stars
FrankZheng2022/TACO
Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"
KindXiaoming/pykan
Kolmogorov Arnold Networks
riiswa/kanrl
Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments
PKU-RL/Plan4MC
Reinforcement learning and planning for Minecraft.
will8211/unimatrix
Python script to simulate the display from "The Matrix" in a terminal. Uses half-width katakana Unicode characters by default, but can use custom character sets. Accepts keyboard controls while running. Based on CMatrix.
NVIDIA/nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
github/copilot.vim
Neovim plugin for GitHub Copilot
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
salehiac/LanguageGroundedQD
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
google-research/google-research
Google Research
maohangyu/TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
HzcIrving/DecisionTransformer_StepbyStep
Decision Transformer: A brand new Offline RL Pattern.
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for MuJoCo control tasks in OpenAI Gym
lilianweng/lilianweng.github.io
My personal page
kaichiuwong/rlhps
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
upb-lea/reinforcement_learning_course_materials
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
mimoralea/gdrl
Grokking Deep Reinforcement Learning
dangkhoasdc/awesome-ai-residency
List of AI Residency Programs
rddy/ReQueST
Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
JiawangBian/SC-SfMLearner-Release
Unsupervised Scale-consistent Depth Learning from Video (IJCV2021 & NeurIPS 2019)
yzcjtr/GeoNet
Code for GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose (CVPR 2018)
ricardoGrando/hydrone_deep_rl
BY571/Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel environments.
higgsfield/RL-Adventure
PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL
ricardoGrando/hydrone_aerial_underwater_gazebo
microsoft/AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research