dranaju

dranaju's Stars

FrankZheng2022/TACO
Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"
Language:Python173
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook13.9k1.2k
riiswa/kanrl
Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments
Language:Python23628
PKU-RL/Plan4MC
Reinforcement learning and planning for Minecraft.
Language:Python14518
will8211/unimatrix
Python script to simulate the display from "The Matrix" in terminal. Uses half-width katakana unicode characters by default, but can use custom character sets. Accepts keyboard controls while running. Based on CMatrix.
Language:Python1.7k156
NVIDIA/nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
Language:Go2k219
github/copilot.vim
Neovim plugin for GitHub Copilot
Language:Vim Script8k279
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python35k5.4k
salehiac/LanguageGroundedQD
Language:Python61
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python44561
google-research/google-research
Google Research
Language:Jupyter Notebook33.5k7.8k
maohangyu/TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
Language:Python524
HzcIrving/DecisionTransformer_StepbyStep
Decision Transformer: A brand new Offline RL Pattern.
Language:Python311
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
Language:Python23724
lilianweng/lilianweng.github.io
My personal page
Language:HTML40876
kaichiuwong/rlhps
Language:Python83
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
Language:Jupyter Notebook1.9k448
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python13.4k4.8k
upb-lea/reinforcement_learning_course_materials
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
Language:Jupyter Notebook920210
mimoralea/gdrl
Grokking Deep Reinforcement Learning
Language:Jupyter Notebook773221
dangkhoasdc/awesome-ai-residency
List of AI Residency Programs
3k270
rddy/ReQueST
Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"
Language:Python834
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook3.3k555
JiawangBian/SC-SfMLearner-Release
Unsupervised Scale-consistent Depth Learning from Video (IJCV2021 & NeurIPS 2019)
Language:Python723149
yzcjtr/GeoNet
Code for GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose (CVPR 2018)
Language:Python717183
ricardoGrando/hydrone_deep_rl
Language:Python1
BY571/Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
Language:Python25832
higgsfield/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
Language:Jupyter Notebook3k587
ricardoGrando/hydrone_aerial_underwater_gazebo
Language:Python3
microsoft/AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
Language:C++16.1k4.5k