DRL-OM

DRL-OM's Stars

google-research/google-research
Google Research
Language:Jupyter Notebook 35k 751 1.3k8k
datawhalechina/easy-rl
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook 10.4k 79 1492k
aladdinpersson/Machine-Learning-Collection
A resource for learning about Machine learning & Deep Learning
Language:Python 8k 119 1272.7k
weiaicunzai/pytorch-cifar100
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
Language:Python 4.4k 34 821.2k
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python 4.1k 36 35870
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook 3.3k 76 1557
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python 2.5k 27 64469
google-research/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Language:Python 544 12 3875
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python 482 6 2064
aviralkumar2907/CQL
Code for conservative Q-learning
Language:Python 424 5 2270
moon-hotel/TransformerTranslation
A translation task based on the Transformer framework
Language:Python 146 1 1038
JasonYao81000/MLDS2018SPRING
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Language:Python 145 5 1047
awslabs/or-rl-benchmarks
The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'
Language:Python 84 11 012
OptMLGroup/DeepBeerInventory-RL
The code for the SRDQN algorithm to train an agent for the beer game problem
Language:Python 77 4 234
floodsung/a2c_cartpole_pytorch
Advantage actor-critic reinforcement learning for OpenAI Gym CartPole
Language:Python 65 5 312
franrruiz/shopper-src
Code for Shopper, a probabilistic model of shopping baskets
Language:C++ 52 12 732
ErezSC42/qr_forcaster
Our implementation of the paper "A Multi-Horizon Quantile Recurrent Forecaster"
Language:Python 33 1 19
lyeskhalil/CORL
Language:Python 24 1 16
keirp/return_transforms
Language:Python 19 2 14
kesenzhao/DT4Rec
Language:Python 18 2 63
clvrai/agile
Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.
Language:Python 17 2 02
rtm2130/MST
Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees
Language:OpenEdge ABL 16 2 07
antoinedesir/RUMnet
Language:HTML 94
vvmisic/optimalPLD
Code for paper "Exact First-Choice Product Line Optimization" by D. Bertsimas and V. V. Mišić
Language:Julia 7 1 05
LiTrans/reslogit
ResLogit models are a family of machine-learning-based, fully interpretable choice models under development at LiTrans
Language:Jupyter Notebook 6 1 02
YahuiSun/GroupSteinerTree
Finding Group Steiner Trees in Graphs with both Vertex and Edge Weights
Language:C++ 6 1 01
scottemmons/youngs-cql
Conservative Q Learning on top of SAC
Language:Python 5 2 00
jono3030/freelancer-web-database-scraper
Python script using Beautiful Soup 4 to scrape an indexless online database
Language:Python 1 1 00