DRL-OM's Stars
google-research/google-research
Google Research
datawhalechina/easy-rl
A reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄); read it online at https://datawhalechina.github.io/easy-rl/
aladdinpersson/Machine-Learning-Collection
A resource for learning about Machine learning & Deep Learning
weiaicunzai/pytorch-cifar100
Practice on cifar100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
google-research/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
aviralkumar2907/CQL
Code for conservative Q-learning
moon-hotel/TransformerTranslation
A translation task built on the Transformer framework
JasonYao81000/MLDS2018SPRING
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
awslabs/or-rl-benchmarks
The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'
OptMLGroup/DeepBeerInventory-RL
The code for the SRDQN algorithm to train an agent for the beer game problem
floodsung/a2c_cartpole_pytorch
Advantage actor-critic reinforcement learning for OpenAI Gym CartPole
franrruiz/shopper-src
Code for Shopper, a probabilistic model of shopping baskets
ErezSC42/qr_forcaster
Our implementation of the paper "A Multi-Horizon Quantile Recurrent Forecaster"
lyeskhalil/CORL
keirp/return_transforms
kesenzhao/DT4Rec
clvrai/agile
Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.
rtm2130/MST
Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees
antoinedesir/RUMnet
vvmisic/optimalPLD
Code for the paper "Exact First-Choice Product Line Optimization" by D. Bertsimas and V. V. Mišić
LiTrans/reslogit
ResLogit models are a family of machine-learning-based, fully interpretable choice models under development at LiTrans
YahuiSun/GroupSteinerTree
Finding Group Steiner Trees in Graphs with both Vertex and Edge Weights
scottemmons/youngs-cql
Conservative Q Learning on top of SAC
jono3030/freelancer-web-database-scraper
Python script that uses Beautiful Soup 4 to scrape an indexless online database