houxiao's Stars
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
py-why/EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
AirtestProject/Poco
A cross-engine test automation framework based on UI inspection
starwing/lua-protobuf
A Lua module to work with Google protobuf
takuseno/d3rlpy
An offline deep reinforcement learning library
williamFalcon/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
huawei-noah/trustworthyAI
Trustworthy AI related projects
mpx/lua-cjson
Lua CJSON is a fast JSON encoding/parsing module for Lua
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
DeepReinforcementLearning/DeepReinforcementLearningInAction
Code from the Deep Reinforcement Learning in Action book from Manning, Inc
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
google-research/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
miyosuda/unreal
Reinforcement learning with unsupervised auxiliary tasks
mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
bupticybee/elephantfish
elephantfish: 一个只有124行的**象棋引擎
Farama-Foundation/D4RL-Evaluations
bennylp/RL-Taxonomy
Loose taxonomy of reinforcement learning algorithms
bbitmaster/ale_python_interface
A Python Interface for the Arcade Learning Environment (Shared Object)
Mawiszus/TOAD-GAN
Official repository for "TOAD-GAN: Coherent Style Level Generation from a Single Example" by Maren Awiszus, Frederik Schubert and Bodo Rosenhahn.
Yunhui1998/Reinforcement_learning_tutorial
Share notes on learning reinforcement learing
sauxpa/neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
kngwyu/rogue-gym
[WIP] Highly customizable rogue-like game for AI expmeriments
banditml/banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
DerwenAI/rllib_tutorials
RLlib tutorials
bytedance/raylink
Framework to build and train RL algorithms
rogueinabox/rogueinabox
A python machine learning environment for rogue.
bytedance/pyskynet
PySkynet is a library for using skynet in python.
jaimeyzzz/impala_horovod_gym