houxiao

尽人事 Do the Best

levelup.ai · Beijing, China

houxiao's Stars

thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language: Python 7.6k 93 7191.1k
py-why/EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
Language: Jupyter Notebook 3.6k 76 553690
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python 3.5k 52 247818
Zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
Language: Jupyter Notebook 2.1k 123 82479
AirtestProject/Poco
A cross-engine test automation framework based on UI inspection
Language: Python 1.7k 69 562308
starwing/lua-protobuf
A Lua module to work with Google protobuf
Language: Lua 1.7k 92 214385
takuseno/d3rlpy
An offline deep reinforcement learning library
Language: Python 1.3k 27 323227
williamFalcon/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
1.1k 51 1127
huawei-noah/trustworthyAI
Trustworthy AI related projects
Language: Python 922 21 99211
mpx/lua-cjson
Lua CJSON is a fast JSON encoding/parsing module for Lua
Language: C 910 60 49468
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
Language: Python 735 17 37114
DeepReinforcementLearning/DeepReinforcementLearningInAction
Code from the Deep Reinforcement Learning in Action book from Manning, Inc
Language: Jupyter Notebook 703 22 26302
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
Language: Python 677 51 40169
google-research/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Language: Python 515 13 3873
miyosuda/unreal
Reinforcement learning with unsupervised auxiliary tasks
Language: Python 415 34 29131
mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language: Python 301 11 966
bupticybee/elephantfish
elephantfish: a xiangqi (Chinese chess) engine in only 124 lines
Language: Python 226 7 244
Farama-Foundation/D4RL-Evaluations
Language: Python 186 14 1527
bennylp/RL-Taxonomy
Loose taxonomy of reinforcement learning algorithms
Language: Python 149 7 012
bbitmaster/ale_python_interface
A Python Interface for the Arcade Learning Environment (Shared Object)
Language: Python 125 13 931
Mawiszus/TOAD-GAN
Official repository for "TOAD-GAN: Coherent Style Level Generation from a Single Example" by Maren Awiszus, Frederik Schubert and Bodo Rosenhahn.
Language: Python 105 3 114
Yunhui1998/Reinforcement_learning_tutorial
Shared notes on learning reinforcement learning
10111
sauxpa/neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
Language: Jupyter Notebook 86 3 324
kngwyu/rogue-gym
[WIP] Highly customizable rogue-like game for AI experiments
Language: Rust 82 6 39
banditml/banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Language: Python 64 5 310
DerwenAI/rllib_tutorials
RLlib tutorials
Language: Jupyter Notebook 62 4 18
bytedance/raylink
Framework to build and train RL algorithms
Language: Python 36 6 04
rogueinabox/rogueinabox
A Python machine learning environment for Rogue.
Language: Python 30 3 010
bytedance/pyskynet
PySkynet is a library for using skynet in Python.
Language: C++ 18 5 04
jaimeyzzz/impala_horovod_gym
Language: Python 10 4 42
