bakanaouji's Stars
google-research/google-research
Google Research
optuna/optuna
A hyperparameter optimization framework
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
JetBrains/ideavim
IdeaVim – A Vim engine for JetBrains IDEs
googlecreativelab/quickdraw-dataset
Documentation on how to access and use the Quick, Draw! Dataset.
blei-lab/edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
python/typeshed
Collection of library stubs for Python, with static types
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
IntelLabs/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
openai/roboschool
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.
kevinhughes27/TensorKart
self-driving MarioKart with TensorFlow
dfm/emcee
The Python ensemble sampling toolkit for affine-invariant MCMC
CMA-ES/pycma
Python implementation of CMA-ES
openai/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
casperdcl/git-fame
:star: Pretty-print `git` repository collaborators sorted by contributions
masa-su/pixyz
A library for developing deep generative models in a more concise, intuitive and extendable way
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
icoxfog417/baby-steps-of-rl-ja
Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード
mdabros/SharpLearning
Machine learning for C# .Net
openai/EPG
Code for the paper "Evolved Policy Gradients"
nicknlsn/MarioKart64NEAT
NEAT implementation in Lua for Mario Kart 64 and the BizHawk emulator
jkomiyama/banditlib
Multi-armed bandit simulation library
giuse/DNE
A set of neuroevolution experiments with/towards deep networks
Unity-Technologies/obstacle-tower-challenge
Starter Kit for the Unity Obstacle Tower challenge
ili3p/HORD
Efficient Hyperparameter Optimization of Deep Learning Algorithms Using Deterministic RBF Surrogates
machinalis/mypy-data
mypy typesheds for the Python data stack
automl/HPOlib1.5
ppocma/ppocma
wjaskowski/gecco-2015-sztetris
Code repository for GECCO 2015 paper: "N-Tuple Network for Knowledge-Free Reinforcement Learning in High Dimensions: A Case Study in SZ-Tetris"