shenzebang

shenzebang's Stars

dmlc/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Language: Python, 13.6k stars, 3k forks
DartML/Stein-Variational-Gradient-Descent
code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
Language:Python9540
mariogeiger/hessian
hessian in pytorch
Language:Python18617
behaviorguidedRL/BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
Language:Jupyter Notebook246
YiifeiWang/Optimal-Transport
Group project "Algorithms for large-scale optimal transport". Implement ADMMs and Sinkhorn's Algorithms.
Language:TeX114
zuoxingdong/mazelab
A customizable framework to create maze and gridworld environments
Language:Python26059
PythonOT/POT
POT : Python Optimal Transport
Language: Python, 2.5k stars, 505 forks
ciwang/policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
Language:Python216
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language: Python, 30.9k stars, 2.8k forks
jik0730/MAML-in-pytorch
Neat and flexible implementation of MAML in pytorch: https://arxiv.org/abs/1703.03400
Language:Python598
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language: Python, 34.6k stars, 5.9k forks
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python, 3.7k stars, 829 forks
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language:Python43591
Alfredvc/paac
Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
Language:Python20658
astooke/rlpyt
Reinforcement Learning in PyTorch
Language: Python, 2.2k stars, 326 forks
jingweiz/pytorch-rl
Deep Reinforcement Learning with pytorch & visdom
Language:Python798143
d2l-ai/d2l-zh
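Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
Language: Python, 64.7k stars, 11.1k forks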
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Language: Python, 24.4k stars, 4.4k forks
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language: Python, 4k stars, 859 forks
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language: Python, 5.7k stars, 1.2k forks
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language: Python, 3.9k stars, 679 forks
tristandeleu/pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
Language: Python, 2k stars, 256 forks
katerakelly/pytorch-maml
PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400
Language:Jupyter Notebook555127
udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language: Jupyter Notebook, 5k stars, 2.3k forks
higgsfield/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
Language: Jupyter Notebook, 3k stars, 588 forks
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python, 1.1k stars, 189 forks
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python, 15.9k stars, 4.9k forks
fKunstner/limitations-empirical-fisher
Limitations of the Empirical Fisher Approximation
Language:Python466
AtomicVar/NG-MAML
MAML with Natural Gradient Adaptation
Language:Python2
thiagopbueno/gradient-estimators-in-stochastic-computation-graphs
Gradient Estimation in Stochastic Computation Graphs using TensorFlow.
Language:Jupyter Notebook172