zwfightzw's Stars
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
ciwang/policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
katerakelly/pytorch-maml
PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400
cbfinn/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration and a machine learning framework designed for training models with billions to trillions of parameters
aravindr93/mjrl
Reinforcement learning algorithms for MuJoCo tasks
floodsung/meta-critic-networks
PyTorch code for arXiv paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning
google-parfait/tensorflow-federated
An open-source framework for machine learning and other computations on decentralized data.
Kaixhin/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
liyiying/meta-MADDPG
meta-MADDPG (Python implementation)
zwfightzw/MLM
karpathy/paper-notes
Random notes on papers, likely a short-term repo.