zwfightzw

zwfightzw's Stars

sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Python1.7k433
ciwang/policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
Language:Python216
katerakelly/pytorch-maml
PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400
Language:Jupyter Notebook553129
cbfinn/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python618180
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook3.3k555
aravindr93/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language:Python35498
floodsung/meta-critic-networks
Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning
Language:Python5621
google-parfait/tensorflow-federated
An open-source framework for machine learning and other computations on decentralized data.
Language:Python2.3k583
Kaixhin/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Language:Python1.6k282
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Language:Python22.7k6.3k
liyiying/meta-MADDPG
meta-MADDPG (Python implementation)
Language:Python173
zwfightzw/MLM
Language:Python1
karpathy/paper-notes
Random notes on papers, likely a short-term repo.
66882