sandrawing
MS in Data Science @ Harvard University, B. Econ in Finance, Minor in Mathematics @ Nankai University
sandrawing's Stars
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
jindongwang/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
google-research/simclr
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
ieee8023/covid-chestxray-dataset
We are building an open database of COVID-19 cases with chest X-ray or CT images.
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
openai/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
lucidrains/mixture-of-experts
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
acl-org/aclpubcheck
Tools for checking ACL paper submissions
shariqiqbal2810/maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
starry-sky6688/MADDPG
Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
aravindr93/mjrl
Reinforcement learning algorithms for MuJoCo tasks
brucechou1983/CheXNet-Keras
This project is a tool to build CheXNet-like models, written in Keras.
aravindr93/hand_dapg
Repository to accompany RSS 2018 paper on dexterous hand manipulation
chijames/Poly-Encoder
semitable/lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
richardrl/rlkit-relational
Codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"
fasrc/User_Codes
Bigpig4396/PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
matteokarldonati/Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
UBS-IB/bayesian_tree
johnny12150/GC-SAN
An implementation trying to reproduce "Graph contextualized self-attention network for session-based recommendation" based on SR-GNN code.
AmazaspShumik/Mixture-Models
Hierarchical Mixture of Experts,Mixture Density Neural Network
taochenshh/easyrl
A collection of reinforcement learning algorithms.
Steven-Ho/coma
Multi-agent algorithm based on counterfactual multi-agent policy gradients
Gialbo/COVID-Chest-X-Rays-Deep-Learning-analysis
Comparison and Analysis of different Deep Learning techniques for the COVID-19 Chest X-Rays dataset
quantumsnowball/toy-datasets-collections
A toy datasets collections for machine learning research quick reference