JinXuekun's Stars
twitter/the-algorithm-ml
Source code for Twitter's Recommendation Algorithm
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
scutan90/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
IBM/ZOSVRG-BlackBox-Adv
ZOSVRG-BlackBox-Adv
Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
rdturnermtl/bbo_challenge_starter_kit
Starter kit for the black box optimization challenge at Neurips 2020
Kautenja/gym-super-mario-bros
An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES
km1994/RES-Interview-Notes
该仓库主要记录 推荐系统 算法工程师相关的面试题
eyounx/RetroCodes
Codes of our team for the OpenAI Retro Contest of reinforcement learning
kitian616/jekyll-TeXt-theme
💎 🐳 A super customizable Jekyll theme for personal site, team site, blog, project, documentation, etc.
Ceruleanacg/Learning-Notes
💡 Repo of learning notes in DRL and DL, theory, codes, models and notes maybe.
x35f/unstable_baselines
Re-implementations of SOTA RL algorithms.
google-research/deep_ope
aviralkumar2907/CQL
Code for conservative Q-learning
xionghuichen/RLAssistant
RLA is a tool for managing your RL experiments automatically
tianheyu927/mopo
Code for MOPO: Model-based Offline Policy Optimization
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
sfujim/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
zhangchuheng123/Reinforcement-Implementation
Implementation of benchmark RL algorithms
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
oxwhirl/comix
xuehy/pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
jiangsy/LAMDA-Beamer-Template
A beamer template for LAMDA lab at NJU
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
neheller/TensorFlow-PCA
An implementation of principle component analysis using TensorFlow's singular value decomposition