JinXuekun

Nanjing University, Nanjing, China

JinXuekun's Stars

twitter/the-algorithm-ml
Source code for Twitter's Recommendation Algorithm
Language: Python, 10.2k stars, 2.2k forks
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Language: Scala, 62.6k stars, 12.2k forks
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook, 18.7k stars, 2.2k forks
scutan90/DeepLearning-500-questions
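500 Questions on Deep Learning: a question-and-answer treatment of frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, written to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06
Language: JavaScript, 55.1k stars, 15.9k forks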
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
Language:Python29434
IBM/ZOSVRG-BlackBox-Adv
ZOSVRG-BlackBox-Adv
Language:Python1110
Lizhi-sjtu/DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
Language: Python, 1.1k stars, 182 forks
rdturnermtl/bbo_challenge_starter_kit
Starter kit for the black box optimization challenge at NeurIPS 2020
Language:Python11328
Kautenja/gym-super-mario-bros
An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES
Language:Python697142
km1994/RES-Interview-Notes
This repository mainly collects interview questions for recommender-system algorithm engineers
52386
eyounx/RetroCodes
Codes of our team for the OpenAI Retro Contest of reinforcement learning
Language:Python9925
kitian616/jekyll-TeXt-theme
💎 🐳 A super customizable Jekyll theme for personal site, team site, blog, project, documentation, etc.
Language: SCSS, 3.2k stars, 2.6k forks
Ceruleanacg/Learning-Notes
💡 Repo of learning notes in DRL and DL, theory, codes, models and notes maybe.
Language:Jupyter Notebook10019
x35f/unstable_baselines
Re-implementations of SOTA RL algorithms.
Language:Python13012
google-research/deep_ope
Language:Jupyter Notebook859
aviralkumar2907/CQL
Code for conservative Q-learning
Language:Python41771
xionghuichen/RLAssistant
RLA is a tool for managing your RL experiments automatically
Language:Jupyter Notebook716
tianheyu927/mopo
Code for MOPO: Model-based Offline Policy Optimization
Language:Python17243
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language: Python, 1.4k stars, 287 forks
sfujim/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python600140
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
94887
zhangchuheng123/Reinforcement-Implementation
Implementation of benchmark RL algorithms
Language:Python46282
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python, 3.8k stars, 854 forks
oxwhirl/comix
Language:Python4212
xuehy/pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Language:Python621122
hsvgbkhgbv/SQDDPG
This is a framework for research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
Language:Python11642
jiangsy/LAMDA-Beamer-Template
A beamer template for LAMDA lab at NJU
Language:TeX149
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python, 15.9k stars, 4.9k forks
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
Language: Python, 3.3k stars, 818 forks
neheller/TensorFlow-PCA
An implementation of principal component analysis using TensorFlow's singular value decomposition
Language:Python94
