shibei00
Reinforcement Learning, Natural Languag Processing
The Chinese University of Hong KongHong Kong
Pinned Repositories
AAAI18-code
The code of AAAI18 paper "Learning Structured Representation for Text Classification via Reinforcement Learning".
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
apex
A continuous deep reinforcement learning framework for robotics
Cross-Lingual-Topic-Model
A topic model which can identify bilingual topics across unaligned corpus using dictionary. An implementation of the paper "Detecting Common Discussion Topics Across Culture From News Reader Comments (Shi et al., ACL2016)"
domain_sentiment_embedding
a model to learn domain-specific and sentiment-aware word embeddings
LeetCode1
The first iteration of leetcode
MINIST
the models related to MINIST digit data set.
RL-Paper-List
A paper list of reinforcement learning for personal interest
shibei00's Repositories
shibei00/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
shibei00/apex
A continuous deep reinforcement learning framework for robotics
shibei00/awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
shibei00/awesome-deep-rl
For deep RL and the future of AI.
shibei00/CharacterEval
shibei00/ColossalAI
Making large AI models cheaper, faster and more accessible
shibei00/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
shibei00/DeepRL_Algorithms
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
shibei00/DouDiZhu
shibei00/emacs-document
translate emacs documents to Chinese for convenient reference
shibei00/football
Check out the new game server:
shibei00/hok_env
shibei00/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
shibei00/Learn-Vim
A book for learning the Vim editor the smart way.
shibei00/llama
Inference code for Llama models
shibei00/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
shibei00/oi-slides
我的信息学竞赛讲课课件
shibei00/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
shibei00/poker-cfrm
A NLTH Poker Agent using Counterfactual Regret Minimization
shibei00/procgen
Procgen Benchmark: Procedurally Generated Game-Like Gym Environments
shibei00/python-mode
Vim python-mode. PyLint, Rope, Pydoc, breakpoints from box.
shibei00/resume
个人中文简历 Latex 源码 https://hijiangtao.github.io/
shibei00/rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
shibei00/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
shibei00/Seq2seqChatbots
A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.
shibei00/tetris_mcts
MCTS project for Tetris
shibei00/text-to-text-transfer-transformer
shibei00/tleague_projpage
shibei00/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
shibei00/trfl
TensorFlow Reinforcement Learning