shibei00

Reinforcement Learning, Natural Language Processing

The Chinese University of Hong Kong, Hong Kong

Pinned Repositories

AAAI18-code
The code of AAAI18 paper "Learning Structured Representation for Text Classification via Reinforcement Learning".
Language:Python0 2 00
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Jupyter Notebook0 1 00
annotated-transformer
http://nlp.seas.harvard.edu/2018/04/03/attention.html
Language:Jupyter Notebook0 1 00
apex
A continuous deep reinforcement learning framework for robotics
Language:Jupyter Notebook0 2 00
Cross-Lingual-Topic-Model
A topic model that identifies bilingual topics across unaligned corpora using a dictionary. An implementation of the paper "Detecting Common Discussion Topics Across Culture From News Reader Comments" (Shi et al., ACL 2016).
Language:Roff13 3 00
domain_sentiment_embedding
A model for learning domain-specific and sentiment-aware word embeddings
Language:Python3 3 00
LeetCode1
The first iteration of leetcode
Language:C++1 2 00
MINIST
Models related to the MNIST digit dataset.
1 2 00
RL-Paper-List
A paper list of reinforcement learning for personal interest
2 3 00

shibei00's Repositories

shibei00/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Jupyter Notebook0 1 00
shibei00/apex
A continuous deep reinforcement learning framework for robotics
Language:Jupyter Notebook0 2 00
shibei00/awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
Language:Python2 0
shibei00/awesome-deep-rl
For deep RL and the future of AI.
Language:HTML2 0
shibei00/CharacterEval
Language:Python0 0
shibei00/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python0 0
shibei00/DeepLearning-500-questions
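Deep Learning 500 Questions: a question-and-answer treatment of frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, intended to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. The author's knowledge is limited, so readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06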
Language:JavaScript1 0
shibei00/DeepRL_Algorithms
Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Language:Python1 0
shibei00/DouDiZhu
Language:Python1 0
shibei00/emacs-document
Translates Emacs documents into Chinese for convenient reference
Language:Shell1 0
shibei00/football
Check out the new game server:
Language:Python2 0
shibei00/hok_env
Language:Python1 0
shibei00/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Language:C++2 0
shibei00/Learn-Vim
A book for learning the Vim editor the smart way.
1 0
shibei00/llama
Inference code for Llama models
shibei00/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
shibei00/oi-slides
My lecture slides for informatics olympiad (competitive programming) training
Language:TeX1 0
shibei00/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++2 0
shibei00/poker-cfrm
An NLTH Poker Agent using Counterfactual Regret Minimization
Language:C++1 0
shibei00/procgen
Procgen Benchmark: Procedurally Generated Game-Like Gym Environments
Language:C++2 0
shibei00/python-mode
Vim python-mode. PyLint, Rope, Pydoc, breakpoints from box.
Language:Vim Script1 0
shibei00/resume
LaTeX source for a personal Chinese-language résumé https://hijiangtao.github.io/
shibei00/rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Language:Python2 0
shibei00/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Language:Python1 0
shibei00/Seq2seqChatbots
A wrapper around tensor2tensor to flexibly train, interact, and generate data for neural chatbots.
Language:Python2 0
shibei00/tetris_mcts
MCTS project for Tetris
Language:Python1 0
shibei00/text-to-text-transfer-transformer
Language:Python2 0
shibei00/tleague_projpage
Language:HTML1 0
shibei00/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
Language:Python0 0
shibei00/trfl
TensorFlow Reinforcement Learning
Language:Python1 0