zawnpn
Ph.D. Candidate, School of Computer Science, Peking University.
Peking University, Beijing, China
zawnpn's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
github/gitignore
A collection of useful .gitignore templates
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
chinese-poetry/chinese-poetry
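The most comprehensive database of Chinese poetry 🧶: nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci lyrics by 1,564 lyricists of the Song period.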
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
chenfei-wu/TaskMatrix
coolsnowwolf/lede
Lean's LEDE source
wenyan-lang/wenyan
A programming language in Classical Chinese.
b3log/baidu-netdisk-downloaderx
⚡️ A GUI downloader for Baidu Netdisk with no speed limit, supporting Windows, Linux, and Mac.
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
antimatter15/alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
wangyu-/udp2raw
A tunnel that turns UDP traffic into encrypted UDP/FakeTCP/ICMP traffic using raw sockets, helping you bypass UDP firewalls (or unstable UDP environments)
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
mshumer/gpt-llm-trainer
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
facebookresearch/mbrl-lib
Library for Model Based RL
iffiX/machin
Reinforcement learning library (framework) designed for PyTorch; implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
danijar/crafter
Benchmarking the Spectrum of Agent Capabilities
zawnpn/ZHANGWP
My Blog (https://www.zhangwp.com).
YangRui2015/Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
zawnpn/RL_RunFast
An AI framework for card games based on the DQN algorithm
PKU-RL/AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
zawnpn/Markdown_Toolkit
Markdown compilation tool / Simple toolkit for Markdown
PKU-RL/COREP
Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024)
PKU-RL/EnDi
rumusan/PRML-mindmap
PRML