zawnpn

Ph.D. Candidate, School of Computer Science, Peking University.

Peking UniversityBeijing, China

zawnpn's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python166k 1.6k 2.6k44.1k
github/gitignore
A collection of useful .gitignore templates
161k 3.4k 083.2k
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Language:Go84.4k 1.6k 3.5k13.2k
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。
Language:JavaScript47.9k 1.2k 2039.6k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.4k 349 1.8k4.5k
chenfei-wu/TaskMatrix
Language:Python34.5k 300 3523.3k
coolsnowwolf/lede
Lean's LEDE source
Language:C29.5k 747 8.6k19.5k
wenyan-lang/wenyan
文言文編程語言 A programming language for the ancient Chinese.
Language:TypeScript19.6k 249 4961.1k
b3log/baidu-netdisk-downloaderx
⚡️ 一款图形界面的百度网盘不限速下载器，支持 Windows、Linux 和 Mac。
Language:JavaScript17.1k3.2k
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python13.5k 556 994.8k
antimatter15/alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
Language:C10.3k 102 187912
wangyu-/udp2raw
A Tunnel which Turns UDP Traffic into Encrypted UDP/FakeTCP/ICMP Traffic by using Raw Socket,helps you Bypass UDP FireWalls(or Unstable UDP Environment)
Language:C++7.1k 221 4781.2k
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
Language:Jupyter Notebook4.9k 88 646587
mshumer/gpt-llm-trainer
Language:Jupyter Notebook3.9k 69 22503
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
Language:Python2.5k 61 131550
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k 25 46107
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language:Python1.5k 60 31181
facebookresearch/mbrl-lib
Library for Model Based RL
Language:Python953 25 67154
iffiX/machin
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
Language:Python397 4 1851
danijar/crafter
Benchmarking the Spectrum of Agent Capabilities
Language:Python373 9 2161
zawnpn/ZHANGWP
My Blog (https://www.zhangwp.com).
30 3 04
YangRui2015/Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
Language:Python15 3 22
zawnpn/RL_RunFast
一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm
Language:Python10 2 01
PKU-RL/AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
Language:Python90
zawnpn/Markdown_Toolkit
Markdown 编译工具 / Simple toolkit for Markdown
Language:TeX5 2 01
PKU-RL/COREP
Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024)
Language:Python4 2 00
PKU-RL/EnDi
Language:Python3 0 11
rumusan/PRML-mindmap
PRML
2 2 00

zawnpn

zawnpn's Stars

Significant-Gravitas/AutoGPT

github/gitignore

fatedier/frp

chinese-poetry/chinese-poetry

lm-sys/FastChat

chenfei-wu/TaskMatrix

coolsnowwolf/lede

wenyan-lang/wenyan

b3log/baidu-netdisk-downloaderx

ShangtongZhang/reinforcement-learning-an-introduction

antimatter15/alpaca.cpp

wangyu-/udp2raw

probml/pml-book

mshumer/gpt-llm-trainer

rail-berkeley/rlkit

facebookresearch/chameleon

google-deepmind/bsuite

facebookresearch/mbrl-lib

iffiX/machin

danijar/crafter

zawnpn/ZHANGWP

YangRui2015/Modular_HER

zawnpn/RL_RunFast

PKU-RL/AdaRefiner

zawnpn/Markdown_Toolkit

PKU-RL/COREP

PKU-RL/EnDi

rumusan/PRML-mindmap