jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
PythonMIT
Stargazers
- AIchenkai
- cao1184481543
- DaMoLi2020
- gimmy49699Nanjing
- GUORUIWANG
- imaxwel
- jackadumaChocolate Factory
- jet-yangqs
- Kaleido0
- kbakdevPoland
- keain
- laurencecwj
- liuyijiang1994WHU
- ljw23
- mishidemudongchengdu
- mqx0465
- neotype
- ouwei2013Japan Advanced Institute of Science and Technology
- qiguanqiangNone yet
- RainGather
- ray075hlNWPU
- reborm
- sixeightw0lf
- skywatcher0
- Trangle
- ufwt
- we1l1n
- wwfcnu
- XiaofengZHOU
- xiaoyichaoBeijing,China
- xillig
- yc-huang
- yiranvang
- Zarc98
- ze00roPeter Lab
- zengwanning