ScottWang96's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
HqWu-HITCS/Awesome-Chinese-LLM
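A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and trained at low cost, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.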
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
lyogavin/airllm
AirLLM: 70B inference with a single 4GB GPU
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
OpenLMLab/MOSS-RLHF
MOSS-RLHF
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
microsoft/ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
thu-coai/Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs.
jianzhnie/awesome-instruction-datasets
A collection of prompt and instruction datasets for training chat LLMs such as ChatGPT.
XueFuzhao/InstructionWild
suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
InternLM/InternLM-Math
State-of-the-art bilingual open-sourced Math reasoning LLMs.
raunak-agarwal/instruction-datasets
All available datasets for Instruction Tuning of Large Language Models
vwxyzjn/summarize_from_feedback_details
Ckins/Dict-of-Sensitive-Words