ScottWang96

Shanghai JiaoTong University, Shanghai, P. R. China

ScottWang96's Stars

nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Language: C++ 69.6k 643 1.9k 7.6k
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Language: Python 40.4k 394 1.3k 5.2k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 36.5k 348 1.8k 4.5k
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language: Python 31.4k 198 4.9k 3.9k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python 29.4k 339 2674k
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook 18.6k 153 4692.2k
HqWu-HITCS/Awesome-Chinese-LLM
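A curated collection of open-source Chinese large language models, focusing on smaller-scale models that can be deployed privately and are inexpensive to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.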
15.1k 192 241.4k
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Language: Python 8.2k 72 407820
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language: Python 6k 74 5341k
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language: Jupyter Notebook 4.2k 110 161351
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
Language: HTML 4.2k 43 34299
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook 2.5k 37 34126
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language: Python 2.1k 29 138149
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python 2.1k 21 249205
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python 2k 19 81165
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Language: Python 1.6k 15 8174
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python 1.3k 17 84119
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language: Python 1.3k 34 5297
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
Language: Python 1.2k 50 236151
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python 1.2k 24 15163
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language: Python 983 17 6177
microsoft/ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
Language: Python 939 18 2769
thu-coai/Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs.
850 7 2180
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as ChatGPT. Gathers a wide variety of instruction datasets for training ChatLLM models.
499 5 023
XueFuzhao/InstructionWild
451 9 641
suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
415 3 924
InternLM/InternLM-Math
State-of-the-art bilingual open-sourced Math reasoning LLMs.
Language: Python 414 8 2325
raunak-agarwal/instruction-datasets
All available datasets for Instruction Tuning of Large Language Models
230 7 011
vwxyzjn/summarize_from_feedback_details
Language: Python 100 4 011
Ckins/Dict-of-Sensitive-Words
20
