Chen-Wang-CUHK

Hong Kong

Chen-Wang-CUHK's Stars

meta-llama/llama
Inference code for Llama models
Language: Python 56.6k 526 1k 9.6k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 37.1k 353 1.8k 4.6k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python 35.6k 345 2.8k 4.1k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python 35.1k 213 5.4k 4.3k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python 30.9k 251 5.4k 4.7k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python 29.6k 342 2684.1k
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Language: Python 15.7k 132 6151.9k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python 14.4k 120 1.1k 1.4k
triton-lang/triton
Development repository for the Triton language and compiler
Language: C++ 13.5k 195 1.5k 1.7k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python 10.7k 163 7882.4k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language: Python 8.7k 77 567621
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language: Python 4.1k 41 395297
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT
Language: Python 3.7k 32 374473
esbatmop/MNBVC
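MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.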
3.5k 65 55247
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language: Python 2.6k 12 173275
allenai/longformer
Longformer: The Long-Document Transformer
Language: Python 2.1k 42 228276
LC1332/Chat-Haruhi-Suzumiya
Chat-Haruhi-Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
Language: Jupyter Notebook 1.8k 17 62164
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
Language: Python 1.6k 42 21103
zhenbench/z-bench
Z-Bench 1.0 by Zhenfund (真格基金): a Chinese test set for large language models, written for "muggles". Z-Bench is an LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team at Zhenfund.
480 9 842
CLUEbenchmark/pCLUE
pCLUE: a multi-task prompt learning dataset with 1,000,000+ examples
Language: Jupyter Notebook 471 7 956
songhaoyu/BoB
The released codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'
Language: Python 136 2 1924
nuochenpku/Harry-Potter-Dialogue-Dataset
[EMNLP 2023] This is the repository of the Harry Potter Dialogue Dataset.
118 1 54
mutonix/RefGPT
Language: Python 93 2 36
MikeGu721/XiezhiBenchmark
Language: Python 91 1 94
billvsme/my_openai_api
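Deploy your own OpenAI API 🤩, based on flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU). Implements the OpenAI Chat, Models, and Completions endpoints, including streaming responses.
Language: Python 84 2 68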
morecry/CharacterChat
repository for CharacterChat, a personalized social support system
Language: Python 63 3 47
jackaduma/Alpaca-LoRA-RLHF-PyTorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
Language: Python 56 5 16
launchnlp/BOLT
Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".
Language: Python 19 3 32
agi-templar/LaMer
A slick text style transfer framework.
Language: Python 8 2 41
yangkexin/Tailor
Language: Python 4 1 140
