hiyouga's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
nlpxucan/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
PKU-YuanGroup/ChatLaw
ChatLaw: A Powerful LLM Tailored for the Chinese Legal Domain
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
enricoros/big-AGI
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
salesforce/CodeGen
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
PanQiWei/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
qwopqwop200/GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
srush/MiniChain
A tiny library for coding with large language models.
OpenLMLab/LOMO
LOMO: LOw-Memory Optimization
wangyuxinwhy/uniem
unified embedding model
phohenecker/switch-cuda
A simple bash script for switching between installed versions of CUDA.
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
ydli-ai/CSL
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
CLUEbenchmark/pCLUE
pCLUE: a 1,000,000+ example multi-task prompt learning dataset
tomwojcik/starlette-context
Middleware for Starlette that allows you to store and access the context data of a request. Can be used with logging so logs automatically use request headers such as x-request-id or x-correlation-id.
HugAILab/HugNLP
CIKM 2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformers. Hug NLP now! 😊
bojone/NBCE
Naive Bayes-based Context Extension
ictnlp/BayLing
BayLing (百聆) is an English/Chinese LLM based on LLaMA and equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction. It reaches 90% of ChatGPT's performance on several multilingual and general-task evaluations.
neukg/TechGPT
TechGPT: Technology-Oriented Generative Pretrained Transformer