zhanqiuzhang's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
chroma-core/chroma
the AI-native open-source embedding database
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/trl
Train transformer language models with reinforcement learning.
liguodongiot/llm-action
This project aims to share the technical principles behind large language models, along with hands-on experience.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A Chinese-language getting-started tutorial for LangChain
ymcui/Chinese-LLaMA-Alpaca-2
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
shibing624/text2vec
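text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and is ready to use out of the box.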
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese llama+lora approach, with a structure modeled on alpaca.
CLUEbenchmark/SuperCLUE
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
hkust-nlp/ceval
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
hiyouga/FastEdit
🩹Editing large language models within 10 seconds⚡
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
SinclairCoder/Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
xverse-ai/XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
LowinLi/transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers.
lyogavin/Anima
Moved to https://github.com/lyogavin/airllm