Gingersna's Stars
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
stas00/ml-engineering
Machine Learning Engineering Open Book
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
weaviate/Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
pathwaycom/llm-app
Dynamic RAG for enterprise. Ready to run with Docker,⚡in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
karpathy/ng-video-lecture
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
junruxiong/IncarnaMind
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
onlyphantom/llm-python
Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone
gabriben/awesome-generative-information-retrieval
predibase/llm_distillation_playbook
Best practices for distilling large language models.
itsnamgyu/reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
AI21Labs/in-context-ralm
WangRongsheng/Aurora
🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model's Chinese open domain.
RCGAI/SimplyRetrieve
Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. For Retrieval-Centric & Retrieval-Augmented Generation.
Hannibal046/SelfMemory
[Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
Shark-NLP/self-adaptive-ICL
self-adaptive in-context learning
xinyadu/RGQA