awesome-LLM-resourses: A repository from betashepherd

全世界最好的中文大语言模型资源汇总持续更新

微调

LLaMA-Factory: Unify Efficient Fine-Tuning of 100+ LLMs.
unsloth: 2-5X faster 80% less memory LLM finetuning.
TRL: Transformer Reinforcement Learning.
Firefly: Firefly: 大模型训练工具，支持训练数十种大模型
Xtuner: An efficient, flexible and full-featured toolkit for fine-tuning large models.
torchtune: A Native-PyTorch Library for LLM Fine-tuning.
Swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs.
AutoTrain: A new way to automatically train, evaluate and deploy state-of-the-art Machine Learning models.
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO).

推理

ollama: Get up and running with Llama 3, Mistral, Gemma, and other large language models.
Open WebUI: User-friendly WebUI for LLMs (Formerly Ollama WebUI).
Text Generation WebUI: A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Xinference: A powerful and versatile library designed to serve language, speech recognition, and multimodal models.
LangChain: Build context-aware reasoning applications.
LlamaIndex: A data framework for your LLM applications.
lobe-chat: an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers, Multi-Modals (Vision/TTS) and plugin system.
TensorRT-LLM: TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
vllm: A high-throughput and memory-efficient inference and serving engine for LLMs.
LlamaChat: Chat with your favourite LLaMA models in a native macOS app.
NVIDIA ChatRTX: ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, or other data.
LM Studio: Discover, download, and run local LLMs.
chat-with-mlx: Chat with your data natively on Apple Silicon using MLX Framework.
LLM Pricing: Quickly Find the Perfect Large Language Models (LLM) API for Your Budget! Use Our Free Tool for Instant Access to the Latest Prices from Top Providers.
Open Interpreter: A natural language interface for computers.
Chat-ollama: An open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.
chat-ui: Open source codebase powering the HuggingChat app.
MemGPT: Create LLM agents with long-term memory and custom tools.
koboldcpp: A simple one-file way to run various GGML and GGUF models with KoboldAI's UI.
LLMFarm: llama and other large language models on iOS and MacOS offline using GGML library.
enchanted: Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.
Flowise: Drag & drop UI to build your customized LLM flow.
Jan: Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM).

评估

lm-evaluation-harness: A framework for few-shot evaluation of language models.
opencompass: OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
llm-comparator: LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed.

RAG

AnythingLLM: The all-in-one AI app for any LLM with full RAG and AI Agent capabilites.
MaxKB: 基于 LLM 大语言模型的知识库问答系统。开箱即用，支持快速嵌入到第三方业务系统
RAGFlow: An open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Dify: An open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
FastGPT: A knowledge-based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization.
Langchain-Chatchat: 基于 Langchain 与 ChatGLM 等不同大语言模型的本地知识库问答
QAnything: Question and Answer based on Anything.
Quivr: A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
RAG-GPT: RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
Verba: Retrieval Augmented Generation (RAG) chatbot powered by Weaviate.

书籍

课程

斯坦福 CS224N: Natural Language Processing with Deep Learning
吴恩达: Generative AI for Everyone
吴恩达: LLM series of courses
ACL 2023 Tutorial: Retrieval-based Language Models and Applications
llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
微软: Generative AI for Beginners
微软: State of GPT
HuggingFace NLP Course
清华 NLP 刘知远团队大模型公开课
斯坦福 CS25: Transformers United V4
斯坦福 CS324: Large Language Models
普林斯顿 COS 597G (Fall 2022): Understanding Large Language Models
约翰霍普金斯 CS 601.471/671 NLP: Self-supervised Models
李宏毅 GenAI课程
openai-cookbook: Examples and guides for using the OpenAI API.
Hands on llms: Learn about LLM, LLMOps, and vector DBS for free by designing, training, and deploying a real-time financial advisor LLM system.
滑铁卢大学 CS 886: Recent Advances on Foundation Models
Mistral: Getting Started with Mistral
斯坦福 CS25: Transformers United V4
Coursera: Chatgpt 应用提示工程

betashepherd/awesome-LLM-resourses

微调

推理

评估

RAG

书籍

课程

教程