nlpBeginner's Stars
infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Alir3z4/html2text
Convert HTML to Markdown-formatted text.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
FlowiseAI/Flowise
Drag & drop UI to build your customized LLM flow
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
zilliztech/akcio
Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses vector databases to fetch relevant documents to enhance the quality and relevance of the output.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
liucongg/ChatGPTBook
《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
Unstructured-IO/unstructured-api
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
ZhangErling/ChatGLM-6B
提供Windows部署文档的版本 | ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
wangyuxinwhy/uniem
unified embedding model
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
summanlp/textrank
TextRank implementation for Python 3.
ZhuiyiTechnology/WoBERT
以词为基本单位的中文BERT
HLTCHKUST/Mem2Seq
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
neulab/nn4nlp-concepts
A repository of concepts related to neural networks for NLP
CLUEbenchmark/CLGE
Chinese Language Generation Evaluation 中文生成任务基准测评
didi/ChineseNLP
Datasets, SOTA results of every fields of Chinese NLP
ChenChengKuan/awesome-text-generation
A curated list of recent models of text generation and application
songyingxin/NLPer-Interview
该仓库主要记录 NLP 算法工程师相关的面试题