reborm

reborm's Stars

PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python41.1k 435 9.2k7.5k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python20.8k 177 3902k
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
13.4k 185 211.2k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python12.3k 77 7791.2k
netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python10.7k 97 3361k
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML7.7k 52 1.1k613
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Language:Python4.2k 43 349323
InsaneLife/ChineseNLPCorpus
中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。
Language:Python4.2k 86 9780
baidu/lac
百度NLP：分词，词性标注，命名实体识别，词重要性
Language:C++3.8k 106 247592
letiantian/TextRank4ZH
:deciduous_tree:从中文文本中自动提取关键词和摘要
Language:Python3.2k 103 34845
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
Language:Jupyter Notebook3.1k 32 103197
MarcSkovMadsen/awesome-streamlit
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Language:HTML2k 45 30349
MetaGLM/FinGLM
FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目，利用开源开放来促进「AI+金融」。
Language:HTML1.6k 27 28232
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
1.4k 15 4107
km1994/LLMs_interview_notes
该仓库主要记录大模型（LLMs）算法工程师相关的面试题
1.2k 10 193
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language:Python1.2k 27 9768
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
862 12 346
SuperpoweredAI/spRAG
Retrieval engine for unstructured data
Language:Python518 6 833
yunwei37/Prompt-Engineering-Guide-zh-CN
🐙 关于提示词工程（prompt）的指南、论文、讲座、笔记本和资源大全（自动持续更新）
Language:Jupyter Notebook404 5 336
SpongebBob/Finetune-ChatGLM2-6B
ChatGLM2-6B 全参数微调，支持多轮对话的高效微调。
Language:Python394 8 2242
stacklens/django-vue-tutorial
用 django-rest-framework 和 vue 搭建前后端分离的个人博客
Language:Vue388 9 496
NLPJCL/RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder
Language:Python370 6 1932
WangRongsheng/MedQA-ChatGLM
🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调，我们的眼光不止于医疗问答
Language:Python291 5 1244
THUDM/AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
Language:Python263 11 2820
nickrosh/evol-teacher
Open Source WizardCoder Dataset
Language:Python143 2 511
JBoRu/Awesome-KBQA
Paper list of KBQA
61 2 037
cuplv/text-to-sql-wizardcoder
Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resultant model, achieves 61% execution accuracy, incorporating database context for validation.
Language:Jupyter Notebook42 6 14
5663015/LLMs_train
一套代码指令微调大模型
Language:Python31 1 13
DengYangyong/textrank_summarization
用textrank算法做中文新闻自动摘要
Language:Jupyter Notebook1714
yernenip/CodeLlama-LangChain-MySql
Prototype sample code demonstrating how we can leverage CodeLlama locally and connect it to MySQL using LangChain
Language:Jupyter Notebook16 1 07