reborm's Stars
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
HqWu-HITCS/Awesome-Chinese-LLM
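A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.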
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
netease-youdao/QAnything
Question and Answer based on Anything.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.
InsaneLife/ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.
baidu/lac
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, and word importance
letiantian/TextRank4ZH
:deciduous_tree: Automatically extract keywords and summaries from Chinese text
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
MarcSkovMadsen/awesome-streamlit
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
MetaGLM/FinGLM
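FinGLM: dedicated to building an open, public-interest, long-lived financial large-model project, using open source and openness to advance "AI + finance".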
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis, and more.
km1994/LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
SuperpoweredAI/spRAG
Retrieval engine for unstructured data
yunwei37/Prompt-Engineering-Guide-zh-CN
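🐙 A comprehensive collection of guides, papers, lectures, notebooks, and resources on prompt engineering (automatically and continuously updated)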
SpongebBob/Finetune-ChatGLM2-6B
Full-parameter fine-tuning of ChatGLM2-6B, with efficient fine-tuning that supports multi-turn dialogue.
stacklens/django-vue-tutorial
Build a personal blog with a decoupled front end and back end using django-rest-framework and Vue
NLPJCL/RAG-Retrieval
Unified efficient fine-tuning of RAG retrieval, including Embedding, ColBERT, and Cross Encoder
WangRongsheng/MedQA-ChatGLM
🛰️ Fine-tuning ChatGLM on real medical dialogue data with LoRA, P-Tuning V2, Freeze, RLHF, and other methods; our vision goes beyond medical question answering
THUDM/AlignBench
A multi-dimensional benchmark for evaluating the Chinese alignment of large language models (ACL 2024)
nickrosh/evol-teacher
Open Source WizardCoder Dataset
JBoRu/Awesome-KBQA
Paper list of KBQA
cuplv/text-to-sql-wizardcoder
Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resulting model achieves 61% execution accuracy, incorporating database context for validation.
5663015/LLMs_train
A single codebase for instruction fine-tuning large language models
DengYangyong/textrank_summarization
Automatic summarization of Chinese news using the TextRank algorithm
yernenip/CodeLlama-LangChain-MySql
Prototype sample code demonstrating how we can leverage CodeLlama locally and connect it to MySQL using LangChain