qiufengyuyi's Stars
scutan90/DeepLearning-500-questions
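500 Questions on Deep Learning: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan, 2018.06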
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool that supports extraction from PDFs, webpages, and multi-format e-books.
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
microsoft/promptbase
All things prompt engineering
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
CosmosShadow/gptpdf
Using GPT to parse PDF
jeinlee1991/chinese-llm-benchmark
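Chinese LLM capability benchmark leaderboard: currently covers 115 large models, including commercial models such as chatgpt, gpt4o, Baidu ERNIE Bot (文心一言), Alibaba Tongyi Qianwen, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as Baichuan, qwen2, glm4, yi, InternLM2, and llama3, with multi-dimensional capability evaluation. It provides not only a capability score leaderboard but also the raw outputs of all models!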
ChineseGLUE/ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
OpenLMLab/MOSS-RLHF
MOSS-RLHF
hikariming/chat-dataset-baseline
A manually refined Chinese dialogue dataset and a snippet of chatglm fine-tuning code
liucongg/NLPDataSet
A record of some datasets I have collected and organized
yanqiangmiffy/InstructGLM
ChatGLM-6B instruction learning | instruction data | Instruct
deepcs233/jieba_fast
Use the C API and SWIG to speed up jieba: an efficient Chinese word segmentation library
naver/sqlova
jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese
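Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial large models, with an efficient, lightweight training framework for vertical-domain LLMs (Pretraining, SFT, RLHF, Quantize, etc.)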
lazyFrogLOL/llmdocparser
A package for parsing PDFs and analyzing their content using LLMs.
vtuber-plan/langport
Langport is a language model inference service
pyvandenbussche/transformers-ner
Experiments on the NER task using Hugging Face's state-of-the-art Transformers library for natural language models
Decem-Y/sohu_text_matching_Rank2
Top-2 solution for the 2021 Sohu Campus Text Matching Algorithm Competition