successor-yu's Stars
jingyaogong/minimind
[LLM] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training need as little as a 2 GB GPU!
WangRongsheng/awesome-LLM-resourses
🧑🚀 Summary of the world's best LLM resources.
chaoql/rag-best-practices
Best practices for retrieval-augmented generation (RAG) with large language models.
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
lizhe2004/Awesome-LLM-RAG-Application
Resources on LLM-based applications built with the RAG pattern
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
ArtificialZeng/llama3_explained
The newest version of llama3, with source code explained line by line in Chinese
Filimoa/open-parse
Improved file parsing for LLMs
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.
wangshusen/SearchEngine
Principles of search engines
hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
PharMolix/OpenBioMed
hrwleo/dwnlpinterview
Datawhale NLP interview notes
liguodongiot/llm-action
This project aims to share the technical principles behind large language models along with hands-on experience.
km1994/nlp_paper_study
This repository mainly records reading notes on top-conference papers relevant to NLP algorithm engineers.
hellotransformers/Natural_Language_Processing_with_Transformers
Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial
stanleylsx/llms_tool
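A HuggingFace-based tool for training and testing large language models. Supports web UI and terminal inference for each model, parameter-efficient and full-parameter training (pretraining, SFT, RM, PPO, DPO), and model merging and quantization.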
liucongg/ChatGLM-Finetuning
Fine-tuning of the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
jrt-20/pytorch_parallel
PyTorch distributed data parallelism and model parallelism
jwj7140/Gugugo
Gugugo: a Korean open-source translation model project
FreedomIntelligence/LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
km1994/LLMsNineStoryDemonTower
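[LLMs Nine-Story Demon Tower] Shares hands-on practice and experience with LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, multimodal (Stable Diffusion, MiniGPT-4, VisualGLM-6B, Ziya-Visual, etc.), and other areas.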
yaodongC/awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
hikariming/chat-dataset-baseline
A manually refined Chinese dialogue dataset and a piece of fine-tuning code for chatglm
yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)