successor-yu

successor-yu's Stars

jingyaogong/minimind
【大模型】3小时完全从0训练一个仅有26M的小参数GPT，最低仅需2G显卡即可推理训练！
Language:Python1.9k219
WangRongsheng/awesome-LLM-resourses
🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
1.4k172
chaoql/rag-best-practices
大模型检索增强生成技术最佳实践。
Language:Python334
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
Language:Python3.4k470
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python15.9k1.1k
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
83532
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.1k1.1k
lizhe2004/Awesome-LLM-RAG-Application
the resources about the application based on LLM with RAG pattern
75648
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Language:Python9.9k644
ArtificialZeng/llama3_explained
the newest version of llama3，source code explained line by line using Chinese
Language:Python212
Filimoa/open-parse
Improved file parsing for LLM’s
Language:Python2.4k90
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
1.6k124
wangshusen/SearchEngine
搜索引擎原理
1.4k115
hymie122/RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
1.1k83
PharMolix/OpenBioMed
Language:Python67776
hrwleo/dwnlpinterview
Datawhale NLP 面筋
146174
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Language:HTML9.3k909
km1994/nlp_paper_study
该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记
Language:C++3.9k661
hellotransformers/Natural_Language_Processing_with_Transformers
Natural Language Processing with Transformers 中译本，最权威Transformers教程
36989
stanleylsx/llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
Language:Python19819
liucongg/ChatGLM-Finetuning
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等
Language:Python2.6k292
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
Language:Python40758
jrt-20/pytorch_parallel
pytorch分布式数据并行、模型并行
Language:Python1
jwj7140/Gugugo
Gugugo: 한국어 오픈소스 번역 모델 프로젝트
Language:Jupyter Notebook676
FreedomIntelligence/LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
Language:Python2.9k199
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29.4k4k
km1994/LLMsNineStoryDemonTower
【LLMs九层妖塔】分享 LLMs在自然语言处理（ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等）、信息检索（langchain）、语言合成、语言识别、多模态等领域（Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等）等实战与经验。
1.7k167
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
1.1k59
hikariming/chat-dataset-baseline
人工精调的中文对话数据集和一段chatglm的微调代码
Language:Jupyter Notebook1.1k95
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
Language:Jupyter Notebook2.9k356