suolyer's Stars
xai-org/grok-1
Grok open release
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
wgwang/awesome-LLMs-In-China
**大模型
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
sysuexam/SYSU-Exam
收集整理SYSU期末考试卷子、资料
google-research/deduplicate-text-datasets
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
hankinghu/literature-books
书籍txt
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
SciPhi-AI/synthesizer
A multi-purpose LLM framework for RAG and data creation.
jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
opendatalab/WanJuan1.0
万卷1.0多模态语料
VikParuchuri/textbook_quality
Generate textbook-quality synthetic LLM pretraining data
chaoswork/sft_datasets
开源SFT数据集整理,随时补充
chaoyi-wu/Finetune_LLAMA
简单易懂的LLaMA微调指南。
LLaMafia/llamafia.github
Strivin0311/long-llms-learning
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
tjunlp-lab/M3KE
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
OpenJarvisAI/TianMu
TianMu: A modern AI tool with multi-platform support, markdown support, multimodal, continuous conversation, and customizable commands. 一个APP支持文心一言、通义千问、LLaMa、ChatGPT等,开源的大模型客户端!
beichao1314/Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
kyegomez/FlashAttention20Triton
Triton implementation of Flash Attention2.0
JackHCC/Arxiv-NLP-Reporter
每日自动获取Arxiv上NLP相关最新论文【Arxiv Natural Language Processing Paper Automatic Crawl Daily】
UnstoppableCurry/High-quality-Chinese-Q-A-dataset
最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM
lovit/text-dedup
Python package for memory-friendly text de-duplication
robotcator/flash-attention
Fast and memory-efficient exact attention
lessw2020/triton_flashv2_alibi
working repo for Triton based Flash2 supporting alibi pos embeddings