qzl164

qzl164's Stars

ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python34.2k 476 19k5.8k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.3k 226 2653.1k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook15.4k 194 3822.2k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.6k 152 3551k
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Language:Python9.3k 113 190719
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language:Python9.2k 103 1.4k1.1k
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7.2k 76 217458
apple/corenet
CoreNet: A library for training deep neural networks
Language:Jupyter Notebook7k 65 21541
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.1k 52 629479
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python5.7k 106 711.2k
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.8k 47 199485
SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草（原名：华驼）模型仓库，基于中文医学知识的大语言模型指令微调
Language:Python4.6k 49 106458
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
Language:TypeScript4.1k 33 14449
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
Language:TeX4.1k 47 202436
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language:Python2.6k 23 185209
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
Language:Python2.5k 17 76312
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Language:Python2.5k 43 391156
LLMBook-zh/LLMBook-zh.github.io
《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣
2.3k 16 37156
315386775/DeepLearing-Interview-Awesome-2024
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目
1.8k 27 1176
charent/ChatLM-mini-Chinese
中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。
Language:Python1.3k 14 50152
databricks/megablocks
Language:Python1.2k 16 61175
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language:Python1k 20 75108
lilacai/lilac
Curate better data for LLMs
Language:Python946 13 29389
Denis2054/Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more
Language:Jupyter Notebook816 22 5307
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
Language:Python642 15 3031
ymcui/Chinese-Mixtral
中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）
Language:Python587 15 1042
Denis2054/Transformers-for-NLP-and-Computer-Vision-3rd-Edition
Transformers 3rd Edition
Language:Jupyter Notebook337 8 1126
thunlp/Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
Language:Python77 6 69
xverse-ai/XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
Language:Python36 5 16
opendatalab/WanJuan2.0-WanJuan-CC
WanJuan-CC是以CommonCrawl为基础，经过数据抽取，规则清洗，去重，安全过滤，质量清洗等步骤得到的高质量数据。
12 4 10