feizhouxiaozhu

feizhouxiaozhu's Stars

xai-org/grok-1
Grok open release
Language:Python49.5k 562 2098.3k
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook37.6k 392 674k
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
17.8k 371 241.4k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11.7k 206 2.2k2.4k
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language:Python8.2k 177 137945
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Language:Python8.2k 86 216598
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python7.7k 108 156451
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Language:Python7.6k 65 248532
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.4k 121 91374
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python6.9k 123 434998
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python5.7k 56 279518
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Language:Python4.5k 76 89346
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.5k 46 3159
km1994/LLMsNineStoryDemonTower
【LLMs九层妖塔】分享 LLMs在自然语言处理（ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等）、信息检索（langchain）、语言合成、语言识别、多模态等领域（Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等）等实战与经验。
1.7k 18 0168
WangRongsheng/awesome-LLM-resourses
🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
1.5k 31 2188
km1994/LLMs_interview_notes
该仓库主要记录大模型（LLMs）算法工程师相关的面试题
1.4k 10 199
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python1.3k 24 144214
huggingface/llm-vscode
LLM powered development for VSCode
Language:TypeScript1.2k 20 81131
EleutherAI/math-lm
Language:Python1k 15 4478
liucongg/NLPDataSet
记录本人整理的一些数据集
992 11 3133
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Language:Shell976 38 19102
Atome-FE/llama-node
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
Language:Rust862 15 7062
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Language:Python674 9 13294
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Language:Python611 6 6251
benman1/generative_ai_with_langchain
Build large language model (LLM) apps with Python, ChatGPT and other models. This is the companion repository for the book on generative AI with LangChain.
Language:Jupyter Notebook596 15 42241
RUCAIBox/LLMBox
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
Language:Python589 6 977
epfLLM/Megatron-LLM
distributed trainer for LLMs
Language:Python526 18 5976
liucongg/ChatGPTBook
《ChatGPT原理与实战：大型语言模型的算法、技术和私有化》
Language:Python326 11 1064
yanqiangmiffy/how-to-train-tokenizer
怎么训练一个LLM分词器
Language:Python126 6 227
LydiaXiaohongLi/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python193