peiji1981's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Aider-AI/aider
aider is AI pair programming in your terminal
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
sweepai/sweep
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
mbzuai-oryx/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
mlc-ai/mlc-zh
OpenBMB/BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
aceimnorstuvwxz/toutiao-text-classfication-dataset
今日头条中文新闻(文本)分类数据集
HFAiLab/hai-platform
一种任务级GPU算力分时调度的高性能深度学习训练平台
Curated-Awesome-Lists/Awesome-Open-AI-Sora
Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from text. Explore a curated collection of articles, videos, podcasts, and news about Sora's capabilities, advancements, and more.
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
Gryphe/BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
neulab/code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
Silver267/pytorch-to-safetensor-converter
A simple converter which converts pytorch bin files to safetensor, intended to be used for LLM conversion.
Stability-AI/stability-hpc
Deploy your HPC Cluster on AWS in 20min. with just 1-Click.
Stability-AI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.