peiji1981

OpenCSG Algorithm Engineer

peiji1981's Stars

hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python39k 386 1.7k4.3k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python38.5k 387 3206.2k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python38k 222 5.7k4.7k
Aider-AI/aider
aider is AI pair programming in your terminal
Language:Python24.8k 162 2.4k2.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python23.1k 192 5322.3k
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell11.7k 73 901705
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python11.1k 100 8221.1k
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.3k 85 38882
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Language:Python8.3k 88 222614
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python8.1k 110 158490
sweepai/sweep
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
Language:Jupyter Notebook7.5k 45 918436
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7.3k 78 224464
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python7k 128 4511k
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python6k 71 270518
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python5k 51 215522
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
Language:TeX4.2k 47 202440
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.4k 60 109344
mbzuai-oryx/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
Language:Python616 13 1448
mlc-ai/mlc-zh
Language:Python597 25 466
OpenBMB/BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
Language:Python575 11 8777
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Language:Python539 13 3043
aceimnorstuvwxz/toutiao-text-classfication-dataset
今日头条中文新闻（文本）分类数据集
Language:Python364 3 1063
HFAiLab/hai-platform
一种任务级GPU算力分时调度的高性能深度学习训练平台
Language:Python363 8 1646
Curated-Awesome-Lists/Awesome-Open-AI-Sora
Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from text. Explore a curated collection of articles, videos, podcasts, and news about Sora's capabilities, advancements, and more.
226 7 015
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
Language:Python207 8 1752
Gryphe/BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
Language:Python202 5 322
neulab/code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
Language:Jupyter Notebook178 6 616
Silver267/pytorch-to-safetensor-converter
A simple converter which converts pytorch bin files to safetensor, intended to be used for LLM conversion.
Language:Python57 2 43
Stability-AI/stability-hpc
Deploy your HPC Cluster on AWS in 20min. with just 1-Click.
Language:Shell53 4 016
Stability-AI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python13 4 07

peiji1981

peiji1981's Stars

hpcaitech/ColossalAI

karpathy/nanoGPT

hiyouga/LLaMA-Factory

Aider-AI/aider

hpcaitech/Open-Sora

QwenLM/Qwen2.5

Lightning-AI/litgpt

karpathy/minbpe

THUDM/CodeGeeX

jzhang38/TinyLlama

sweepai/sweep

OpenBMB/MiniCPM

EleutherAI/gpt-neox

Lightning-AI/lit-llama

allenai/OLMo

openmlsys/openmlsys-zh

NExT-GPT/NExT-GPT

mbzuai-oryx/MobiLlama

mlc-ai/mlc-zh

OpenBMB/BMTrain

declare-lab/instruct-eval

aceimnorstuvwxz/toutiao-text-classfication-dataset

HFAiLab/hai-platform

Curated-Awesome-Lists/Awesome-Open-AI-Sora

FlagOpen/FlagScale

Gryphe/BlockMerge_Gradient

neulab/code-bert-score

Silver267/pytorch-to-safetensor-converter

Stability-AI/stability-hpc

Stability-AI/gpt-neox