zyeric's Stars
ggerganov/llama.cpp
LLM inference in C/C++
meta-llama/llama
Inference code for Llama models
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
state-spaces/mamba
Mamba SSM architecture
mlfoundations/open_clip
An open source implementation of CLIP.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
google-deepmind/alphageometry
mosaicml/llm-foundry
LLM training code for Databricks foundation models
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
pytorch/torchtitan
A PyTorch native library for large model training
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
tensorflow/lingvo
Lingvo
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
microsoft/mup
maximal update parametrization (µP)
NVIDIA/nccl-tests
NCCL Tests
volcengine/veScale
A PyTorch Native LLM Training Framework
google-research/vmoe
llvm/Polygeist
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
facebookresearch/iopath
A python library that provides common I/O interface across different storage backends.
microsoft/nnscaler
nnScaler: Compiling DNN models for Parallel Training