zyeric

honest, modest, reliable

Microsoft Research Asia, Beijing

zyeric's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language: C++ 74.3k 571 4.5k 10.7k
meta-llama/llama
Inference code for Llama models
Language: Python 57.9k 532 1.1k 9.7k
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language: Python 36.2k 479 19.8k 6.1k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 33.6k 318 9624.9k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language: Jupyter Notebook 28.1k 325 4173.5k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python 16.5k 125 1.3k 1.6k
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language: Python 15.8k 267 2172.7k
state-spaces/mamba
Mamba SSM architecture
Language: Python 14.4k 105 6281.3k
mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 11.4k 83 5341.1k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language: Python 9.2k 76 612653
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language: Python 8.5k 96 1.8k 1.1k
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python 7.6k 79 702780
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Language: C++ 7.6k 245 1k 833
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python 7k 46 88633
google-deepmind/alphageometry
Language: Python 4.4k 57 138505
mosaicml/llm-foundry
LLM training code for Databricks foundation models
Language: Python 4.2k 48 397557
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language: Python 4.1k 40 395299
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language: Jupyter Notebook 4.1k 113 1831.1k
esbatmop/MNBVC
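MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, including data written in "Martian script" (火星文). It includes news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and every other form of plain-text Chinese data.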
3.8k 67 60265
pytorch/torchtitan
A PyTorch native library for large model training
Language: Python 3.5k 52 278320
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
Language: Python 3.5k 36 271452
tensorflow/lingvo
Lingvo
Language: Python 2.8k 117 254448
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
2.5k 115 1309
microsoft/mup
maximal update parametrization (µP)
Language: Jupyter Notebook 1.5k 29 6298
NVIDIA/nccl-tests
NCCL Tests
Language: Cuda 1k 26 257268
volcengine/veScale
A PyTorch Native LLM Training Framework
Language: Python 762 32 2142
google-research/vmoe
Language: Jupyter Notebook 614 13 1753
llvm/Polygeist
C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!
Language: C++ 526 21 142128
facebookresearch/iopath
A python library that provides common I/O interface across different storage backends.
Language: Python 141 14 1324
microsoft/nnscaler
nnScaler: Compiling DNN models for Parallel Training
Language: Python 103 8 1513
