Pinned Repositories
Distributed-ResNet-Tensorflow
A Distributed ResNet on multi-machines each with one GPU card.
LLMRoofline
Compare different hardware platforms via the Roofline Model for LLM inference tasks.
LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
long-context-attention
Sequence Parallel Attention for Long Context LLM Model Training and Inference
SWCaffe
A Deep Learning Framework customized for Sunway TaihuLight
swDNN
a highly-efficient library for deep neural networks based on Sunway TaihuLight supercomputer.
ColossalAI
Making large AI models cheaper, faster and more accessible
PipeFusion
A Suite of Parallel Approaches for Inference of Diffusion Transformer Models on GPU Clusters
PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
feifeibear's Repositories
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
feifeibear/long-context-attention
Sequence Parallel Attention for Long Context LLM Model Training and Inference
feifeibear/LLMRoofline
Compare different hardware platforms via the Roofline Model for LLM inference tasks.
feifeibear/Odysseus-Transformer
feifeibear/PyTorchMemTracer
Depict GPU memory footprint during DNN training of PyTorch
feifeibear/ColoBloom
feifeibear/LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
feifeibear/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
feifeibear/ssh-passwd-free
Method to set passwd-free for a set of IPs
feifeibear/TensorrtBenchmark
Benchmark bert using TensorRT
feifeibear/ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
feifeibear/DTensor
Study PyTorch DTensor
feifeibear/MoE-Megatron-LM
feifeibear/CommTest
Test for PyTorch Async Collective Communication
feifeibear/getpy
A Vectorized Python Dict/Set
feifeibear/KsanaLLM
feifeibear/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
feifeibear/ring-flash-attention
Ring attention implementation with flash attention
feifeibear/Awesome-LLM-System-Papers
feifeibear/BM-Training
Dive into Big Model Training
feifeibear/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
feifeibear/Conchmark
A benchmark liberary for Colossal-AI.
feifeibear/EnergonAI
Large-scale model inference.
feifeibear/feifeibear
feifeibear/GeminiBenchmark
feifeibear/leptonai
A Pythonic framework to simplify AI service building
feifeibear/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
feifeibear/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
feifeibear/TracerComparison
feifeibear/transformers-rlfh