Pinned Repositories
ColoBloom
ColossalAI
Making large AI models cheaper, faster and more accessible
EnergonAI
Large-scale model inference.
ai_lib
awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
awesome-RLHF
collecting RLHF papers
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
EnergonAI
Large-scale model inference.
IRsend_FPGA
IRsend && Bluetooth Connecting on FPGA
VideoSys
VideoSys: An easy and efficient system for video generation
ht-zhou's Repositories
ht-zhou/awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
ht-zhou/awesome-RLHF
collecting RLHF papers
ht-zhou/binary-bert
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
ht-zhou/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
ht-zhou/EnergonAI
Large-scale model inference.
ht-zhou/Best-README-Template
An awesome README template to jumpstart your projects!
ht-zhou/binary-quantization-Meta
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
ht-zhou/bitsandbytes
8-bit CUDA functions for PyTorch
ht-zhou/ColoBloom
ht-zhou/ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
ht-zhou/Colossalai-Bloom
ht-zhou/ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
ht-zhou/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
ht-zhou/FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
ht-zhou/ht-zhou
ht-zhou/InfiAgent.github.io
InfiAgent website
ht-zhou/Int8TP
ht-zhou/leecode
ht-zhou/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
ht-zhou/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
ht-zhou/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
ht-zhou/model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
ht-zhou/Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
ht-zhou/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
ht-zhou/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
ht-zhou/smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
ht-zhou/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
ht-zhou/TensorRT_Tutorial
ht-zhou/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ht-zhou/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)