Jzz24's Stars
ggerganov/llama.cpp
LLM inference in C/C++
halfrost/LeetCode-Go
✅ Solutions to LeetCode in Go, with 100% test coverage and runtimes that beat 100% of submissions / LeetCode solutions
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO detector that exceeds YOLOv3 through YOLOv5, with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO support. Documentation: https://yolox.readthedocs.io/
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
kamyu104/LeetCode-Solutions
🏋️ Python / Modern C++ Solutions of All 3415 LeetCode Problems (Weekly Update)
open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
PINTO0309/PINTO_model_zoo
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.
qwopqwop200/GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
D-X-Y/Awesome-AutoDL
Automated Deep Learning: Neural Architecture Search Is Not the End (a curated list of AutoDL resources and an in-depth analysis)
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
htqin/awesome-model-quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously being improved. PRs adding works (papers, repositories) missing from the repo are welcome.
SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
openppl-public/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
openppl-public/ppl.nn
A primitive library for neural networks
ModelTC/MQBench
Model Quantization Benchmark
Jermmy/pytorch-quantization-demo
A simple network quantization demo implemented from scratch in PyTorch.
openppl-public/ppl.cv
ppl.cv is a high-performance image processing library from OpenPPL that supports various platforms.
megvii-research/FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
yhhhli/BRECQ
PyTorch implementation of BRECQ (ICLR 2021)
AI-performance/embedded-ai.bench
Benchmark for embedded-AI deep learning inference engines, such as NCNN, TNN, MNN, and TensorFlow Lite.
ucbrise/actnn
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
openppl-public/CuAssembler
An unofficial CUDA assembler, for all generations of SASS, hopefully :)
sony-si/ai-research
gilshm/sparq
Post-training sparsity-aware quantization
openppl-public/ppl.common
Common libraries for PPL projects
openppl-public/ppq_tools