amadeus-zte's Stars
Adlik/smoothquantplus
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Adlik/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Adlik/llma
Adlik/model_optimizer
Adlik/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
YanshiShield/YanshiShield
realYurkOfGitHub/translation-Introduction-to-HPC
Chinese translation, slides (PPT), and labs for Professor Eijkhout's Introduction to HPC.
Adlik/model_zoo
BBuf/tvm_mlir_learn
A collection of compiler learning resources.
Adlik/Adlik
Adlik: Toolkit for Accelerating Deep Learning Inference
Adlik/zen_nas
Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm
zwang4/awesome-machine-learning-in-compilers
Must-read research papers and links to tools and datasets related to using machine learning for compilers and systems optimisation
Adlik/object_detection
merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
Adlik/mlperf_benchmark
A benchmark suite used to compare the performance of various models that are optimized by Adlik.
OAID/Tengine
Tengine is a lite, high-performance, modular inference engine for embedded devices
SeldonIO/alibi
Algorithms for explaining machine learning models
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Adlik/model_optimizer_tf
Model optimizer used in Adlik.