Pinned Repositories
memray
Memray is a memory profiler for Python
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
terminal
The new Windows Terminal and the original Windows console host, all in the same place!
docs
This repo contains documents of the OPEA project
GenAIExamples
Intel Generative AI Examples (e.g., ChatQnA with RAG) on Xeon and Gaudi2
INC-
lb_eval
neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
probot
test-azure
XuehaoSun's Repositories
XuehaoSun/test-azure
XuehaoSun/docs
This repo contains documents of the OPEA project
XuehaoSun/GenAIExamples
Intel Generative AI Examples (e.g., ChatQnA with RAG) on Xeon and Gaudi2
XuehaoSun/INC-
XuehaoSun/lb_eval
XuehaoSun/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
XuehaoSun/probot