XuehaoSun

IntelChina, Shanghai

Pinned Repositories

memray
Memray is a memory profiler for Python
Language:Python13.5k 59 202397
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Language:Python2.3k 33 209258
terminal
The new Windows Terminal and the original Windows console host, all in the same place!
Language:C++96.2k 1.3k 13k8.4k
docs
This repo contains documents of the OPEA project
00
GenAIExamples
Intel Generative AI Examples (e.g., ChatQnA with RAG) on Xeon and Gaudi2
Language:Svelte00
INC-
00
lb_eval
Language:Python01
neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
Language:Python00
probot
Language:TypeScript00
test-azure
Language:Shell10

XuehaoSun/test-azure
Language:Shell10
XuehaoSun/docs
This repo contains documents of the OPEA project
00
XuehaoSun/GenAIExamples
Intel Generative AI Examples (e.g., ChatQnA with RAG) on Xeon and Gaudi2
Language:Svelte00
XuehaoSun/INC-
00
XuehaoSun/lb_eval
Language:Python01
XuehaoSun/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
Language:Python00
XuehaoSun/probot
Language:TypeScript00