Pinned Repositories
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
CS-Awesome-Courses
Excellent computer science courses
GenAIComps
GenAI components at the microservice level; a GenAI service composer for creating mega-services
GenAIEval
Evaluation, benchmarking, and scorecards targeting throughput and latency performance, accuracy on popular evaluation harnesses, safety, and hallucination
optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
changwangss's Repositories
changwangss/GenAIComps
GenAI components at the microservice level; a GenAI service composer for creating mega-services
changwangss/GenAIEval
Evaluation, benchmarking, and scorecards targeting throughput and latency performance, accuracy on popular evaluation harnesses, safety, and hallucination
changwangss/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
changwangss/CS-Awesome-Courses
Excellent computer science courses
changwangss/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
changwangss/intel-extension-for-transformers
Extends the Hugging Face transformers APIs for Transformer-based models and improves the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve inference efficiency on Intel platforms.
changwangss/lm-evaluation-harness
A framework for few-shot evaluation of language models.
changwangss/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
changwangss/optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
changwangss/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
changwangss/lpot
Intel® Low Precision Optimization Tool, which aims to provide a unified low-precision inference interface across different deep learning frameworks and supports auto-tuning against a specified accuracy criterion to find the best quantized model.
changwangss/optimum-habana
Easy and lightning-fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
changwangss/tgi-gaudi
Large Language Model Text Generation Inference on Habana Gaudi