Pinned Repositories
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
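A minimal sketch of how a diffusers pipeline is typically loaded and run; the model id, prompt, and output path below are assumed examples, not taken from this listing, and a CUDA GPU is assumed.

```python
import torch
from diffusers import DiffusionPipeline

# Load a pretrained text-to-image pipeline (model id is an assumed example).
pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")  # assumes a CUDA-capable GPU

# Generate one image from a text prompt and save it.
image = pipe("an astronaut riding a horse on the moon").images[0]
image.save("astronaut.png")
```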
auto-round
SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
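A hedged sketch of weight-only quantization with auto-round, following the usage pattern in its README; the example model, argument names (bits, group_size, sym), and output directory are assumptions if the installed release differs.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound  # import path assumed from the project README

model_name = "facebook/opt-125m"  # small example model, an assumption
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Configure 4-bit weight-only rounding tuned via signed gradient descent.
autoround = AutoRound(model, tokenizer, bits=4, group_size=128, sym=True)
autoround.quantize()
autoround.save_quantized("./opt-125m-int4")  # output directory is an assumption
```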
models
A collection of pre-trained, state-of-the-art models in the ONNX format
neural-compressor
onnxruntime
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
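A minimal sketch of running inference with ONNX Runtime; the model path and input shape are placeholders.

```python
import numpy as np
import onnxruntime as ort

# Open a session on an exported model (path and input shape are assumptions).
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name
dummy = np.random.rand(1, 3, 224, 224).astype(np.float32)

# Run the graph; passing None as the output list returns all model outputs.
outputs = session.run(None, {input_name: dummy})
print(outputs[0].shape)
```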
onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
optimum-habana
Easy and lightning-fast training of 🤗 Transformers on Habana Gaudi processors (HPU)
optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
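A sketch of one optimum-intel path, exporting a 🤗 Transformers checkpoint to OpenVINO for inference; the model id is an assumed example and OVModelForCausalLM is only one of the integrations the library provides.

```python
from optimum.intel import OVModelForCausalLM
from transformers import AutoTokenizer

model_id = "gpt2"  # example checkpoint, an assumption
# export=True converts the PyTorch checkpoint to OpenVINO IR on the fly.
model = OVModelForCausalLM.from_pretrained(model_id, export=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("The Intel toolchain can", return_tensors="pt")
generated = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```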
onnx
Open standard for machine learning interoperability
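A small sketch using the onnx package to load and validate a serialized model; the file path is a placeholder.

```python
import onnx

# Load a serialized model and check it against the ONNX spec.
model = onnx.load("model.onnx")  # placeholder path
onnx.checker.check_model(model)

# Inspect the graph's declared inputs and outputs.
for tensor in list(model.graph.input) + list(model.graph.output):
    print(tensor.name)
```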
mengniwang95's Repositories
mengniwang95/auto-round
SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
mengniwang95/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
mengniwang95/models
A collection of pre-trained, state-of-the-art models in the ONNX format
mengniwang95/neural-compressor
mengniwang95/onnxruntime
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
mengniwang95/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
mengniwang95/optimum-habana
Easy and lightning-fast training of 🤗 Transformers on Habana Gaudi processors (HPU)
mengniwang95/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools