huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
Python · Apache-2.0
Issues
Support for `torch.export.export`
#1893 opened by kilianyp - 7
ONNX export for CUDA does not work
#1892 opened by geraldstanje - 2
RuntimeError: Expected all tensors to be on the same device, but found at least two devices
#1889 opened by Daya-Jin - 0
Trying to export a cohere model, that is a custom or unsupported architecture, but no custom onnx configuration was passed as `custom_onnx_configs`.
#1888 opened by yimingqian - 0
Optimum ORTOptimizer inference runs slower than setfit.export_onnx + runtime.InferenceSession inference
#1885 opened by geraldstanje - 0
Phi-3 support for OpenVINO export not working
#1880 opened by jojo1899 - 1
Quantizing from the CLI using a config file fails
#1838 opened by jens-totemic - 0
Unable to generate a question-answering model for Llama; there is also no list of supported models for question-answering
#1876 opened by customautosys - 4
Phi3 support
#1826 opened by martinlyons - 1
LLava ONNX export has a problem
#1873 opened by Pengjie-W - 0
ONNX converted Whisper model takes more than twice the VRAM of the torch version
#1869 opened by Hubert-Bonisseur - 0
`Graph Output Error` when loading an optimized model after running `optimizer.optimize`
#1868 opened by Ashton-Sidhu - 1
How to change the Optimum temporary path?
#1855 opened by neonarc4 - 1
circular dependency issue
#1863 opened by jamesrobertwilliams - 2
TrOCR in ONNX format does not read the full text
#1858 opened by feff2 - 0
Improve TrOCR inference time
#1859 opened by CrasCris - 2
Issue Report: Unable to Export Gemma 7b Model to ONNX Format in Optimum
#1814 opened by Harini-Vemula-2382 - 0
TrOCR in ONNX format does not read the full text
#1857 opened by feff2 - 0
ai21labs/Jamba-tiny-random support
#1854 opened by frankia312 - 5
Idefics2 Support in Optimum for ONNX export
#1821 opened by gtx-cyber - 1
Llama 3 Support
#1835 opened by bitterspeed - 0
[Tracking] TorchDynamo ONNX exporter issues
#1810 opened by BowenBao - 0
gemma-2b statically quantized: generated text makes no sense
#1853 opened by CHNtentes - 0
no attribute '_TASKS_TO_AUTOMODELS' error
#1852 opened by klutzDrawers - 1
AttributeError: '_ORTModelForWhisper' object has no attribute '_retrieve_segment'
#1816 opened by MrRace - 0
The Whisper large-v3 model exported to ONNX does not return the end timestamp for the last chunk
#1850 opened by IlyaPikin - 0
Add support for grounding-dino model type
#1849 opened by JeroendenBoef - 0
OPT6.7b ONNX model not giving accurate results on CPU
#1848 opened by pragyam32 - 0
Static Quantization for Seq2Seq models like T5
#1847 opened by NQTri00 - 1
Can't get ORTStableDiffusionPipeline to run on GPU on fresh AWS or GCP instances
#1844 opened by iuliaturc - 0
Low performance of THUDM/chatglm3-6b ONNX model
#1846 opened by tuhinpahari - 0
Support for speech-to-text models
#1843 opened by JamesBowerXanda - 2
Output-names magic in recent Optimum ONNX export
#1842 opened by jobergum - 2
Why does ORTModelForCausalLM assume the new input length is 1 when `past_key_values` is passed?
#1839 opened by cyh-ustc - 0
Pushing to the hub with a token
#1836 opened by IlyasMoutawwakil - 0
The transformation of the model Blip2ForConditionalGeneration to BetterTransformer failed
#1833 opened by garyzhang99 - 2
Support Transformers >=4.40.0
#1822 opened by saattrupdan - 0
Request for ONNX Export Support for Blip Model in Optimum
#1818 opened by n9s8a - 1
Support Llava ONNX export
#1813 opened by Harini-Vemula-2382 - 4
Exporting tinyllama-1.1b using onnxruntime bf16 crashes
#1807 opened by mgiessing - 4
Advice for a simple onnxruntime script for ORTModelForVision2Seq (or separate encoder/decoder)
#1804 opened by eduardatmadenn - 0
Issue Report: Unable to Export Qwen Model to ONNX Format in Optimum
#1798 opened by Harini-Vemula-2382