openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
PythonApache-2.0
Issues
- 3
untimeError: CUDA is available for PyTorch, but NNcF could not compile GPu quan tization extensions.
#3033 opened by huangqiu15444 - 0
PyTorch 2.5.0
#3036 opened by alexsu52 - 4
[TorchFX] Torch FX/PyTorch 2 Export Quantization
#2766 opened by alexsu52 - 2
Object Detection Quantization fails with OpenVINO 2024.3
#2889 opened by fmishra747 - 5
How to use custom dataloder in QAT?
#2982 opened by huangqiu15444 - 16
- 15
[Good First Issue][NNCF][FBC/BC] Add a case with depthwise/transpose convolutions to the template test
#2916 opened by daniil-lyakhov - 0
[Torch FX] INT4 data-free weights compression
#3005 opened by alexsu52 - 10
- 5
The question about function create_compressed_model():RuntimeError: CUDA error: device-side assert triggered
#2688 opened by zbnlala - 0
[TorchFX] Add edges shape to `test_models.py`
#2904 opened by daniil-lyakhov - 1
[TorchFX] Support NNCFGraph.is_shared method
#2938 opened by daniil-lyakhov - 5
KeyError: 'bf16'
#2980 opened by hezhiqian01 - 16
- 15
- 1
[Torch FX] Support for ExportedProgram from torch.export.export in Weights Compression
#2987 opened by anzr299 - 8
- 4
- 12
NNCF can not compile model: Default output not supported (opset1::VariadicSplit)
#2943 opened by cansik - 0
- 2
Accuracy Aware Quantization- Object Segmentation fails for OpenVINO 2024.3 and ultralytics_8.2.74 with yolov8-seg model and coco128-seg dataset
#2907 opened by fmishra747 - 0
[TorchFX][Optimization] Run eliminate dead code once for constant correction commands
#2929 opened by daniil-lyakhov - 0
- 0
- 5
Failed to export NNCF QAT model for Ultralytics YOLOv8
#2846 opened by quinnZE - 5
- 2
[TorchFX] TorchFX metatypes support
#2778 opened by daniil-lyakhov - 2
- 10
nncf + ultralytics yolov8 training-time compression
#2486 opened by SofyaLL - 1
test_autoq_precision_init Fails due to connection timeout
#2843 opened by anzr299 - 1
DeformConv2d Cannot be Quantized
#2794 opened by anzr299 - 2
When will there be a c++ interface?
#2753 opened by qq978358810 - 5
- 10
- 1
[DISCUSSION]: Metrics for NNCF
#2763 opened by AbhijithGanesh - 2
IndexError: list index out of range When I try to quantize llama models using OVQuantizer
#2755 opened by Alwahsh - 4
AttributeError: 'list' object has no attribute 'keys' when executing yolov8_quantize_with_accuracy_control example
#2669 opened by openvino-book - 3
- 19
- 4
- 19
[Good First Issue][NNCF]: Add INT8 weight compression conformance test for Tinyllama-1.1b PyTorch model
#2527 opened by alexsu52 - 8
[Good First Issue][NNCF]: Optimize memory footprint by removing redundant collected statistics
#2557 opened by kshpv - 1
Ultralytics yolov8 QAT example
#2489 opened by alexsu52 - 1
PTQ of Fast R-CNN crashes in PyTorch backend
#2568 opened by alexsu52 - 10
[Good First Issue][NNCF]: check number of u8, u4 constants in weight compression tests
#2565 opened by ljaljushkin - 2
Why doesn't the size and precision of the model change after INT4 quantization?
#2553 opened by xingfenghaizeiwang - 5
- 6
[Good First Issue][NNCF]: Fixing NNCFGraph export for visualization in Netron
#2552 opened by andrey-churkin - 1
Thanks to our Contributors
#2499 opened by alexsu52 - 0
Compressed models that call torch.is_floating_point() during inference are traced with runtime error.
#2432 opened by alexsu52