Pinned Repositories
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
bitsandbytes
LLM.int8() paper: 8-bit CUDA functions for PyTorch
blog
Public repo for HF blog posts
CVPR24_samples
NNCF Model Optimization examples for CVPR 2024 workshop
mobilenetv2_food101
Training of MobileNet v2 from Torchvision on Food-101 dataset
nncf_timm
NNCF scaling proposal based on the PyTorch Timm project
openvino_training_extensions
Trainable models and NN optimization tools
pyopenvino
Simplified Python API for OpenVINO
stable_diffusion_quantization
Quantization of Stable Diffusion POC
tomesd
Speed up Stable Diffusion with this one simple trick!
AlexKoff88's Repositories
AlexKoff88/tomesd
Speed up Stable Diffusion with this one simple trick!
AlexKoff88/stable_diffusion_quantization
Quantization of Stable Diffusion POC
AlexKoff88/CVPR24_samples
NNCF Model Optimization examples for CVPR 2024 workshop
AlexKoff88/openvino_training_extensions
Trainable models and NN optimization tools
AlexKoff88/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
AlexKoff88/blog
Public repo for HF blog posts
AlexKoff88/CLIP_benchmark
CLIP-like model evaluation
AlexKoff88/model_api
OpenVINO Model API
AlexKoff88/nncf_pytorch
Neural Network Compression Framework for PyTorch*
AlexKoff88/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AlexKoff88/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
AlexKoff88/hg_compression_service
Hugging Face Model Optimization service for OpenVINO
AlexKoff88/langchain
🦜🔗 Build context-aware reasoning applications
AlexKoff88/llama.cpp
Port of Facebook's LLaMA model in C/C++
AlexKoff88/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
AlexKoff88/nncf_tensorrt
Experiments for NNCF quantization and inference with TensorRT
AlexKoff88/open_clip
An open source implementation of CLIP.
AlexKoff88/open_model_zoo
Pre-trained Deep Learning models and samples (high quality and extremely fast)
AlexKoff88/openvino
OpenVINO™ Toolkit - Deep Learning Deployment Toolkit repository
AlexKoff88/openvino.genai
AlexKoff88/openvino_contrib
Repository for OpenVINO's extra modules
AlexKoff88/openvino_devtools
Tools for easier OpenVINO development/debugging
AlexKoff88/openvino_notebooks
A collection of Jupyter notebooks for learning and experimenting with OpenVINO
AlexKoff88/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
AlexKoff88/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
AlexKoff88/ov.cpu.llm.experimental
ovLLM
AlexKoff88/setfit
Efficient few-shot learning with Sentence Transformers
AlexKoff88/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
AlexKoff88/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
AlexKoff88/who_what_benchmark