auto-tuning
There are 30 repositories under auto-tuning topic.
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
oracle/bpftune
bpftune uses BPF to auto-tune Linux systems
zwang4/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
KernelTuner/kernel_tuner
Kernel Tuner
sbu-fsl/kernel-ml
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
ROCm/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
CNugteren/CLTune
CLTune: An automatic OpenCL & CUDA kernel tuner
ederwander/PyAutoTune
Autotune Module for Python "PyAutoTune"
HAL-42/AlchemyCat
Alchemy Cat —— 🔥Config System for SOTA
SUSE/phoebe
Phoebe
tlc-pack/TLCBench
Benchmark scripts for TVM
weixingsun/jBProF
ebpf profiler for jvm
ctuning/ck-crowdtuning
Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learning across diverse platforms with Linux, Windows, MacOS and Android provided by volunteers. Demo of DNN crowd-benchmarking and crowd-tuning:
addb-swstarlab/K2vTune
K2vTune (A Workload-aware Configuration Tuning for RocksDB)
cornell-zhang/uptune
A Generic Distributed Auto-Tuning Infrastructure
NTNU-HPC-Lab/BAT
A GPU benchmark suite for autotuners
go-playground/backoff
:bowtie: Backoff uses an exponential backoff algorithm to backoff between retries with optional auto-tuning functionality.
umayrh/sparktuner
Autotuner for Spark applications
AutoTuningAssociation/autotuning_methodology
This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology easy to apply.
arcari-galimberti/margot-aspect
MarGotAspect - An AspectC++ code generator for the mARGOt framework
hibbannn/pool-manager
Pool Manager dirancang untuk mengelola pooling objek secara efisien dalam aplikasi Anda. Dengan fitur-fitur seperti sharding, caching, auto-tuning, dan kebijakan eviksi, package ini membantu meningkatkan performa dan efisiensi penggunaan memori.
jokopi/GSWITCH
A pattern-based algorithmic auto-tuner for graph processing on GPUs
mergian/matog
MATOG: CUDA Array Access Auto-Tuner
david-andrew/Ensemble
Autotuning Google Text-to-Speech
isazi/tuning_metrics
Library to compute auto-tuning and performance metrics.
kilitary/fann-related
fann networks+forex + MILK + met8
polycloze/polycloze
A self-hosted language learning website
AnonymousForATC/AIE
AIE: an Adaptive Inference Engine for Decision Tree Ensemble on GPU
DeanDev94/LabVIEW-Assignments
Four assignments from two LabVIEW Modules. LabVIEW Visual Programming and LabVIEW App Development