audiovention

audiovention's Stars

Aider-AI/aider
aider is AI pair programming in your terminal
Language:Python23.7k 157 2.3k2.2k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language:Python19.7k 134 1.2k1.4k
tensorflow/tfjs
A WebGL accelerated JavaScript library for training and deploying ML models.
Language:TypeScript18.6k 326 4.2k1.9k
taskflow/taskflow
A General-purpose Task-parallel Programming System using Modern C++
Language:C++10.4k 255 4701.2k
tiny-dnn/tiny-dnn
header only, dependency-free deep learning framework in C++14
Language:C++5.9k 335 5771.4k
NVlabs/tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
Language:C++3.8k 49 395464
flame/blis
BLAS-like Library Instantiation Software Framework
Language:C2.3k 79 446370
mil-tokyo/webdnn
The Fastest DNN Running Framework on Web Browser
Language:TypeScript2k 61 391146
pikvm/ustreamer
µStreamer - Lightweight and fast MJPEG-HTTP streamer
Language:C1.8k 42 207245
mackron/dr_libs
Audio decoding libraries for C/C++, each in a single source file.
Language:C1.3k 46 207207
libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Language:C853 51 343187
romeric/Fastor
A lightweight high performance tensor algebra framework for modern C++
Language:C++764 28 16070
webgpu/webgpufundamentals
Language:HTML689 29 35100
andravin/wincnn
Winograd minimal convolution algorithm generator for convolutional neural networks.
Language:Python610 30 27145
siboehm/SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
Language:Cuda545 4 1366
libocca/occa
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
Language:C++405 30 34886
G4brym/R2-Explorer
A Google Drive Interface for your Cloudflare R2 Buckets!
Language:Vue350 10 3869
yzhaiustc/Optimizing-SGEMM-on-NVIDIA-Turing-GPUs
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
Language:Cuda296 7 745
intel/pti-gpu
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily
Language:C++210 17 5856
ROCm/clr
Language:C++110 19 7651
gplhegde/convolution-flavors
Implementation of convolution layer in different flavors
Language:C68 4 126
mattdean1/cuda
An implementation of parallel exclusive scan in CUDA
Language:Cuda60 3 020
OrangeOwlSolutions/General-CUDA-programming
Language:Cuda42 6 112
Expander/polylogarithm
Implementation of polylogarithms in C/C++/Fortran
Language:C++33 5 34
unevens/avec
A little library for using SIMD instructions for x86 and ARM, wrapping Agner Fog's vectorclass for x86 and filling some of its functionality for ARM, and providing containers for aligned memory with views and interleaving/deinterleaving.
Language:C++15 4 01
stoneberry-webgpu/stoneberry
core WebGPU shaders
Language:TypeScript13 2 00
blu/gemm
Musings in GEMM (General Matrix Multiplication)
Language:C++12 2 06
yui0/ugemm
GEMM
Language:C10 2 23
yuzhouhe2000/Dilated-Winograd-Convolution
Parallelized Winograd 2D dilated convolution
Language:Jupyter Notebook2 2 00
gcp/sgemm
A collection of AVX/FMA SGEMM routines for small matrices, plus benchmark
Language:C++1 4 01