acceleration
There are 191 repositories under acceleration topic.
linearmouse/linearmouse
The mouse and trackpad utility for Mac.
gkjohnson/three-mesh-bvh
A BVH implementation to speed up raycasting and enable spatial queries against three.js meshes.
mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
mit-han-lab/proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
ethanhe42/channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
react-native-sensors/react-native-sensors
A developer friendly approach for sensors in React Native
polygonplanet/chillout
Reduce CPU usage by non-blocking async loop and psychologically speed up in JavaScript
staticallyio/statically
The CDN for developers.
mayankk2308/set-egpu
Display-agnostic acceleration of macOS applications using external GPUs.
Syncleus/aparapi
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
microsoft/hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
wenwei202/caffe
Caffe for Sparse and Low-rank Deep Neural Networks
Media-Smart/volksdep
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
gin66/FastAccelStepper
A high speed stepper library for Atmega 168/328p (nano), Atmega32u4, Atmega 2560, ESP32, ESP32S2, ESP32S3, ESP32C3 and Atmel SAM Due
lmbxmu/HRank
Pytorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map
lmxyy/sige
[NeurIPS 2022] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
jingwood/d2dlib
A .NET library for hardware-accelerated, high performance, immediate mode rendering via Direct2D.
BUAA-CI-LAB/Literatures-on-GNN-Acceleration
A reading list for deep graph learning acceleration.
mit-han-lab/inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
ros-acceleration/robotic_processing_unit
A robot-specific processing unit. Contains CPUs, FPGAs and GPUs and maps ROS efficiently to them for best performance.
robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Cultrarius/Swarmz
A free, header-only C++ swarming (flocking) library for real-time applications
Infini-AI-Lab/TriForce
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
mbroemme/vdi-stream-client
VDI Stream Client is a very tiny, low latency and GPU accelerated client to connect to Windows running Parsec Host.
firebuild/firebuild
Automatic build accelerator cache for Linux
nebuly-ai/exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀
juliagusak/model-compression-and-acceleration-progress
Repository to track the progress in model compression and acceleration
obss/BIOBSS
A package for processing signals recorded using wearable sensors, such as Electrocardiogram (ECG), Photoplethysmogram (PPG), Electrodermal activity (EDA) and 3-axis acceleration (ACC).
intel/hexl-fpga
Intel Homomorphic Encryption Acceleration Library for FPGAs, including open source implementation of FPGA kernels for accelerating NTT, INTT, Keyswitch and Dyadic Multiplication modular arithmetic operations, FPGA runtime, and host APIs for connecting to third-party homomorphic encryption libraries.
xtknight/vdpau-va-driver-vp9
Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi
GitSquared/rinzler
An autonomous parallel processing engine for the browser.
whitelok/tvm-lesson
动手学习TVM核心原理教程
ghamerly/fast-kmeans
Code to speed up k-means clustering. Originally at BaylorCS/baylorml.