acceleration
There are 201 repositories under acceleration topic.
linearmouse/linearmouse
The mouse and trackpad utility for Mac.
gkjohnson/three-mesh-bvh
A BVH implementation to speed up raycasting and enable spatial queries against three.js meshes.
mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
mit-han-lab/proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
ethanhe42/channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
react-native-sensors/react-native-sensors
A developer friendly approach for sensors in React Native
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
polygonplanet/chillout
Reduce CPU usage by non-blocking async loop and psychologically speed up in JavaScript
staticallyio/statically
The CDN for developers.
mayankk2308/set-egpu
Display-agnostic acceleration of macOS applications using external GPUs.
Syncleus/aparapi
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
microsoft/hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
wenwei202/caffe
Caffe for Sparse and Low-rank Deep Neural Networks
gin66/FastAccelStepper
A high speed stepper library for Atmega 168/328p (nano), Atmega32u4, Atmega 2560, ESP32, ESP32S2, ESP32S3, ESP32C3, ESP32C6 and Atmel SAM Due
Media-Smart/volksdep
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
lmxyy/sige
[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
lmbxmu/HRank
Pytorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map
jingwood/d2dlib
A .NET library for hardware-accelerated, high performance, immediate mode rendering via Direct2D.
Infini-AI-Lab/TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
BUAA-CI-LAB/Literatures-on-GNN-Acceleration
A reading list for deep graph learning acceleration.
mit-han-lab/inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
ros-acceleration/robotic_processing_unit
A robot-specific processing unit. Contains CPUs, FPGAs and GPUs and maps ROS efficiently to them for best performance.
Cultrarius/Swarmz
A free, header-only C++ swarming (flocking) library for real-time applications
mbroemme/vdi-stream-client
VDI Stream Client is a very tiny, low latency and GPU accelerated client to connect to Windows running Parsec Host.
firebuild/firebuild
Automatic build accelerator cache for Linux
nebuly-ai/exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀
obss/BIOBSS
A package for processing signals recorded using wearable sensors, such as Electrocardiogram (ECG), Photoplethysmogram (PPG), Electrodermal activity (EDA) and 3-axis acceleration (ACC).
juliagusak/model-compression-and-acceleration-progress
Repository to track the progress in model compression and acceleration
intel/hexl-fpga
Intel Homomorphic Encryption Acceleration Library for FPGAs, including open source implementation of FPGA kernels for accelerating NTT, INTT, Keyswitch and Dyadic Multiplication modular arithmetic operations, FPGA runtime, and host APIs for connecting to third-party homomorphic encryption libraries.
xtknight/vdpau-va-driver-vp9
Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi
GitSquared/rinzler
An autonomous parallel processing engine for the browser.
whitelok/tvm-lesson
动手学习TVM核心原理教程
defparam/BAR-Tender
An FPGA I/O Device which services physical memory reads/writes via UMDF2 driver