acceleration

There are 201 repositories under acceleration topic.

linearmouse/linearmouse
The mouse and trackpad utility for Mac.
Language:Swift4k 16 34868
gkjohnson/three-mesh-bvh
A BVH implementation to speed up raycasting and enable spatial queries against three.js meshes.
Language:JavaScript2.5k 40 413268
mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Language:Python2.1k 42 220417
mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
Language:Python1.9k 53 75333
mit-han-lab/proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Language:C++1.4k 70 0285
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
Language:Cuda1.2k 16 262143
ethanhe42/channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
Language:Python1.1k 47 125310
react-native-sensors/react-native-sensors
A developer friendly approach for sensors in React Native
Language:Objective-C907 17 173225
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Language:Python596 9 2323
polygonplanet/chillout
Reduce CPU usage by non-blocking async loop and psychologically speed up in JavaScript
Language:JavaScript596 13 520
staticallyio/statically
The CDN for developers.
Language:JavaScript585 22 7489
mayankk2308/set-egpu
Display-agnostic acceleration of macOS applications using external GPUs.
Language:Shell481 33 2642
Syncleus/aparapi
The New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Language:Java466 53 11459
microsoft/hyperspace
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Language:Scala424 34 205115
wenwei202/caffe
Caffe for Sparse and Low-rank Deep Neural Networks
Language:C++379 35 35134
gin66/FastAccelStepper
A high speed stepper library for Atmega 168/328p (nano), Atmega32u4, Atmega 2560, ESP32, ESP32S2, ESP32S3, ESP32C3, ESP32C6 and Atmel SAM Due
Language:C++313 20 24671
Media-Smart/volksdep
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
Language:Python286 10 1543
lmxyy/sige
[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Language:Python259 6 29
lmbxmu/HRank
Pytorch implementation of our paper accepted by CVPR 2020 (Oral) -- HRank: Filter Pruning using High-Rank Feature Map
Language:Python252 12 2549
jingwood/d2dlib
A .NET library for hardware-accelerated, high performance, immediate mode rendering via Direct2D.
Language:C#246 16 7945
Infini-AI-Lab/TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Language:Python230 1 912
BUAA-CI-LAB/Literatures-on-GNN-Acceleration
A reading list for deep graph learning acceleration.
226 13 119
mit-han-lab/inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
Language:C++195 8 2432
robotperf/benchmarks
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪Grey-box and ⚫Black-box approaches.
Language:Python155 10 2416
ros-acceleration/robotic_processing_unit
A robot-specific processing unit. Contains CPUs, FPGAs and GPUs and maps ROS efficiently to them for best performance.
150 6 26
Cultrarius/Swarmz
A free, header-only C++ swarming (flocking) library for real-time applications
Language:C++134 6 19
mbroemme/vdi-stream-client
VDI Stream Client is a very tiny, low latency and GPU accelerated client to connect to Windows running Parsec Host.
Language:C127 6 88
firebuild/firebuild
Automatic build accelerator cache for Linux
Language:C++121 3 4884
nebuly-ai/exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀
112 10 011
obss/BIOBSS
A package for processing signals recorded using wearable sensors, such as Electrocardiogram (ECG), Photoplethysmogram (PPG), Electrodermal activity (EDA) and 3-axis acceleration (ACC).
Language:Python108 6 1420
juliagusak/model-compression-and-acceleration-progress
Repository to track the progress in model compression and acceleration
104 8 020
intel/hexl-fpga
Intel Homomorphic Encryption Acceleration Library for FPGAs, including open source implementation of FPGA kernels for accelerating NTT, INTT, Keyswitch and Dyadic Multiplication modular arithmetic operations, FPGA runtime, and host APIs for connecting to third-party homomorphic encryption libraries.
Language:C++95 13 025
xtknight/vdpau-va-driver-vp9
Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi
Language:C74 14 1411
GitSquared/rinzler
An autonomous parallel processing engine for the browser.
Language:TypeScript62 5 04
whitelok/tvm-lesson
动手学习TVM核心原理教程
Language:Python59 7 016
defparam/BAR-Tender
An FPGA I/O Device which services physical memory reads/writes via UMDF2 driver
Language:Verilog54 11 110

acceleration

linearmouse/linearmouse

gkjohnson/three-mesh-bvh

mit-han-lab/temporal-shift-module

mit-han-lab/once-for-all

mit-han-lab/proxylessnas

mit-han-lab/torchsparse

ethanhe42/channel-pruning

react-native-sensors/react-native-sensors

mit-han-lab/distrifuser

polygonplanet/chillout

staticallyio/statically

mayankk2308/set-egpu

Syncleus/aparapi

microsoft/hyperspace

wenwei202/caffe

gin66/FastAccelStepper

Media-Smart/volksdep

lmxyy/sige

lmbxmu/HRank

jingwood/d2dlib

Infini-AI-Lab/TriForce

BUAA-CI-LAB/Literatures-on-GNN-Acceleration

mit-han-lab/inter-operator-scheduler

robotperf/benchmarks

ros-acceleration/robotic_processing_unit

Cultrarius/Swarmz

mbroemme/vdi-stream-client

firebuild/firebuild

nebuly-ai/exploring-AI-optimization

obss/BIOBSS

juliagusak/model-compression-and-acceleration-progress

intel/hexl-fpga

xtknight/vdpau-va-driver-vp9

GitSquared/rinzler

whitelok/tvm-lesson

defparam/BAR-Tender