gpu-computing
There are 749 repositories under the gpu-computing topic.
catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
gyroflow/gyroflow
Video stabilization using gyroscope data
NVIDIA/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
google/tf-quant-finance
High-performance TensorFlow library for quantitative finance.
ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.
raphamorim/rio
A hardware-accelerated GPU terminal emulator focused on running on desktops and in browsers.
tensorflow/lingvo
Lingvo
microsoft/pai
Resource scheduling and cluster management for AI
jbush001/NyuziProcessor
GPGPU microprocessor architecture
SciML/SciMLBook
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
inducer/pycuda
CUDA integration for Python, plus shiny features
coreylowman/dfdx
Deep learning in Rust, with shape checked tensors and neural networks
calebwin/emu
The write-once-run-anywhere GPGPU library for Rust
KomputeProject/kompute
General-purpose GPU compute framework built on Vulkan to support 1000s of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing use cases. Backed by the Linux Foundation.
BindsNET/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
mikbry/awesome-webgpu
😎 Curated list of awesome things around WebGPU ecosystem.
mratsim/Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
LuxCoreRender/LuxCore
LuxCore source repository
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
stotko/stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
AdaptiveCpp/AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
uncomplicate/neanderthal
Fast Clojure Matrix Library
AccelerateHS/accelerate
Embedded language for high-performance array computations
NVIDIA/cccl
CUDA C++ Core Libraries
Langhalsdino/Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
eyalroz/cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
zszazi/Deep-learning-in-cloud
List of Deep Learning Cloud Providers
LuxCoreRender/BlendLuxCore
Blender Integration for LuxCore
ComputationalRadiationPhysics/picongpu
Performance-Portable Particle-in-Cell Simulations for the Exascale Era ✨
huiscliu/Tutorials
Parallel programming tutorials
googlefonts/compute-shader-101
Sample code for compute shader 101 training
triSYCL/triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
AmesingFlank/taichi.js
Modern GPU Compute and Rendering in JavaScript
smistad/FAST
A framework for high-performance medical image processing, neural network inference and visualization
ginkgo-project/ginkgo
Numerical linear algebra software package