gpu-acceleration

There are 646 repositories under gpu-acceleration topic.

  • tensorflow/tfjs

    A WebGL accelerated JavaScript library for training and deploying ML models.

    Language:TypeScript18.6k3264.2k1.9k
  • NVIDIA/TensorRT

    NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

    Language:C++11k1563.8k2.1k
  • tensorflow/tfjs-core

    WebGL-accelerated ML // linear algebra // automatic differentiation for JavaScript.

    Language:TypeScript8.5k2960950
  • rio

    raphamorim/rio

    A hardware-accelerated GPU terminal emulator focusing to run in desktops and browsers.

    Language:Rust4.5k22558147
  • cornellius-gp/gpytorch

    A highly efficient implementation of Gaussian Processes in PyTorch

    Language:Python3.6k581.3k561
  • NVIDIA/GenerativeAIExamples

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Language:Python2.6k6355603
  • Hedgehog-Computing/hedgehog-lab

    Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.

    Language:TypeScript2.4k5347140
  • blazingsql

    BlazingDB/blazingsql

    BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.

    Language:C++1.9k55715184
  • TianZerL/Anime4KCPP

    A high performance anime upscaler

    Language:C++1.8k20113141
  • coreylowman/dfdx

    Deep learning in Rust, with shape checked tensors and neural networks

    Language:Rust1.8k35456103
  • emacs-ng/emacs-ng

    A new approach to Emacs - Including TypeScript, Threading, Async I/O, and WebRender.

    Language:Emacs Lisp1.7k3224272
  • emu

    calebwin/emu

    The write-once-run-anywhere GPGPU library for Rust

    Language:Rust1.6k384152
  • NVIDIA/cccl

    CUDA Core Compute Libraries

    Language:C++1.4k321.6k178
  • beehive-lab/TornadoVM

    TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

    Language:Java1.2k42175116
  • stdgpu

    stotko/stdgpu

    stdgpu: Efficient STL-like Data Structures on the GPU

    Language:C++1.2k303687
  • TerraForge3D

    Jaysmito101/TerraForge3D

    Cross Platform Professional Procedural Terrain Generation & Texturing Tool

    Language:C++998213092
  • NVIDIA-Merlin/HugeCTR

    HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

    Language:C++96341372200
  • Liu-xiandong/How_to_optimize_in_GPU

    This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

    Language:Cuda8841315141
  • hughperkins/VeriGPU

    OpenSource GPU, in Verilog, loosely based on RISC-V ISA

    Language:SystemVerilog876301699
  • dgasmith/opt_einsum

    ⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

    Language:Python8722113269
  • NVlabs/sionna

    Sionna: An Open-Source Library for Next-Generation Physical Layer Research

    Language:Python85239258249
  • PhotonCamera

    eszdman/PhotonCamera

    Android Camera that uses Enhanced image processing

    Language:Java812338672
  • NVIDIA-Merlin/Merlin

    NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

    Language:Python79533444120
  • limbo018/DREAMPlace

    Deep learning toolkit-enabled VLSI placement

    Language:C++73722167208
  • ttddee/Cascade

    Node-based image editor with GPU-acceleration.

    Language:C++731147131
  • coreylowman/cudarc

    Safe rust wrapper around CUDA toolkit

    Language:Rust6811314985
  • Sergio0694/NeuralNetwork.NET

    A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN

    Language:C#555362788
  • philferriere/dlwin

    GPU-accelerated Deep Learning on Windows 10 native

    Language:Python5195139100
  • DavidDiazGuerra/gpuRIR

    Python library for Room Impulse Response (RIR) simulation with GPU acceleration

    Language:Cuda500105996
  • MegviiRobot/MegBA

    MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment

    Language:Cuda456212261
  • uncomplicate/bayadera

    High-performance Bayesian Data Analysis on the GPU in Clojure

    Language:Clojure36529824
  • andrewmilson/ministark

    🏃‍♂️💨 GPU accelerated STARK prover built on @arkworks-rs

    Language:Rust351131335
  • ProjectPhysX/OpenCL-Wrapper

    OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

    Language:C++35191438
  • Glavnokoman/vuh

    Vulkan compute for people

    Language:C++347244034
  • DataCanvasIO/HyperGBM

    A full pipeline AutoML tool for tabular data

    Language:Python344165546
  • gpufit/Gpufit

    GPU-accelerated Levenberg-Marquardt curve fitting in CUDA

    Language:C++3162010995