Pinned Repositories
AccDNN
A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.
antares
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.
awesome-real-time-AI
This is a list of awesome edgeAI inference related papers.
edge-ai
A curated list of resources for embedded AI
gemm_spmm
Hardware accelerator for pruned nertworks
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the program on the GPU in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
onnx-simplifier
Simplify your onnx model
SEAsynth
A synthesize-able CNN accelerator based on systolic arrays 🌊
TENNA
TENNA: Tiny Embedded Neural Network Accelerator
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Deepware's Repositories
Deepware doesn’t have any repository yet.