zhxfl

ustcchina

zhxfl's Stars

google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30k 328 5.5k2.7k
pjreddie/darknet
Convolutional Neural Networks
Language:C25.9k 912 2.4k21.3k
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Language:C++15.3k 251 6.9k3k
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language:Jupyter Notebook9.5k 185 5661.3k
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
Language:C++8.9k 199 2.6k1.7k
gperftools/gperftools
Main gperftools repository
Language:C++8.5k 362 1.3k1.5k
wang-xinyu/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
Language:C++7.1k 105 1.3k1.8k
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language:C++6.4k 247 9251k
halide/Halide
a language for fast, portable data-parallel computation
Language:C++5.9k 237 2.7k1.1k
NVIDIA/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Language:C++4.9k 207 778757
MegEngine/MegEngine
MegEngine is a fast, scalable, and easy-to-use deep learning framework with automatic differentiation support
Language:C++4.8k 137 371541
NVIDIA-AI-IOT/torch2trt
An easy to use PyTorch to TensorRT converter
Language:Python4.6k 73 727679
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
Language:C++4.4k 148 281715
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language:Python3k 147 323958
herumi/xbyak
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
Language:C++2.1k 115 94276
NVIDIA/cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
Language:Cuda1.7k 90 281448
mapillary/inplace_abn
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
Language:Python1.3k 39 231187
NervanaSystems/maxas
Assembler for NVIDIA Maxwell architecture
Language:Sass962 89 11164
eddieantonio/imgcat
It's like cat, but for images.
Language:C891 9 4032
onnx/onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
Language:C++793 36 718326
zerollzeng/tiny-tensorrt
Deploy your model with TensorRT quickly.
Language:C++764 28 6998
NVIDIA/nv-wavenet
Reference implementation of real-time autoregressive wavenet inference
Language:Cuda736 47 75126
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language:Cuda500 10 5996
NVIDIA/cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
Language:C++294 41 776
daadaada/turingas
Assembler for NVIDIA Volta and Turing GPUs
Language:Python203 12 1040
PaddlePaddle/CINN
Compiler Infrastructure for Neural Networks
Language:C++145 19 56114
dmlc/nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
Language:C++69 10 127
ap-hynninen/cutt
CUDA Tensor Transpose (cuTT) library
Language:C++51 4 927
XiuYuLi/deepcore_source_code
Subpart source code of deepcore v0.7
Language:C27 2 214
jeng1220/cuGemmProf
A simple tool to profile performance of multiple combinations of GEMM of cuBLAS
Language:C++24 3 17
