NVIDIA/cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

C++MIT

Issues

Bug in Flash with rng dropout sample
#80 opened a month ago by wfoy
1
Why is graph::check_support really slow?
#70 opened 2 months ago by ZoroDerVonCodier
5
Missing header files in the package
#76 opened 2 months ago by DEKHTIARJonathan
2
How to use cudnn frontend in Jax?
#89 opened 2 months ago by MoFHeka
4
Trouble when fusing layernorm with pointwise operation.
#88 opened 3 months ago by xcwang1999
0
Question About Reduce Node
#87 opened 3 months ago by xcwang1999
0
Question about calling MHA
#85 opened 3 months ago by GonChen
2
question about memory layout of the convolution
#83 opened 3 months ago by jinz2014
1
Default `cudnnConvolutionMode_t` and how to set it?
#82 opened 4 months ago by ogoidmatos
0
What's the difference of flash attention implement between cudnn and Dao-AILab?
#52 opened 9 months ago by MoFHeka
19
Matmul test failure
#78 opened 4 months ago by shiwenloong
2
cudnn._compiled_module.cudnnGraphNotSupportedError: [cudnn_frontend] Error: No execution plans built successfully.
#75 opened 5 months ago by ifromeast
1
Error running Flash & BatchNormalization tests
#77 opened 4 months ago by nravic
1
[ERROR] Exception CUDNN_BACKEND_TENSOR_DESCRIPTOR cudnnFinalize failed cudnn_status: CUDNN_STATUS_NOT_INITIALIZED
#71 opened 6 months ago by Tr-buaa
2
Support "make install"
#64 opened 7 months ago by iskunk
1
Support use of external/system Catch2 installation
#63 opened 7 months ago by iskunk
2
CUDNN_FRONTEND_BUILD_UNIT_TESTS option is broken
#62 opened 7 months ago by iskunk
2
identifier "geomlib::_NV_ANON_NAMESPACE::kEps" is undefined in device code
#67 opened 7 months ago by zcc2xj
2
Windows build error
#66 opened 7 months ago by tianleiwu
3
Inference result of deep learning model is all NAN
#23 opened 7 months ago by akineeic
4
INT8 sample didn't work?
#31 opened 7 months ago by vincentccc
1
Many samples don't work for me
#30 opened 7 months ago by KarelPeeters
5
Cudnn Error InstanceNormalizationPlugin
#58 opened 8 months ago by ninono12345
5
Is dgrad+relu with fp32 supported?
#40 opened 10 months ago by tangjicheng46
4
Forward conv1d + transposition + conv1d ?
#9 opened 10 months ago by touisteur
5
Why `cudnnConvolutionBackwardData` call `cudnn::ops::convertTensor_kernel<__half, __half, float, 0>(float, __half const*` ?
#21 opened 10 months ago by strint
2
Number of heuristic engine configs mismatch by calling getEngineConfigCount and getEngineConfig
#17 opened 10 months ago by infloop777
8
question about the fusion_sample
#42 opened 10 months ago by cheneeheng
1
Update single header file for nlohmann json
#50 opened a year ago by ernestyalumni
2
Execute matmul op faild
#39 opened a year ago by Gebixiaochen
13
Why implicit_convolveNd_hhgemm consume too long
#22 opened a year ago by yc-gao
3
error: ambiguous overload for ‘operator*’ in test_list.cpp
#47 opened a year ago by Kevin-Delnoije
2
About cudnn backend
#29 opened 2 years ago by pdd-vn
0
Install CUDNN on NVIDIA Jetson AGX conda environment
#19 opened 3 years ago by sivashankar28
2
how to map to the original algorithm
#20 opened 3 years ago by jackmsye
2
need default return value for cudnn_frontend::PointWiseDesc_v8::getPortCount() const
#25 opened 3 years ago by azrael417
1
How to use cudnn backend API to do int8x32 convolution calculation on Ampere?
#8 opened 3 years ago by ZhaoJob
5
Lack of activation function LeakyReLU
#18 opened 3 years ago by akineeic
6
CUDNN not working with RTX A4000
#16 opened 3 years ago by vsantosu
2
Error: CUDNN_STATUS_EXECUTION_FAILED
#12 opened 3 years ago by akineeic
8
Support for half2 type convolution
#13 opened 3 years ago by infloop777
5
Cannot build nvidia-tensorflow with v0.5
#15 opened 3 years ago by ziyuang
5