License: Apache License 2.0 (Apache-2.0)

rwkv_cuda

A simple test project with minimal dependencies.

layernorm / softmax / wkv_forward: custom CUDA kernels written in the OneFlow style
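When checking kernels like these, a plain CPU reference is often handy for comparing outputs. The following is a minimal sketch of what single-row softmax and layernorm compute, not the repository's kernel code. The function names `softmax_ref` and `layernorm_ref`, the `eps` default, and the use of biased variance are assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference: numerically stable softmax over one row.
// Subtracting the row maximum keeps exp() from overflowing.
std::vector<float> softmax_ref(const std::vector<float>& x) {
    assert(!x.empty());
    const float max_val = *std::max_element(x.begin(), x.end());
    std::vector<float> y(x.size());
    float sum = 0.0f;
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = std::exp(x[i] - max_val);
        sum += y[i];
    }
    for (float& v : y) {
        v /= sum;
    }
    return y;
}

// Hypothetical CPU reference: layer normalization over one row,
// y[i] = (x[i] - mean) / sqrt(var + eps) * gamma[i] + beta[i],
// using the biased variance (divide by n). eps = 1e-5 is an assumed default.
std::vector<float> layernorm_ref(const std::vector<float>& x,
                                 const std::vector<float>& gamma,
                                 const std::vector<float>& beta,
                                 float eps = 1e-5f) {
    assert(!x.empty());
    assert(gamma.size() == x.size() && beta.size() == x.size());
    const float n = static_cast<float>(x.size());

    float mean = 0.0f;
    for (float v : x) {
        mean += v;
    }
    mean /= n;

    float var = 0.0f;
    for (float v : x) {
        var += (v - mean) * (v - mean);
    }
    var /= n;

    const float inv_std = 1.0f / std::sqrt(var + eps);
    std::vector<float> y(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = (x[i] - mean) * inv_std * gamma[i] + beta[i];
    }
    return y;
}
```

A GPU result copied back to the host could be compared element by element against these outputs with a small tolerance.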

gemm / gemv: a slightly modified CUTLASS 3.1

argsort: Thrust
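For reference, argsort returns the indices that would sort an array rather than the sorted values. Below is a minimal CPU sketch of that behavior, useful for checking a GPU result. It is not the repository's code: the name `argsort_ref` and the `descending` flag are hypothetical. On the GPU, one common way to get the same result with Thrust is to fill an index array with `thrust::sequence` and then call `thrust::sort_by_key`, though whether this project does so is not stated here.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical CPU reference: return the permutation of indices that
// sorts `values`. Ties keep their original order (stable sort).
std::vector<std::size_t> argsort_ref(const std::vector<float>& values,
                                     bool descending = false) {
    std::vector<std::size_t> idx(values.size());
    std::iota(idx.begin(), idx.end(), 0);  // idx = 0, 1, 2, ...
    std::stable_sort(idx.begin(), idx.end(),
                     [&](std::size_t a, std::size_t b) {
                         return descending ? values[a] > values[b]
                                           : values[a] < values[b];
                     });
    return idx;
}
```

For example, the input `{0.3, 0.1, 0.2}` gives indices `{1, 2, 0}` in ascending order.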

Minimal test: under the bin folder, invoked with Node.js and koffi.