harrisonvanderbyl/rwkv-cpp-accelerated
A torchless C++ RWKV implementation using 8-bit quantization, written in CUDA/HIP/Vulkan for maximum compatibility and minimal dependencies
C++ · MIT license
Issues
NumCpp NdArray not initialized
#34 opened by montecarlo26 - 2
Questions about int8 quantization.
#39 opened by zzczzc20 - 2
converter failure
#38 opened by malv-c - 0
ACO ERROR: Unsupported opcode
#36 opened by Jipok - 1
Something went wrong while converting the model
#33 opened by innnk - 1
Radeon Open Compute support
#31 opened by erkinalp - 0
Training, in -cpp-cuda, on one machine?
#30 opened by SCRIER-org - 1
compiling for windows
#29 opened by RichardErkhov - 1
fail to load on windows
#27 opened by flizzywine - 2
Why is the output all attout floating-point values?
#26 opened by wangshankun - 2
Error converting to bin
#21 opened by sammyf - 1
Faster model loading
#13 opened by nenkoru - 6
Add Dockerfile
#2 opened by nenkoru - 3
Memory leak in rwkv.h
#17 opened by maksmaisak - 1
Cannot build
#11 opened by dillfrescott - 6
Endless <|endoftext|> bug
#4 opened by Murugurugan - 1
Not working on 7B & 14B models | Torch Binding
#10 opened by nenkoru - 1
Add basic example of a chat using Python
#6 opened by nenkoru - 0
Multi-GPU support
#3 opened by nenkoru