INT4 and FP16 inference on CPU for the RWKV language model
Primary language: C++. License: MIT.
This repository is not active