AGENDD/RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the idea of SLAM_ASR and used the RWKV language model as the LLM, and instead of directly writing a prompt template we directly finetuned the initial state of the RWKV model.
Python
Stargazers
- Alic-LiChina.zhejiang.shaoxing
- atuxhe
- brightLLer
- cahya-wirawanVienna, Austria
- cgisky1980
- dlxj
- Evilran/dev/sd
- flowspeech
- fly51flyPRIS
- HaiFengZeng
- jadexlaw
- kakashidan
- liziruBeijing
- lovemefanGuangZhou
- manbaaaa
- May-KiriharaHarakiri-Works
- nmfisherMelbourne
- nshmyrevAlpha Cephei Inc
- OpenMOSEAsian
- pccaiUBI
- qgzangUSTC
- Ryu1845
- SolarWindRider
- splinter21
- tuocheng0824
- Wissotsky
- wxbool
- xiaol
- yuzhaouoe
- zw76859420Trip
- zyzisyzLidun Jia University