bzhangGo/transformer-aan
Source code for "Accelerating Neural Transformer via an Average Attention Network"
Python, BSD-3-Clause
Stargazers
- aliceatlas (Brooklyn, NY)
- almightyGOSU (Singapore)
- arvidzt (Beijing)
- chinakook
- codealphago
- crack521
- egrcc (Alibaba Group)
- fly51fly (PRIS)
- hiyouga (Millennium Science School)
- hsjkdjj
- Kaixin-Wu (Shenyang, China)
- kelayamatoz
- li910802
- luciencho
- LuJunru (Coventry, UK)
- luyaojie (ICIP, ISCAS)
- PromptExpert (Beijing)
- Qsevent
- sanmusunrise
- shihuaxing (http://www.deepintell.cn/)
- sunnybest1990
- TB-SeChaJia
- techstone
- vanishcode (@okx)
- vanzytay (Google)
- walmsley
- wang19742008
- whr94621 (Nanjing University)
- xiangliu886
- XiaoqingNLP (JD)
- xingjin2017
- yichuan9527 (Institute of Automation, Chinese Academy of Sciences)
- youngornever
- zhangjcqq
- ZhbChopin
- zhengzx-nlp (Shanghai, China)