xinghaochen/SLAB
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"
Python
Issues
- 0
SLAB Swin Base Pretrained Weight
#8 opened by YangLi309 - 9
Setting about step
#6 opened by Journey7331 - 2
About RepBN
#7 opened by leily578 - 2
参数warm,iter和total_step
#5 opened by sdreamforchen - 1
About total step in your code
#3 opened by Zhaohuii-Wang - 1
Can SLA be used for cross-attention?
#2 opened by L1bertad - 5
Question about dataset
#4 opened by Xiaozeeze - 1
Any comparison with RMSNorm?
#1 opened by radarFudan