Classic Transformer Model

  • Learning resources:
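| Model | Original paper | Source code | Test datasets and results in this project | Reference videos | Reference articles |
| --- | --- | --- | --- | --- | --- |
| Vision Transformer | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (arxiv.org, readpaper.com) | google, WZMIAOMIAO, KKKSQJ | | Paper walkthrough, structure walkthrough, code walkthrough | Multi-Head Attention walkthrough, ViT structure walkthrough, zhihu.com |
| Swin Transformer | Swin Transformer: Hierarchical Vision Transformer using Shifted Windows (arxiv.org, readpaper.com) | microsoft, WZMIAOMIAO, KKKSQJ | | Paper walkthrough, structure walkthrough, code walkthrough | Structure walkthrough, zhihu.com, CSDN blog |
| CoAtNet | CoAtNet: Marrying Convolution and Attention for All Data Sizes (arxiv.org, readpaper.com) | KKKSQJ | | Close reading of the paper, code walkthrough (part 1), code walkthrough (part 2) | zhihu.com |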