Scalable Hierarchical Self-Attention with Learnable Hierarchy for Long-Range Interactions
Primary LanguagePython