peregilk opened this issue 8 months ago · 0 comments
Is Flash attention (or Splash attention that it is called in MaxText) implemented in T5X? Or are there any plans to implement it?