google-research/t5x

Flash attention

peregilk opened this issue · 0 comments

Is Flash attention (or Splash attention, as it is called in MaxText) implemented in T5X? If not, are there any plans to implement it?