google-research/nested-transformer

This seems like it would be a great option for increasing the context window on sequence inputs. Have you tried that yet?

Tylersuard opened this issue · 0 comments

Just suggesting you try a similar architecture with sequences rather than images.
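
To make the suggestion concrete, here is a rough sketch of what a 1D version might look like. It is not code from this repo. It assumes a simplified scheme: split the sequence into non-overlapping blocks, run self-attention inside each block only, then merge neighbouring tokens with max pooling and repeat. The function names (`local_block_attention`, `aggregate`, `nested_sequence_encoder`) are hypothetical. Attention uses identity Q/K/V with no learned weights, so this only illustrates the data flow.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def local_block_attention(x, block_size):
    """Self-attention restricted to non-overlapping blocks (hypothetical helper).

    x has shape (seq_len, dim); seq_len must be divisible by block_size.
    Identity Q/K/V projections are an assumption made to keep the sketch short.
    """
    seq_len, dim = x.shape
    blocks = x.reshape(seq_len // block_size, block_size, dim)
    scores = blocks @ blocks.transpose(0, 2, 1) / np.sqrt(dim)
    out = softmax(scores, axis=-1) @ blocks
    return out.reshape(seq_len, dim)

def aggregate(x, pool=2):
    """Merge each group of `pool` neighbouring tokens by max pooling."""
    seq_len, dim = x.shape
    return x.reshape(seq_len // pool, pool, dim).max(axis=1)

def nested_sequence_encoder(x, block_size=4, pool=2):
    """Alternate local attention and aggregation until one block remains.

    Assumes seq_len == block_size * pool**k for some integer k >= 0.
    """
    levels = 0
    while x.shape[0] > block_size:
        x = local_block_attention(x, block_size)
        x = aggregate(x, pool)
        levels += 1
    x = local_block_attention(x, x.shape[0])  # top level: a single block
    return x, levels

rng = np.random.default_rng(0)
tokens = rng.normal(size=(32, 8))            # 32 tokens, 8 features each
encoded, levels = nested_sequence_encoder(tokens)
print(encoded.shape, levels)
```

In this sketch each level computes `seq_len * block_size` attention scores rather than `seq_len ** 2`, which is where the potential benefit for longer contexts would come from. Whether it actually works well on sequences is the open question.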