/convert_checkpoint_to_lsg

Efficient Attention for Long Sequence Processing

Primary language: Python. License: MIT.