togethercomputer/stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
Python · Apache-2.0
Issues
gradient checkpointing is not implemented
#22 opened by xiyang-aads-lilly
flash attention not compatible?
#20 opened by oxPJ
Apple Silicon support
#5 opened by amrohendawi
import FlashDepthwiseConv1d?
#3 opened by Hambaobao
docker build crashes my machine
#2 opened by dustyatx