/StartTransformer_0

🌱 StartTransformer is a new transformer structure built with time-wise normalization and a new way of allocating parameters to the FFN, intended to train transformer-like structures stably with far fewer parameters. Its basic idea can also be applied to developing many other structures.

Primary Language: Python
