Investigating a soft constraint on the recurrent weight matrix orthogonality. Comparison to gradient clipping on sequential MNIST.
Primary LanguagePython
No issues in this repository yet.