/merge

a collection of transformer-based model implementations from various papers

Primary language: Python. License: MIT.

merge

my collection of paper implementations and experiments. built to be modular, easy to extend, and easy to experiment with.