SOTA discrete codec models with forty tokens per second for audio language modeling
Primary language: Python. License: MIT.