Unofficial implementations for optimized decoding strategies of large language models
Primary LanguageJupyter Notebook