Experiments in finetuning language models
This "live" repository contains experimental work on finetuning language models (LMs).
The focus is on decoder-only transformers; see the selected models: https://github.com/almugabo/LMFit/blob/main/selected_models.md
Its structure may change over time, but for now it is organized around 4 topics:
scripts for efficient text generation from LMs
experimenting with finetuning language models
experimenting with the evaluation of language models