tinyevals Capability evaluations of language models trained on TinyStories dataset setup make python 3.10 virtual env in .venv install dependencies pip install -r requirements.txt install the project in editable state pip install -e . run tests pytest