tinyevals

Capability evaluations of language models trained on the TinyStories dataset

Primary language: Jupyter Notebook. License: Apache License 2.0 (Apache-2.0).

setup

  1. Create a Python 3.10 virtual environment in .venv
  2. Install the dependencies: pip install -r requirements.txt
  3. Install the project in editable mode: pip install -e .
  4. Run the tests: pytest
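
The four steps above can be sketched as a small Python helper that builds each command in order and prints it, running it only on request. This is a minimal sketch, not part of the project: the function names `setup_commands` and `run_setup`, the `--run` flag, and the assumption that a `python3.10` executable is on the PATH are all illustrative.

```python
import subprocess
import sys
from pathlib import Path


def setup_commands(venv_dir: str = ".venv") -> list[list[str]]:
    """Return the setup steps as argument lists, in the order listed above."""
    # Assumption: virtual environments keep their interpreter in bin/
    # (Scripts/ on Windows).
    bin_dir = "Scripts" if sys.platform == "win32" else "bin"
    venv_python = str(Path(venv_dir) / bin_dir / "python")
    return [
        # Step 1: assumes an interpreter named python3.10 is on the PATH.
        ["python3.10", "-m", "venv", venv_dir],
        # Step 2: install the dependencies into the virtual environment.
        [venv_python, "-m", "pip", "install", "-r", "requirements.txt"],
        # Step 3: install the project itself in editable mode.
        [venv_python, "-m", "pip", "install", "-e", "."],
        # Step 4: run the test suite with the environment's interpreter.
        [venv_python, "-m", "pytest"],
    ]


def run_setup(venv_dir: str = ".venv", dry_run: bool = True) -> None:
    """Print each command; execute it as well when dry_run is False."""
    for cmd in setup_commands(venv_dir):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first failing step.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical flag: pass --run to execute instead of only printing.
    run_setup(dry_run="--run" not in sys.argv)
```

Calling the environment's own interpreter with `-m pip` and `-m pytest` avoids having to activate the virtual environment first, so the same commands work from any shell.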