This repository runs benchmarks on several popular, openly available language models.
Running the unit tests requires the pytest module, invoked as follows:
python -m pytest test
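With pytest's default discovery rules, the command above collects files named test_*.py or *_test.py under the test directory and runs the functions in them whose names start with test_. As a minimal sketch of what such a file could look like, the example below defines a hypothetical accuracy helper and two tests for it. The file name, the helper, and the sample answers are assumptions made for illustration and are not taken from this repository.

```python
# test/test_accuracy.py  (hypothetical file name, not part of this repository)


def accuracy(predictions, labels):
    """Hypothetical helper: fraction of predictions that equal their labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        # Define the accuracy of an empty benchmark as 0.0 to avoid dividing by zero.
        return 0.0
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def test_accuracy_counts_matching_answers():
    # Two of the three answers match their labels.
    assert accuracy(["A", "B", "C"], ["A", "B", "D"]) == 2 / 3


def test_accuracy_of_empty_input_is_zero():
    assert accuracy([], []) == 0.0
```

pytest reports each test_ function separately, so a failing assertion points directly at the case that broke.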
The published Docker image can be used as a starting point for model configuration:
https://hub.docker.com/repository/docker/aakarsh/llm_calibration/general