LLM Benchmarking

This code was used to generate benchmarking results comparing text generation speeds on g5.12xlarge (AWS) and 1 x A100 (Vultr) hardware setups.

krohling/llm-benchmark