/babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Issues