/AQA-Eval

Primary LanguagePython

IQA - benchmarking interactive information seeking from large language models

  • main.py contains the main exection code.
  • models/ contains the model definitions or API calls to LLMs.
  • data/ contains the data files.
  • benchmarks/ contains the benchmark definitions.