A small generated dataset for testing LLM knowledge of facts related to 50 randomly sampled countries. Partially human verified and comes with answer predictions made by 3 open source models LLMs.
gilpasternak35/GCF-QA
A small generated dataset for testing LLM knowledge of facts related to 50 randomly sampled countries. Partially human verified and comes with answer predictions made by 3 open source models LLMs.
MIT