- Interface:
src/hf_benchmark_sample/task_base.py
- Interface for Tasks
- Task:
src/hf_benchmark_sample/tasks/sample_rte_task.py
- Sample implementation of RTE task
- Sample run script:
sample_run.py
- Sample script that runs the evaluation and uploads the results to a dataset repo on the Hub