Sample HF Benchmark

  • Interface: src/hf_benchmark_sample/task_base.py
    • Interface for Tasks
  • Task: src/hf_benchmark_sample/tasks/sample_rte_task.py
    • Sample implementation of RTE task
  • Sample run script: sample_run.py
    • Sample script that runs the evaluation and uploads the results to a dataset repo on the Hub