bigscience-workshop/biomedical

Outdated import in GGPONC2 loader

nachollorca opened this issue · 0 comments

Describe the bug

The GGPONC2 loader is using bigbio.utils.constants import Tasks from an older BigBIO version instead of the new from .bigbiohub import Tasks. This prevents the data from being loaded when load_dataset("bigbio/ggponc2").

Steps to reproduce the bug

from datasets import load_dataset
load_dataset("bigbio/ggponc2", data_dir='...', name='...')

Expected results

The dataset is loaded.

Actual results

The loading crashes on the import.

Environment info

  • datasets version: 2.9.0
  • Platform: Linux-5.4.0-92-generic-x86_64-with-glibc2.29
  • Python version: 3.8.10
  • PyArrow version: 11.0.0
  • Pandas version: 1.5.3