omegaconf.errors.ConfigAttributeError: Missing key datafiles

Question

omegaconf.errors.ConfigAttributeError: Missing key datafiles

swcrazyfan opened this issue 3 years ago · 3 comments

🐛 Bug

I follow everything in the documentary precisely on Kaggle, Colab, and a TPU VM, but it always give me a missing key datafiles error.

To Reproduce

Steps to reproduce the behavior:

Run this Kaggle notebook. Then, see the final output.

This is the error I get, but you can read the full thing in the notebook:

Error executing job with overrides: ['dataset.cfg.train_file=../input/netkjvsourcetarget/train.json', 'dataset.cfg.validation_file=../input/netkjvsourcetarget/eval.json', 'task=nlp/summarization', 'trainer.gpus=1']
Traceback (most recent call last):
  File "train.py", line 10, in hydra_entry
    main(cfg)
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 77, in main
    logger=logger,
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 52, in run
    data_module.setup("fit")
  File "/opt/conda/lib/python3.7/site-packages/pytorch_lightning/core/datamodule.py", line 474, in wrapped_fn
    fn(*args, **kwargs)
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 31, in setup
    dataset = self.load_dataset()
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 62, in load_dataset
    elif self.cfg.datafiles:
omegaconf.errors.ConfigAttributeError: Missing key datafiles
    full_key: datafiles
    object_type=dict

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.

Answer 1 · 2022-04-19T09:37:58.000Z

Can anyone point me in the right direction?

Answer 2 · 2022-06-19T01:26:00.000Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Answer 3 · 2022-06-21T14:02:44.000Z

We're moving away from Hydra configs existing in this repo, and all examples are now pure Pytorch Lightning. Let me know if there are any issues with this and I can help get around this!