omegaconf.errors.ConfigAttributeError: Missing key datafiles
swcrazyfan opened this issue · 3 comments
🐛 Bug
I followed everything in the documentation precisely on Kaggle, Colab, and a TPU VM, but it always gives me a "Missing key datafiles" error.
To Reproduce
Steps to reproduce the behavior:
Run this Kaggle notebook. Then, see the final output.
This is the error I get, but you can read the full thing in the notebook:
Error executing job with overrides: ['dataset.cfg.train_file=../input/netkjvsourcetarget/train.json', 'dataset.cfg.validation_file=../input/netkjvsourcetarget/eval.json', 'task=nlp/summarization', 'trainer.gpus=1']
Traceback (most recent call last):
  File "train.py", line 10, in hydra_entry
    main(cfg)
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 77, in main
    logger=logger,
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 52, in run
    data_module.setup("fit")
  File "/opt/conda/lib/python3.7/site-packages/pytorch_lightning/core/datamodule.py", line 474, in wrapped_fn
    fn(*args, **kwargs)
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 31, in setup
    dataset = self.load_dataset()
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 62, in load_dataset
    elif self.cfg.datafiles:
omegaconf.errors.ConfigAttributeError: Missing key datafiles
    full_key: datafiles
    object_type=dict
Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
Can anyone point me in the right direction?
We're moving away from keeping Hydra configs in this repo, and all examples are now pure PyTorch Lightning. Let me know if you run into any issues with this and I can help you work around them!