okvqa-stage1-pretrain

Question

okvqa-stage1-pretrain

123susu opened this issue a year ago · 3 comments

when l pretrain in okvqa use mcan model,it error
raise LocalEntryNotFoundError( huggingface_hub.utils._errors.LocalEntryNotFoundError: Cannot find the requested files in the disk cache and outgoing traffic has been disabled. To enable hf.co look-ups and downloads online, set 'local_files_only' to False.
During handling of the above exception, another exception occurred:
OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like bert-large-uncased is not the path to a directory containing a file named config.json.
Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.

so need download bert-large-uncased online? and run code offline?

Answer 1 · 2023-08-04T09:09:46.000Z

Yes. You can download the HF model repo manually. Then set the first argument of from_pretrained to the local path of downloaded repo and run the code in offline mode.

Answer 2 · 2023-08-04T09:18:22.000Z

ok,l solve this question, and l want ask the question is:
What is the minimum cuda(video) memory requirement?
Although l set pretrain batchsize is 2,l can't train,it needs 20G, and my is 8G.it error is cuda out of memory.
and what can l do to train in my 8G cuda(video) memory?
cuda memory is 显存。thanks!

Answer 3 · 2023-08-04T13:28:30.000Z

You need a machine with more than 20G GPU memory. Consider renting a cloud server equipped with a single Nvidia RTX3090 card.