How to handle with the ConnectionResetError
Closed this issue · 11 comments
Hello, I'm a beginner of neptune-client, and the version of 1.10.4 is used. I'm trying to revise the code of "genetic expert guided learning" based on the legacy api. When I just update the codes about neptune api the benchmark program can be completed normally. However, after further introducing my revised code of the scoring function, the benchmark program raises a ConnectionResetError halfway. I have set NEPTUNE_ALLOW_SELF_SIGNED_CERTIFICATE=True, but the problem is still unresolved.
What are the possible causes of the error? How should I handle with it?
Hey @ganfisher 👋
Thank you for reporting this issue! Could you please provide us with more details:
1. Describe the bug.
A clear and concise description of the problem you're experiencing.
2. Reproduction
Please provide steps to reproduce the issue or code snippet in as much detail as possible.
3. Expected behavior
A clear and concise description of what you expected to happen.
4. Traceback
If applicable, add traceback or log output/screenshots to help explain your problem.
5. Environment info
The output of pip list
:
The operating system you're using:
**The output of python --version
:**
6. Additional context
Add any other context about the problem here.
Hello, sorry for the answer delay. I have migrated the old API codes to the new API codes, and the ConnectionResetError does not appear again. However, the running might be stuck after a few epoches, and no exceptions are raised. The running stuck is like the phenomenon described in issue #1151 , but neptune logger and Pytorch Lightning are not explicitly called in my case. I'm still trying to handle the problem by myself.
Hello, sorry for the answer delay. I have migrated the old API codes to the new API codes, and the ConnectionResetError does not appear again. However, the running might be stuck after a few epoches, and no exceptions are raised. The running stuck is like the phenomenon described in issue #1151 , but neptune logger and Pytorch Lightning are not explicitly called in my case. I'm still trying to handle the problem by myself.
I will try the offline mode first.
Hey @ganfisher ,
Please let me know if the offline mode works.
If it does, the amount of metadata logged asynchronously might be too much for your network.
Hey @ganfisher ,
Please let me know if the offline mode works. If it does, the amount of metadata logged asynchronously might be too much for your network.
Yeah, the offline running could finish successfully. However, HTTPError could be raised from time to time, if I try to synchronise the offline results with "neptune sync". Considering that the "ring" symbol meaning "synchronising" exists, the HTTPError may not matter?
Is there any way to visualize the logged metadata locally?
Considering that the "ring" symbol meaning "synchronising" exists, the HTTPError may not matter?
Could you post a screenshot of the traceback?
Is there any way to visualize the logged metadata locally?
No, unfortunately.
Yeah, that's fine.
As long as the sync completes successfully, there should not be any cause for concern ✅
Yeah, that's fine. As long as the sync completes successfully, there should not be any cause for concern ✅
Thanks, I will close the issue, if all the data can be uploaded successfully.
Yeah, that's fine. As long as the sync completes successfully, there should not be any cause for concern ✅
Thank you for your help, all the metadata can be uploaded with enough time.