High VRAM usage after img2img when worker stays alive
HoopyFreud opened this issue · 4 comments
HoopyFreud commented
Unless the worker is killed after completing an img2img job, onnx_server continues to use high VRAM, resulting in large slowdowns in subsequent generation, even when cache limit is set to zero and the server cache is empty. Can be worked around by setting ONNX_WEB_JOB_LIMIT=1
HoopyFreud commented
HoopyFreud commented
HoopyFreud commented
HoopyFreud commented
This appears to have magically fixed itself with the new pipeline model