microsoft/hi-ml

Hyperdrive jobs can't be submitted with SDK v2

ant0nsc opened this issue · 2 comments

Hyperdrive jobs can't be submitted with SDK v2

the docker shm size is not propagated to the child runs, so they get 2Gb (default) and immediately go out of memory (dataloader killed error)

the docker shm size is not propagated to the child runs, so they get 2Gb (default) and immediately go out of memory (dataloader killed error)

FYI: #880 (comment)