NVIDIA/NeMo

Megatron Multilingual En Any 500M


Hi!

I'm using the Megatron Multilingual En Any 500M model with this example script.

I got the script to run without errors in a NeMo Docker container. However, the output file contains only empty lines, which suggests the translation isn't actually working.
The model appears to load fine, and the logs give no hint of a problem.
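
For anyone wanting to confirm the same symptom without eyeballing the file, here is a minimal sketch that counts blank lines in the output. The helper `count_empty_lines` and the file name `translations.txt` are hypothetical, not part of NeMo or the example script.

```python
from pathlib import Path


def count_empty_lines(path):
    """Return (empty, total) line counts for a UTF-8 text file.

    A line counts as empty if it contains only whitespace.
    """
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    empty = sum(1 for line in lines if not line.strip())
    return empty, len(lines)


# Example usage (hypothetical output path):
# empty, total = count_empty_lines("translations.txt")
# print(f"{empty} of {total} lines are empty")
```

If every line is reported as empty, the script is writing one line per input sentence but the decoded text itself is blank, which points at the inference step rather than at file handling.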

Any idea what is going wrong? Is this model still supported?

Thanks!

Hi @rogerwelo. Can you specify which NeMo container you are using?

I've been using this one (nvcr.io/nvidia/nemo:23.08) for inference with this model. If you are using a different container, can you try this one instead?

Shuoyang

I ended up using Riva and followed the steps described here. This worked fine.