Issues
- 0
Initialize a Parakeet Cache-Aware Streaming model's encoder from an offline model.
#11250 opened by gabitza-tech - 2
- 0
How to convert downloaded local models into Nemo files?
#11248 opened by llll111111 - 2
- 2
- 3
Global shape mismatch for loaded ((1024, 768)) and expected ((512, 768)) tensor for key model.embedding.position_embeddings.weight
#10715 opened by Alireza3242 - 2
- 1
Resuming from a checkpoint that ended before the epoch ended and your dataloader is not resumable
#10797 opened by AudranBert - 3
[Question] Converting a Megatron-LM ckpt to Nemo
#10831 opened by abgoswam - 1
- 1
Conversion script of phi3 from HF to Nemo
#10825 opened by zhouxie000 - 1
Need env.yaml or requirements.txt
#11231 opened by jugal-sheth - 0
- 1
[NeVa Pretraining] Vision Encoder Created on All GPUs During Pipeline Parallelism
#10805 opened by Esthesia - 0
Nemo run pretrain raised Exception: No dot product attention support for the provided inputs
#11218 opened by ycchenzheng - 0
Availability of Meeting MSDD models
#11207 opened by uro-sh - 0
- 0
NameError: name' flash_attn_with_kvcache 'is not defined
#11200 opened by 1074224619 - 1
How to use wandb sweep for hyperparameter search when finetuning with llama2
#11151 opened by PurvangL - 0
Would it be possible to use pyctcdecode with ASR cache-aware streaming?
#11182 opened by gabitza-tech - 0
tutorials/tts/Inference_ModelSelect.ipynb is throwing error when installing dependancies
#11178 opened by bnarasimha - 2
`IPython` should be included in the requirements
#10772 opened by MahmoudAshraf97 - 0
Deploy ASR STT Streaming model
#11019 opened by rkchamp25 - 0
How to export fastconformer based asr model to support torchscript model?
#11161 opened by zw76859420 - 0
How to finetune the NEST model with CTC Loss for ASR task?
#11163 opened by zw76859420 - 2
When do add the code for Target Speaker Extraction, thank!
#10830 opened by haha010508 - 0
convert_qwen2_hf_to_nemo error
#11142 opened by huangqingyi-code - 6
The SDXL Infer output image is full of noise
#10938 opened by blacklong28 - 0
How to Visualize Backward Computational Graph
#11139 opened by zixianwang2022 - 0
Slurm interactive mode, transcribe_speech_parallel.py gets stuck on consecutive runs
#11105 opened by itzsimpl - 2
NeMo2.0 nemorun llm export ValueError: PyTorch DDP is not enabled for mcore optimizer
#10939 opened by lifeiteng - 2
Megatron Multilingual En Any 500M
#10976 opened by rogerwelo - 0
Code adoptability for other pretrained ASR
#11074 opened by Amg9794 - 1
When training ASR models, it saves .nemo 2 times in a row
#10798 opened by AudranBert - 0
activate subscription: Not able to find the Promotional Code, Serial Number, or Token to run the inference script?
#11064 opened by Alla-Abdella - 3
srun issue with nemorun
#10997 opened by RachitBansal - 1
Unable to export MSDD model to pt or ONNX
#10999 opened by jingzhaoo - 0
RuntimeError: Function 'AcosBackward0' returned nan values in its 0th output.
#11025 opened by gor2000 - 1
[Question] Pipeline Parallel for Mamba Megatron
#10900 opened by zixianwang2022 - 2
NeMO dependency issues on HuggingFace Hub (for ASR models)
#10940 opened by bhavnicksm - 0
Converting Mamba to tp4: RuntimeError: The size of tensor a (18560) must match the size of tensor b (4640) at non-singleton dimension 0
#10966 opened by zixianwang2022 - 0
NeuralDiarizer with the telephonic config mix speakers at the very beginning of shorter audio files (less than 2 minutes duration)
#10988 opened by uro-sh - 0
canary-1b is not exportable
#11004 opened by pdufour - 1
Modules fail for Dreambooth example
#10888 opened by paulaserna16 - 0
global batch size at different sequence length
#10905 opened by erhoo82 - 0
Link Not Found at Mamba Tutorial
#10899 opened by zixianwang2022 - 0
- 0
Add Hydrarunner to oomptimizer
#10882 opened by bonham79 - 0
SFT stage use context parallel with flash attention error
#10876 opened by ARQlalala - 0