ali-vilab/dreamtalk

No audio I/O backend is available

StartHua opened this issue · 2 comments

Some weights of the model checkpoint at jonatasgrosman/wav2vec2-large-xlsr-53-english were not used when initializing Wav2Vec2Model: ['lm_head.bias', 'lm_head.weight']

  • This IS expected if you are initializing Wav2Vec2Model from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
  • This IS NOT expected if you are initializing Wav2Vec2Model from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
    Traceback (most recent call last):
    File "inference_for_demo_video.py", line 178, in
    speech_array, sampling_rate = torchaudio.load(wav_16k_path)
    File "D:\python\Anaconda\envs\dreamtalk\lib\site-packages\torchaudio\backend\no_backend.py", line 20, in load
    raise RuntimeError('No audio I/O backend is available.')
    RuntimeError: No audio I/O backend is available.

The code has been tested without issues on Ubuntu. Based on the error message you received, you might consider installing an audio I/O backend.

The code has been tested without issues on Ubuntu. Based on the error message you received, you might consider installing an audio I/O backend.

pip install soundfile (win)

pip install sox (linux)