HPI-DeepLearning/crnn-lid

Changing to shorter segments

Themba4Sho opened this issue · 1 comments

What do I need to change to train on shorter segments? I used data with a minimum length of 3 seconds, but the wav_to_spec module still only processes speech that is 10 seconds or longer.

In case you still want an answer:

  1. Go to config.yaml.
  2. Find the input shape parameter, e.g. [129, 500, 1].
  3. Set segment_length to 3. This is the minimum audio length in seconds.
  4. Set pixel_per_second = 500 / segment_length.
    In general, you must make sure that input_shape[1] = segment_length * pixel_per_second, whichever of these values you choose to adjust.
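
To sanity-check the numbers before training, here is a rough sketch of the relation above in Python. The helper names (width_for, pixel_per_second_for, is_consistent) are made up for illustration and are not part of crnn-lid. It also assumes pixel_per_second should be a whole number, which I have not verified against the code. Note that 500 / 3 is not a whole number, so for 3-second segments it may be easier to change the width in input_shape instead; the 50 pixels per second below is only an example value.

```python
# Hypothetical helpers, not part of crnn-lid: they only check the relation
# input_shape[1] == segment_length * pixel_per_second from the steps above.

def width_for(segment_length, pixel_per_second):
    """Spectrogram width (input_shape[1]) implied by the two settings."""
    return segment_length * pixel_per_second


def pixel_per_second_for(width, segment_length):
    """pixel_per_second needed for a given width; assumes it must be an integer."""
    if width % segment_length != 0:
        raise ValueError(
            f"width {width} is not divisible by segment_length {segment_length}"
        )
    return width // segment_length


def is_consistent(input_shape, segment_length, pixel_per_second):
    """True when the configured width matches segment_length * pixel_per_second."""
    return input_shape[1] == width_for(segment_length, pixel_per_second)


if __name__ == "__main__":
    # 10-second segments with a width of 500 give 50 pixels per second.
    print(pixel_per_second_for(500, 10))
    # 3-second segments at an example rate of 50 pixels per second.
    print(width_for(3, 50))
    print(is_consistent([129, 150, 1], 3, 50))
```

With these example values, a 3-second setup would use a width of 150, i.e. an input shape of [129, 150, 1], rather than keeping 500.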