johnmartinsson/bird-species-classification

Divide Spectrograms into Chunks

Closed this issue · 1 comments

Implement a method which divides the spectrograms into equal chunks.

Implementation details:

  1. Split spectrogram into chunks of equal size (length 512).

Motivation:

  1. Need fixed sized input for the neural network architecture.
    • Allow to pad only the last part, and keep step size constant
  2. Each chunk can be used as a unique sample for training (since "empty" parts have been removed)
  3. Network can make multiple predictions per sound file, and average them to generate a final prediction.

This issue has been resolved.