This project aims to build an audio classifier using TensorFlow to detect Capuchinbird calls from audio recordings. The key steps include:
- Installation: Installing necessary libraries like TensorFlow, TensorFlow-IO, and Matplotlib.
- Data Preparation: Loading audio files, preprocessing them by resampling to 16kHz, and converting them to spectrograms.
- Model Training: Creating and training a Convolutional Neural Network (CNN) on labeled audio data of Capuchinbird calls and other sounds.
- Prediction on New Data: Applying the trained model to new audio recordings to detect Capuchinbird calls, using a sliding window approach to handle long recordings.
- Export Results: Aggregating the model predictions and exporting the results to a CSV file.
Install required libraries:
pip install tensorflow==2.8.0 tensorflow-io==0.25.0 matplotlib
Load and preprocess audio files, converting them to a standard format suitable for model training.
Train a CNN on the preprocessed audio data to classify Capuchinbird calls.
Use the trained model to detect Capuchinbird calls in new audio recordings and post-process the results to identify distinct calls.
Export the final detection results to a CSV file for analysis.
This project is based on the tutorial by Nick Nochnack. For more information, visit Nick Nochnack's GitHub.