Capuchinbird Call Detector

This project aims to build an audio classifier using TensorFlow to detect Capuchinbird calls from audio recordings. The key steps include:

Installation: Installing necessary libraries like TensorFlow, TensorFlow-IO, and Matplotlib.
Data Preparation: Loading audio files, preprocessing them by resampling to 16kHz, and converting them to spectrograms.
Model Training: Creating and training a Convolutional Neural Network (CNN) on labeled audio data of Capuchinbird calls and other sounds.
Prediction on New Data: Applying the trained model to new audio recordings to detect Capuchinbird calls, using a sliding window approach to handle long recordings.
Export Results: Aggregating the model predictions and exporting the results to a CSV file.

Installation

Install required libraries:

pip install tensorflow==2.8.0 tensorflow-io==0.25.0 matplotlib

Load and preprocess audio files, converting them to a standard format suitable for model training.

Train a CNN on the preprocessed audio data to classify Capuchinbird calls.

Use the trained model to detect Capuchinbird calls in new audio recordings and post-process the results to identify distinct calls.

Export the final detection results to a CSV file for analysis.

This project is based on the tutorial by Nick Nochnack. For more information, visit Nick Nochnack's GitHub.