/copiste

A radio ripper featuring automatic audio classification

Primary LanguageC++GNU General Public License v3.0GPL-3.0

Copiste project

Antonin Delpeuch

Based on :

This project provides a small set of tools around audio classification.

nnat : Neural Network Analysis Tool

This tool takes a neural network described in an XML format (see the networks directory for examples) and trains it to fit a given corpus, described in another XML format (see the corpus directory for examples). It can also evaluate how good a network fits a corpus, and provides a small graphical interface that can be used to visualize 2D networks.

featdraw : Feature Drawer

This tool takes an audio file and draws the variations of some features in this file. The features are described by a pipeline (XML file, see the pipeline directory), allowing the user to change on the fly how features are computed.

mkcorpus : Make Corpus

This tool uses a pipeline and a set of audio files to create a corpus (as in nnat) where the features are written. It uses another XML file, which defines the different audio classes.

specdraw : Draw Spectrum

This tool just takes as input an audio file and draws the variations of the frequency spectrum during the time.

classify : Classify an audio file

This program takes an audio file, a pipeline and a network, and says what class the file belongs to.

live : Live stream classification with neural networks

This program reads a live stream (e.g. a radio), and shows the variations of the features and the classification of the stream, using a neural network.

fingerprint : Live stream classification with hidden Markov models

This program does the same thing as live but using HMM. Fingerprints fed into the HMM are computed by libchromaprint. It is currently under development.

See help pages of all these tools for specific usage instructions.