Audio Analyzer

Description:

Audio Analyzer is a modular, scalable pipeline for audio analysis and processing. It is designed so that new features and functionality can be added easily.

Solution Diagram:

[Figure: pipeline diagram]

to-do:

  • create a timestamped directory for each run, for debugging
  • test on large audio files
  • split audio into 1-minute segments
  • extract features from each segment (in parallel)
  • save features to CSV
  • do not assume the spoken language
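
The run directory, segmentation, parallel feature extraction, and CSV steps above could be sketched roughly as follows. This is a minimal sketch and not the project's implementation: the function names, the `runs` base directory, the timestamp format, the `features.csv` file name, and the placeholder features (sample count, peak, RMS) are all assumptions made for illustration, and audio is assumed to be already decoded into a plain sequence of samples.

```python
import csv
import math
from concurrent.futures import ThreadPoolExecutor
from datetime import datetime
from pathlib import Path

SEGMENT_SECONDS = 60  # 1-minute segments, as in the to-do list


def make_run_dir(base="runs"):
    """Create a timestamped directory for this run (name format is an assumption)."""
    run_dir = Path(base) / datetime.now().strftime("%Y%m%d_%H%M%S")
    run_dir.mkdir(parents=True, exist_ok=True)
    return run_dir


def split_segments(samples, sample_rate, seconds=SEGMENT_SECONDS):
    """Split samples into consecutive segments; the last one may be shorter."""
    step = sample_rate * seconds
    return [samples[i:i + step] for i in range(0, len(samples), step)]


def extract_features(segment):
    """Placeholder features (hypothetical): sample count, peak amplitude, RMS."""
    n = len(segment)
    rms = math.sqrt(sum(x * x for x in segment) / n) if n else 0.0
    peak = max((abs(x) for x in segment), default=0.0)
    return {"n_samples": n, "peak": peak, "rms": rms}


def run_pipeline(samples, sample_rate, base="runs"):
    """Segment the samples, extract features in parallel, and write one CSV row per segment."""
    run_dir = make_run_dir(base)
    segments = split_segments(samples, sample_rate)
    # Threads keep the sketch simple; a process pool may suit CPU-bound features better.
    with ThreadPoolExecutor() as pool:
        rows = list(pool.map(extract_features, segments))  # map preserves segment order
    csv_path = run_dir / "features.csv"
    with csv_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["segment", "n_samples", "peak", "rms"])
        writer.writeheader()
        for index, row in enumerate(rows):
            writer.writerow({"segment": index, **row})
    return csv_path
```

Calling `run_pipeline(samples, sample_rate)` returns the path of the CSV written inside the new run directory, so each run leaves its own folder to inspect when debugging.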

audio:

  • background noise classification
  • audio enhancement
  • speaker diarization (Link1, Link2)
  • tone classification

text:

  • text extraction
  • sentiment analysis
  • toxic word detection

future tasks to consider:

  • text summarization
  • cleaning dead segments
  • audio segmentation - part of day

data:

models:

other:

  1. run pipreqs to generate requirements.txt:
    pipreqs . --force
  2. install the dependencies:
    pip install -r requirements.txt

bugfixes: