audio-augmentation

There are 13 repositories under audio-augmentation topic.

  • AgaMiko/data-augmentation-review

    List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.

  • KentoNishi/torch-pitch-shift

    Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

    Language:Python1322612
  • KentoNishi/torch-time-stretch

    Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

    Language:Python36233
  • Lallapallooza/fast-audiomentations

    ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.

    Language:Python32301
  • zhaoyi2/audio_augment

    A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN

    Language:Shell22114
  • zabir-nabil/audioperm

    A python library for generating different permutations of audible segments from audio files.

    Language:Jupyter Notebook13302
  • zabir-nabil/torch-speech-dataloader

    A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations

    Language:Python8300
  • DBraun/audiotree

    Audio data loading and augmentations in JAX

    Language:Python323
  • lucas-fpaiva/survey-audio-aug

    Implementation of audio, image, and spectrogram augmentation techniques provided by the librosa, Keras and audiomentations

    Language:Jupyter Notebook3100
  • hperer02/Bird-sound-classification

    This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I achieved a rank of 55th out of 974 participants, earning a bronze medal. The goal of this competition was to build a model that can accurately classify bird sounds.

    Language:Jupyter Notebook1100
  • AndreasScharnetzki/EmotionClassifier

    A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.

    Language:Python0100
  • laurencecliffe/SoundScaper

    SoundScaper is an audio augmented reality mobile application that allows users to author, save and reload virtual, and spatially interactive, three-dimensional binaural soundscapes within physical, real world spaces.

  • imane-ayouni/Text-to-Speech-using-Tacotron2

    Converting text to audio and applying audio augmentation

    Language:HTML10