audio-augmentation
There are 13 repositories under audio-augmentation topic.
AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
KentoNishi/torch-time-stretch
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Lallapallooza/fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
zhaoyi2/audio_augment
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
zabir-nabil/audioperm
A python library for generating different permutations of audible segments from audio files.
zabir-nabil/torch-speech-dataloader
A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations
DBraun/audiotree
Audio data loading and augmentations in JAX
lucas-fpaiva/survey-audio-aug
Implementation of audio, image, and spectrogram augmentation techniques provided by the librosa, Keras and audiomentations
hperer02/Bird-sound-classification
This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I achieved a rank of 55th out of 974 participants, earning a bronze medal. The goal of this competition was to build a model that can accurately classify bird sounds.
AndreasScharnetzki/EmotionClassifier
A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.
laurencecliffe/SoundScaper
SoundScaper is an audio augmented reality mobile application that allows users to author, save and reload virtual, and spatially interactive, three-dimensional binaural soundscapes within physical, real world spaces.
imane-ayouni/Text-to-Speech-using-Tacotron2
Converting text to audio and applying audio augmentation