vkothapally
Machine Learning, Deep Learning, Microphone Arrays, Distant Speech Enhancement and Recognition
Center for Robust Speech Systems (CRSS)Dallas, TX
Pinned Repositories
Array-Processing
Beamforming Techniques
awesome-fast-attention
list of efficient attention modules
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
awesome-speech-enhancement-1
speech enhancement\speech seperation\sound source localization
Complex-valued-Attention
Transformer based Self-Attention for Complex Numbers
Complex-valued-DNN-Speech-Enhancement
Complex valued Deep Neural Network for Speech Enhancement
Complex-valued-GRU-PyTorch
Gated Recurrent Neural Networks for Complex Numbers
JAECBF
Speech-Enhancement
A Multi-Channel Front End processing for speech enhancement
Subband-Beamformer
vkothapally's Repositories
vkothapally/JAECBF
vkothapally/Subband-Beamformer
vkothapally/Complex-valued-Attention
Transformer based Self-Attention for Complex Numbers
vkothapally/Complex-valued-DNN-Speech-Enhancement
Complex valued Deep Neural Network for Speech Enhancement
vkothapally/Complex-valued-GRU-PyTorch
Gated Recurrent Neural Networks for Complex Numbers
vkothapally/awesome-speech-enhancement-1
speech enhancement\speech seperation\sound source localization
vkothapally/Complex-valued-Deformable-Convolutions
Deformable Convolutions for Complex Numbers
vkothapally/Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
vkothapally/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
vkothapally/sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
vkothapally/TCN
Sequence modeling benchmarks and temporal convolutional networks
vkothapally/Adaptive-deformable-convolution
Pytorch-based adaptive deformable convolution
vkothapally/auraloss
Collection of audio-focused loss functions in PyTorch
vkothapally/EfficientDNNs
Collection of recent methods on (deep) neural network compression and acceleration.
vkothapally/GCRN-complex
vkothapally/Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
vkothapally/scientific-visualization-book
An open access book on scientific visualization using python and matplotlib
vkothapally/SkipConvGAN
vkothapally/StyleSwin
StyleSwin: Transformer-based GAN for High-resolution Image Generation
vkothapally/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
vkothapally/transformer
Implementation of "Attention Is All You Need" using pytorch
vkothapally/UniTrack
[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).
vkothapally/ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
vkothapally/audio-development-tools
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
vkothapally/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
vkothapally/MLfAS
Machine Learning for Audio Signals in Python
vkothapally/pytorch-speech-features
vkothapally/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
vkothapally/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
vkothapally/torchsubband
Pytorch implementation of subband decomposition