vkothapally

Machine Learning, Deep Learning, Microphone Arrays, Distant Speech Enhancement and Recognition

Center for Robust Speech Systems (CRSS), Dallas, TX

Pinned Repositories

Array-Processing
Beamforming Techniques
Language: Matlab | 5 0 15
awesome-fast-attention
list of efficient attention modules
Language: Python | 2 0 00
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language: MATLAB | 2 0 00
awesome-speech-enhancement-1
speech enhancement\speech separation\sound source localization
2 0 00
Complex-valued-Attention
Transformer based Self-Attention for Complex Numbers
Language: Python | 10 1 02
Complex-valued-DNN-Speech-Enhancement
Complex valued Deep Neural Network for Speech Enhancement
3 1 00
Complex-valued-GRU-PyTorch
Gated Recurrent Neural Networks for Complex Numbers
3 1 10
JAECBF
Language: Python | 49 1 110
Speech-Enhancement
A Multi-Channel Front End processing for speech enhancement
Language: Python | 3 0 02
Subband-Beamformer
Language: HTML | 31 2 05

vkothapally's Repositories

vkothapally/JAECBF
Language: Python | 49 1 110
vkothapally/Subband-Beamformer
Language: HTML | 31 2 05
vkothapally/Complex-valued-Attention
Transformer based Self-Attention for Complex Numbers
Language: Python | 10 1 02
vkothapally/Complex-valued-DNN-Speech-Enhancement
Complex valued Deep Neural Network for Speech Enhancement
3 1 00
vkothapally/Complex-valued-GRU-PyTorch
Gated Recurrent Neural Networks for Complex Numbers
3 1 10
vkothapally/awesome-speech-enhancement-1
speech enhancement\speech separation\sound source localization
2 0 00
vkothapally/Complex-valued-Deformable-Convolutions
Deformable Convolutions for Complex Numbers
Language: Python | 2 1 00
vkothapally/Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
Language: Python | 2 0 00
vkothapally/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language: Python | 2 0 00
vkothapally/sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
Language: Python | 2 0 0
vkothapally/TCN
Sequence modeling benchmarks and temporal convolutional networks
Language: Python | 2 0 0
vkothapally/Adaptive-deformable-convolution
PyTorch-based adaptive deformable convolution
Language: Python | 1 0 0
vkothapally/auraloss
Collection of audio-focused loss functions in PyTorch
Language: Python | 1 0 0
vkothapally/EfficientDNNs
Collection of recent methods on (deep) neural network compression and acceleration.
1 0 0
vkothapally/GCRN-complex
Language: Python | 1 0 0
vkothapally/Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
Language: Python | 1 0 0
vkothapally/scientific-visualization-book
An open-access book on scientific visualization using Python and Matplotlib
Language: Python | 1 0 0
vkothapally/SkipConvGAN
Language: HTML | 1 1 0
vkothapally/StyleSwin
StyleSwin: Transformer-based GAN for High-resolution Image Generation
1 0 0
vkothapally/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
Language: Python | 1 0 0
vkothapally/transformer
Implementation of "Attention Is All You Need" using PyTorch
Language: Python | 1 0 0
vkothapally/UniTrack
[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).
Language: Python | 1 0 0
vkothapally/ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
vkothapally/audio-development-tools
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
vkothapally/beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
Language: Python | 0 0
vkothapally/MLfAS
Machine Learning for Audio Signals in Python
Language: Jupyter Notebook | 0 0
vkothapally/pytorch-speech-features
vkothapally/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language: Python | 0 0
vkothapally/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language: Python | 0 0
vkothapally/torchsubband
PyTorch implementation of subband decomposition
Language: HTML | 0 0