banalasaritha

Speaker Recognition and Identification, Meta-Learning, Few-Shot Learning & Speech Processing, Speech Activity Detection, T-F Representations.

National Institute of Technology, India

Pinned Repositories

3DCNN
3D convolutional neural network for video classification
Language: Python
AFRNN
Language: Python
ClusteringDirectionCentrality
A novel clustering algorithm based on locally measuring direction centrality (CDC). It adopts a density-independent metric based on the distribution of K-nearest neighbors (KNNs) to distinguish between internal and boundary points. The boundary points generate enclosed cages to bind the connections of internal points.
Language: MATLAB
Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with TensorFlow
Language: Jupyter Notebook
MAML-and-FOMAML-implimentaion-and-comparison
Comparison between MAML & FOMAML
Language: Jupyter Notebook
MetaAudio-A-Few-Shot-Audio-Classification-Benchmark
A new comprehensive and diverse few-shot acoustic classification benchmark.
Language: Python
prototypical-networks-tensorflow
TensorFlow implementation of the NIPS 2017 paper "Prototypical Networks for Few-shot Learning"
Language: Jupyter Notebook
reptile-pytorch
A PyTorch implementation of OpenAI's REPTILE algorithm
Language: Jupyter Notebook
Self-Supervised-Audio-Spectrogram-Transformer
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Language: Python
workshops
Materials for workshops on the Hugging Face ecosystem
Language: Jupyter Notebook

banalasaritha's Repositories

banalasaritha/ClusteringDirectionCentrality
A novel clustering algorithm based on locally measuring direction centrality (CDC). It adopts a density-independent metric based on the distribution of K-nearest neighbors (KNNs) to distinguish between internal and boundary points. The boundary points generate enclosed cages to bind the connections of internal points.
Language: MATLAB
banalasaritha/MAML-and-FOMAML-implimentaion-and-comparison
Comparison between MAML & FOMAML
Language: Jupyter Notebook
banalasaritha/workshops
Materials for workshops on the Hugging Face ecosystem
Language: Jupyter Notebook
banalasaritha/AFRNN
Language: Python
banalasaritha/AISHELL-2
kaldi-asr/kaldi is the official location of the Kaldi project.
banalasaritha/AS-pVAD
AS-pVAD: A Real-time Personalized Voice Activity Detection Network With Attentive Score Loss
banalasaritha/Audio-Spectrogram-Transformer
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language: Jupyter Notebook
banalasaritha/AudioSet-For-Meta-Learning
Meta-Learning for Few Shot Learning
Language: Python
banalasaritha/CACRN-Net
Channel Attention Convolutional Recurrent Neural Network for Few-Shot Speaker Identification
Language: Jupyter Notebook
banalasaritha/Chinese-Speaker-Identification
End-to-End Chinese Speaker Identification
Language: Python
banalasaritha/DO-Conv
Depthwise Over-parameterized Convolutional Layer
Language: Python
banalasaritha/FastVADCode
Code for FastVad
Language: Jupyter Notebook
banalasaritha/FunASR-Transformer-VAD
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Language: Python
banalasaritha/image_caption_with_selfAttention
Language: Jupyter Notebook
banalasaritha/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
Language: Python
banalasaritha/MultiresolutionNeuralNetworks
Multi-Resolution Neural Networks
Language: Python
banalasaritha/prototypical-networks-jupyter
Prototypical networks for few-shot learning
Language: Jupyter Notebook
banalasaritha/prototypical_networks_pytorch
Language: Jupyter Notebook
banalasaritha/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
banalasaritha/python-wigner-distribution
A Python-based Wigner distribution implementation, including a method for interference reduction
Language: Python
banalasaritha/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language: Python
banalasaritha/ResNeSt
ResNeSt: Split-Attention Networks
Language: Python
banalasaritha/Speaker-emotion-speech-and-diarazation-recognition
Language: Python
banalasaritha/speaker-identification-1
Speaker identification using a neural network.
Language: Python
banalasaritha/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
Language: Python
banalasaritha/Time-Frequency-Representations-of-RSR2015-Database
Time-frequency representations of the RSR2015 database.
banalasaritha/transformers-wav2vec
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
banalasaritha/vggvox_identification
Training and evaluation of VGGVox neural network for speaker identification
Language: Python
banalasaritha/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
Language: Python
banalasaritha/Voice_Activity_Detection_Frame
Frame-VAD: More Effective and Efficient VAD for More Fine-grained Timestamps
Language: Jupyter Notebook