ferugit

Ph.D. student in AI, Speech and Audio Technologies at Universidad Autónoma de Madrid | AI Research and Prototyping @Telefonica 's Discovery Innovation Team

TelefónicaMadrid, Spain

Pinned Repositories

Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting (AAAI 2022 DSTC Workshop)
Language:Python0 0 01
crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Language:Python0 1 00
ctc-loss
A PyTorch implementation of CTCLoss (for learning purposes)
Language:Python1 2 00
ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
Language:Python0 1 00
iterative-pseudo-forced-alignment-ctc
The code for the https://arxiv.org/pdf/2210.15226.pdf
Language:Python2 2 10
quark
Efficient Keyword Spotting
Language:Python0 1 01
sonopytorch
Torch implementation of Sonopy
Language:Python0 2 10
speechbrain
A PyTorch-based Speech Toolkit
Language:Python0 1 00
transformer-corrector
Transformer-based Spanish corrector
Language:Python0 1 00
lullaby-generation-spanish
Notebooks used by Menara, Gonimix y Andino team in the AI Song Contest 2021 to generate lullabies in Spanish.
Language:Jupyter Notebook2 1 00

ferugit's Repositories

ferugit/iterative-pseudo-forced-alignment-ctc
The code for the https://arxiv.org/pdf/2210.15226.pdf
Language:Python2 2 10
ferugit/ctc-loss
A PyTorch implementation of CTCLoss (for learning purposes)
Language:Python1 2 00
ferugit/Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting (AAAI 2022 DSTC Workshop)
Language:Python0 0 01
ferugit/crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Language:Python0 1 00
ferugit/ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
Language:Python0 1 00
ferugit/degan
Deep Effect Generation using GANs
Language:Python0 2 10
ferugit/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python0 1 00
ferugit/quark
Efficient Keyword Spotting
Language:Python0 1 01
ferugit/sonopytorch
Torch implementation of Sonopy
Language:Python0 2 10
ferugit/speechbrain
A PyTorch-based Speech Toolkit
Language:Python0 1 00
ferugit/transformer-corrector
Transformer-based Spanish corrector
Language:Python0 1 00
ferugit/DESED_task
Domestic environment sound event detection task
Language:Jupyter Notebook1 0
ferugit/diart
Lightweight python library for streaming speaker diarization in real-time implemented in pytorch
Language:Python0 0
ferugit/EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
ferugit/examples
TensorFlow examples
Language:Jupyter Notebook1 0
ferugit/ferugit.github.io
W. Fernando López Gavilánez public page
Language:HTML2 0
ferugit/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
Language:Python0 0
ferugit/pytorch_introduction
Several pytorch projects
Language:Jupyter Notebook2 0
ferugit/RugPullDetection
Language:Jupyter Notebook0 01
ferugit/speaker-recognition-exploration
Speaker Recognition Exploration
Language:Jupyter Notebook1 0
ferugit/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Language:Python0 0
ferugit/transducer-tutorial
Example code for a neural transducer model.
Language:Jupyter Notebook1 0
ferugit/udacity-deep-learning
Udacity Deep Learning Course
Language:Jupyter Notebook2 0
ferugit/wuw-challenge-2024
Baseline of the Wake-up Word Challenge of the 2024 Albayzin Evaluations
Language:Python

ferugit

Pinned Repositories

Audiomer-PyTorch

crnn-audio-classification

ctc-loss

ctc-segmentation

iterative-pseudo-forced-alignment-ctc

quark

sonopytorch

speechbrain

transformer-corrector

lullaby-generation-spanish

ferugit's Repositories

ferugit/iterative-pseudo-forced-alignment-ctc

ferugit/ctc-loss

ferugit/Audiomer-PyTorch

ferugit/crnn-audio-classification

ferugit/ctc-segmentation

ferugit/degan

ferugit/denoiser

ferugit/quark

ferugit/sonopytorch

ferugit/speechbrain

ferugit/transformer-corrector

ferugit/DESED_task

ferugit/diart

ferugit/EfficientAT

ferugit/examples

ferugit/ferugit.github.io

ferugit/performer-pytorch

ferugit/pytorch_introduction

ferugit/RugPullDetection

ferugit/speaker-recognition-exploration

ferugit/ssast

ferugit/transducer-tutorial

ferugit/udacity-deep-learning

ferugit/wuw-challenge-2024