dariadiatlova

voice dl researcher

@deepvk

Saint-Petersburg

Pinned Repositories

asr_project_template
Template for ASR project
Language:Python0 0 00
Bayesian-Methods-hse-fall-2021
Homework assignments for the YSDA course BMMO21 (Bayesian Methods in Machine Learning)
Language:Jupyter Notebook0 1 00
csc_nlp_word_vectors
Language:Python0 1 00
CV
homework assignments: computer vision course
Language:Jupyter Notebook0 1 00
dariadiatlova.github.io
Language:HTML0 1 00
data-efficient-gans
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
Language:Python0 0 00
Fre-GAN
Test task for the VK research internship 2022
Language:Python8 3 02
iSTFTNet-pytorch
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language:Python2 1 01
plot_templates
A set of custom templates for building plots with the help of Python-friendly libraries
Language:Python1 1 00
emospeech
Language:Python89 4 49

dariadiatlova's Repositories

dariadiatlova/Fre-GAN
Test task for the VK research internship 2022
Language:Python8 3 02
dariadiatlova/iSTFTNet-pytorch
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Language:Python2 1 01
dariadiatlova/dariadiatlova.github.io
Language:HTML0 1 00
dariadiatlova/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve model performance and generalization.
Language:Python0 0 00
dariadiatlova/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python0 0 00
dariadiatlova/NeMo
NeMo: a toolkit for conversational AI
Language:Python0 0 00
dariadiatlova/speech_course
YSDA course in Speech Processing.
Language:Jupyter Notebook0 0 00
dariadiatlova/dla
Deep learning for audio processing
Language:Jupyter Notebook0 0
dariadiatlova/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python0 0
dariadiatlova/dsp
Digital Signal Processing course
Language:Python0 0
dariadiatlova/DTLN
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-lite, ONNX and real-time audio processing support.
Language:Python0 0
dariadiatlova/dul_2021
Language:Jupyter Notebook0 0
dariadiatlova/emo-tts-data
Language:Python1 0
dariadiatlova/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python0 0
dariadiatlova/FRN
Language:Python0 0
dariadiatlova/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python0 0
dariadiatlova/hse-advanced-python
Language:Python1 0
dariadiatlova/Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
Language:Shell0 0
dariadiatlova/LPCNet
Efficient neural speech synthesis
Language:C0 0
dariadiatlova/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
Language:Python0 01
dariadiatlova/MSP-Podcast_Challenge
MSP-Podcast Challenge Baseline Code
Language:Python0 0
dariadiatlova/RecSys-hse-fall-2021
This repository contains homework assignments for the recommendation systems course.
Language:Jupyter Notebook1 0
dariadiatlova/russian_speech_denoiser
The repository contains supporting scripts for the Master's thesis research
Language:Jupyter Notebook1 0
dariadiatlova/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python0 0
dariadiatlova/speechbrain
A PyTorch-based Speech Toolkit
Language:Python0 0
dariadiatlova/StarGAN-Voice-Conversion
This is a PyTorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Language:Python0 0
dariadiatlova/swift-sandbox
This repository consists of several simple iOS apps
Language:Swift1 0
dariadiatlova/TTS_HW
Language:Python0 0
dariadiatlova/urban-sound-classification
Language:Jupyter Notebook1 0
dariadiatlova/vits2_pytorch
Unofficial vits2-TTS implementation in PyTorch
Language:Python0 0