Pinned Repositories
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
audio-steganography-algorithms
A Library of Audio Steganography & Watermarking Algorithms
audio_steganalysis_ml
Audio steganalysis based on traditional handcrafted features design.
automatic-ecg-diagnosis
Scripts and modules for training and testing neural network for ECG automatic classification. Companion code to the paper "Automatic diagnosis of the 12-lead ECG using a deep neural network".
bark
🔊 Text-Prompted Generative Audio Model
COVID-CAPS
A Capsule Network-based framework for identification of COVID-19 cases from chest X-ray Images
Deep-Audio-Steganalysis
Official Repository for Deep Audio Steganalysis in Time Domain
deepspeech-server
A testing server for a speech to text service based on mozilla deepspeech
DeepSpeech-with-keras
a Keras implementation of DeepSpeech using google Colab
mohsen-goodarzi's Repositories
mohsen-goodarzi/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
mohsen-goodarzi/allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
mohsen-goodarzi/audio-steganography-algorithms
A Library of Audio Steganography & Watermarking Algorithms
mohsen-goodarzi/audio_steganalysis_ml
Audio steganalysis based on traditional handcrafted features design.
mohsen-goodarzi/automatic-ecg-diagnosis
Scripts and modules for training and testing neural network for ECG automatic classification. Companion code to the paper "Automatic diagnosis of the 12-lead ECG using a deep neural network".
mohsen-goodarzi/bark
🔊 Text-Prompted Generative Audio Model
mohsen-goodarzi/COVID-CAPS
A Capsule Network-based framework for identification of COVID-19 cases from chest X-ray Images
mohsen-goodarzi/Deep-Audio-Steganalysis
Official Repository for Deep Audio Steganalysis in Time Domain
mohsen-goodarzi/deepspeech-server
A testing server for a speech to text service based on mozilla deepspeech
mohsen-goodarzi/DeepSpeech-with-keras
a Keras implementation of DeepSpeech using google Colab
mohsen-goodarzi/django-deepspeech-server
Mozilla deepspeech server implemented in django.
mohsen-goodarzi/dotfiles
mohsen-goodarzi/ECG-acquisition-classification
Single Lead ECG signal Acquisition and Arrhythmia Classification using Deep Learning
mohsen-goodarzi/espnet
End-to-End Speech Processing Toolkit
mohsen-goodarzi/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
mohsen-goodarzi/hello-world
learning git
mohsen-goodarzi/icefall
mohsen-goodarzi/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
mohsen-goodarzi/k2-online
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
mohsen-goodarzi/kickstart.nvim
A launch point for your personal nvim configuration
mohsen-goodarzi/lhotse
Tools for handling speech data in machine learning projects.
mohsen-goodarzi/mohsen-goodarzi.github.io
mohsen-goodarzi/num2fawords
Takes a number and converts it to Persian word form
mohsen-goodarzi/Precision_tES_tACS_tRNS_tDCS
Precision transcranial electrical stimulation device
mohsen-goodarzi/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
mohsen-goodarzi/Resemblyzer
A python package to analyze and compare voices with deep learning
mohsen-goodarzi/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
mohsen-goodarzi/tf_audio_steganalysis
Audio steganalysis with tensorflow.
mohsen-goodarzi/VALL-E-Zero-Shot-Text-To-Speech-
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
mohsen-goodarzi/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation