mohsen-goodarzi

Germany

Pinned Repositories

aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Language:Python0 1 00
allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
Language:Python00
audio-steganography-algorithms
A Library of Audio Steganography & Watermarking Algorithms
Language:MATLAB0 1 00
audio_steganalysis_ml
Audio steganalysis based on traditional handcrafted features design.
Language:MATLAB0 1 00
automatic-ecg-diagnosis
Scripts and modules for training and testing neural network for ECG automatic classification. Companion code to the paper "Automatic diagnosis of the 12-lead ECG using a deep neural network".
Language:Python0 1 00
bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook0 0 00
COVID-CAPS
A Capsule Network-based framework for identification of COVID-19 cases from chest X-ray Images
Language:Python0 1 00
Deep-Audio-Steganalysis
Official Repository for Deep Audio Steganalysis in Time Domain
Language:Python0 1 00
deepspeech-server
A testing server for a speech to text service based on mozilla deepspeech
Language:Python0 1 00
DeepSpeech-with-keras
a Keras implementation of DeepSpeech using google Colab
Language:Jupyter Notebook0 3 00

mohsen-goodarzi's Repositories

mohsen-goodarzi/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Language:Python0 1 00
mohsen-goodarzi/allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
Language:Python00
mohsen-goodarzi/audio-steganography-algorithms
A Library of Audio Steganography & Watermarking Algorithms
Language:MATLAB0 1 00
mohsen-goodarzi/audio_steganalysis_ml
Audio steganalysis based on traditional handcrafted features design.
Language:MATLAB0 1 00
mohsen-goodarzi/automatic-ecg-diagnosis
Scripts and modules for training and testing neural network for ECG automatic classification. Companion code to the paper "Automatic diagnosis of the 12-lead ECG using a deep neural network".
Language:Python0 1 00
mohsen-goodarzi/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook0 0 00
mohsen-goodarzi/COVID-CAPS
A Capsule Network-based framework for identification of COVID-19 cases from chest X-ray Images
Language:Python0 1 00
mohsen-goodarzi/Deep-Audio-Steganalysis
Official Repository for Deep Audio Steganalysis in Time Domain
Language:Python0 1 00
mohsen-goodarzi/deepspeech-server
A testing server for a speech to text service based on mozilla deepspeech
Language:Python0 1 00
mohsen-goodarzi/DeepSpeech-with-keras
a Keras implementation of DeepSpeech using google Colab
Language:Jupyter Notebook0 3 00
mohsen-goodarzi/django-deepspeech-server
Mozilla deepspeech server implemented in django.
Language:JavaScript0 1 00
mohsen-goodarzi/dotfiles
Language:Vim Script0 1 00
mohsen-goodarzi/ECG-acquisition-classification
Single Lead ECG signal Acquisition and Arrhythmia Classification using Deep Learning
Language:Jupyter Notebook0 1 00
mohsen-goodarzi/espnet
End-to-End Speech Processing Toolkit
Language:Python0 0
mohsen-goodarzi/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language:Python1 0
mohsen-goodarzi/hello-world
learning git
Language:HTML1 0
mohsen-goodarzi/icefall
Language:Python0 0
mohsen-goodarzi/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Language:C1 0
mohsen-goodarzi/k2-online
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
Language:Cuda0 0
mohsen-goodarzi/kickstart.nvim
A launch point for your personal nvim configuration
Language:Lua0 0
mohsen-goodarzi/lhotse
Tools for handling speech data in machine learning projects.
Language:Python0 0
mohsen-goodarzi/mohsen-goodarzi.github.io
Language:HTML2 0
mohsen-goodarzi/num2fawords
Takes a number and converts it to Persian word form
Language:Python1 0
mohsen-goodarzi/Precision_tES_tACS_tRNS_tDCS
Precision transcranial electrical stimulation device
Language:HTML1 0
mohsen-goodarzi/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python1 0
mohsen-goodarzi/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python1 0
mohsen-goodarzi/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language:Python1 0
mohsen-goodarzi/tf_audio_steganalysis
Audio steganalysis with tensorflow.
Language:Python1 0
mohsen-goodarzi/VALL-E-Zero-Shot-Text-To-Speech-
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python0 0
mohsen-goodarzi/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Language:Python0 0