MohannadEhabBarakat

Erlangen, Nuremberg

MohannadEhabBarakat's Stars

AMAI-GmbH/AI-Expert-Roadmap
Roadmap to becoming an Artificial Intelligence Expert in 2022
Language:JavaScript29.7k 970 652.5k
w-okada/voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
Language:Python17.7k 133 1.1k1.9k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python14.9k 144 7921.6k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13.9k 179 5301.9k
replicate/cog
Containers for machine learning
Language:Python8.5k 69 797596
rhasspy/piper
A fast, local neural text to speech system
Language:C++8.5k 86 551642
ryo-ma/github-profile-trophy
🏆 Add dynamically generated GitHub Stat Trophies on your readme
Language:TypeScript5.7k 60 173865
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python5.6k 78 223528
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C4.9k 107 1k989
creativetimofficial/material-tailwind
@material-tailwind is an easy-to-use components library for Tailwind CSS and Material Design.
Language:TypeScript4.1k 21 402340
sfikas/medical-imaging-datasets
A list of Medical imaging datasets.
2.4k 60 5420
adefossez/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python1.4k 38 0141
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language:Python1.2k 54 54152
wty-ustc/HairCLIP
[CVPR 2022] HairCLIP: Design Your Hair by Text and Reference Image
Language:Python541 19 4868
laclouis5/globox
A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COCO and PascalVOC metrics.
Language:Python194 4 2826
WebDevSimplified/Learn-React-In-30-Minutes
Language:JavaScript161 5 398
VIPL-Audio-Visual-Speech-Understanding/learn-an-effective-lip-reading-model-without-pains
The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.
Language:Python159 1 2637
manmay-nakhashi/tortoise-tts-fastest
Faster Tortoise inference then Tortoise Fast Fork
Language:Jupyter Notebook128 2 128
NextAudioGen/ultimatevocalremover_api
API for a Vocal Remover that uses Deep Neural Networks.
Language:Python103 1 1012
drengskapur/docker-in-colab
Run Docker inside Google Colab
101 3 215
liangjiubujiu/CTooth
this is the official link to request CTooth
Language:Python97 4 185
NeuralVox/OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPL phonemizer.
Language:Python95 4 75
MikeOfZen/Yet-Another-Openpose-Implementation
This project reimplements from scratch the OpenPose paper (Cao et al,2018), Using Tensorflow 2.1 and optional TPU powered training.
Language:Jupyter Notebook92 2 1226
RISE-MICCAI/Journal-Club
The RISE Journal Club aims to create a friendly environment to discuss the latest state-of-the-art papers in the areas of medical image analysis, AI and computer vision. The moderators will briefly introduce the paper and then moderate a discussion where everyone is welcome to provide their thoughts and ask any questions on the paper.
Language:HTML67 12 11
MIC-DKFZ/ACDC2017
Language:Python60 9 315
CircuitCM/RVC-inference
High performance RVC inferencing, intended for multiple instances in memory at once. Also includes the latest pitch estimator RMVPE, Python 3.8-3.11 compatible, pip installable, memory + performance improvements in the pipeline and model usage.
Language:Python25 1 44
zsxkib/voice-cloning-training
Voice data <= 10 mins can also be used to train a good VC model!
Language:Python11 0 08
fakerybakery/txtsplit
A simple text splitter based on Tortoise for use in text-to-speech applications
Language:Python4 1 00
lucataco/RAVE
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - Official Repo
Language:Python3 0 0
NextAudioGen/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python2 0 00

MohannadEhabBarakat

MohannadEhabBarakat's Stars

AMAI-GmbH/AI-Expert-Roadmap

w-okada/voice-changer

m-bain/whisperX

neonbjb/tortoise-tts

replicate/cog

rhasspy/piper

ryo-ma/github-profile-trophy

yl4579/StyleTTS2

espeak-ng/espeak-ng

creativetimofficial/material-tailwind

sfikas/medical-imaging-datasets

adefossez/demucs

sh-lee-prml/HierSpeechpp

wty-ustc/HairCLIP

laclouis5/globox

WebDevSimplified/Learn-React-In-30-Minutes

VIPL-Audio-Visual-Speech-Understanding/learn-an-effective-lip-reading-model-without-pains

manmay-nakhashi/tortoise-tts-fastest

NextAudioGen/ultimatevocalremover_api

drengskapur/docker-in-colab

liangjiubujiu/CTooth

NeuralVox/OpenPhonemizer

MikeOfZen/Yet-Another-Openpose-Implementation

RISE-MICCAI/Journal-Club

MIC-DKFZ/ACDC2017

CircuitCM/RVC-inference

zsxkib/voice-cloning-training

fakerybakery/txtsplit

lucataco/RAVE

NextAudioGen/StyleTTS2