wa3dbk
PhD in computer science, Senior Research Scientist in Speech and Language Processing at Vocapia Research
Vocapia Research, Paris, France
Pinned Repositories
ALIZE-LIA_RAL-extensions
Tools and utilities for speaker recognition built on the ALIZE platform
audio_degrader
Audio degradation toolbox in Python, useful for applying controlled degradations to audio.
Barcha
Open-source NLP resources for the Tunisian Arabic dialect.
epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
OpenAED
A collection of manually annotated audio files for acoustic event detection (AED).
react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
roary
Pan-genome Nextflow pipeline that takes FASTA input files for Prokka and Roary and then generates visualisations
ScribeSalad
A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferriss, Jocko Podcast, ...), lectures (YaleCourses, MIT lectures, Jordan B. Peterson talks, ...). A big transcript salad spanning history, geography, science, politics, filmmaking and more.
SPro
Unofficial SPro repository
Youtube8M-subs
Manually and automatically generated subtitles for the YouTube-8M dataset
wa3dbk's Repositories
wa3dbk/ScribeSalad
A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferriss, Jocko Podcast, ...), lectures (YaleCourses, MIT lectures, Jordan B. Peterson talks, ...). A big transcript salad spanning history, geography, science, politics, filmmaking and more.
wa3dbk/ALIZE-LIA_RAL-extensions
Tools and utilities for speaker recognition built on the ALIZE platform
wa3dbk/Barcha
Open-source NLP resources for the Tunisian Arabic dialect.
wa3dbk/OpenAED
A collection of manually annotated audio files for acoustic event detection (AED).
wa3dbk/react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
wa3dbk/roary
Pan-genome Nextflow pipeline that takes FASTA input files for Prokka and Roary and then generates visualisations
wa3dbk/audio_degrader
Audio degradation toolbox in Python, useful for applying controlled degradations to audio.
wa3dbk/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
wa3dbk/SPro
Unofficial SPro repository
wa3dbk/Youtube8M-subs
Manually and automatically generated subtitles for the YouTube-8M dataset