DavidDoukhan

DavidDoukhan's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python136k 1.1k 16.2k27.2k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python72.2k 587 08.6k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.6k 425 4.2k6.4k
Breakthrough/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
Language:Python3.3k 68 338402
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1.3k 57 199140
SHI-Labs/Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
Language:Python1.1k 16 7786
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language:Python969 12 10688
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
939 27 860
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Language:Python758 24 74129
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python614 4 83115
Yuan-ManX/ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
547 13 139
plk/biblatex
biblatex is a sophisticated bibliography system for LaTeX users. It has considerably more features than traditional bibtex and supports UTF-8
Language:TeX521 35 1.1k118
zma-c-137/VarGFaceNet
Language:Python312 28 2084
SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Language:HTML300 15 4042
jitinnair1/autoCV
clean CV LaTex template with GitHub Actions that compile and publish new changes
Language:TeX169 3 3104
zhenghuatan/rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Language:MATLAB129 7 730
AI4LAM/awesome-ai4lam
A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️
Language:SCSS94 4 427
fdbtrs/mixfacenets
Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks
Language:Python61 2 912
ardaillon/FCN-f0
Fully-Convolutional Network for Pitch Estimation of Speech Signals
Language:Python55 2 313
MontrealCorpusTools/PolyglotDB
Language data store and linguistic query API
Language:Python39 13 11314
Mr-TalhaIlyas/Tensorflow-Keras-Model-Profiler
Tensorflow-Keras Model Profiler: Tells you model's memory requirement, no. of parameters, flops etc.
Language:Python27 1 63
ina-foss/inaFaceAnalyzer
INA's library with pretrained models for gender and age prediction from faces.
Language:Python19 6 289
amrta-coder/LFW-emotion-dataset
Datasets released for facial expression recognition with face masks
17 3 26
biboamy/AVASpeech_Music_Labels
Language:Python17 2 10
pabarbosa/prosody-scripts
Language:Papyrus13 5 04
caisa-lab/SPINOS-dataset
SPINOS: A Dataset of Subtle Polarity and Intensity Opinion Shifts
Language:HTML8 1 01
borgr/tutEval
Tutorial on LLM Evaluation in LREC
Language:HTML5 3 00
neuro-symbolic-ai/LangVAE
Language:Python5 1 00
getalp/genderednews
Code of the GenderedNews project
Language:Python4 3 02
M-Lancien/Praat_Scripts
Praat Scripts for acoustical analysis
1 1 00