DavidDoukhan's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Breakthrough/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
SHI-Labs/Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Yuan-ManX/ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
plk/biblatex
biblatex is a sophisticated bibliography system for LaTeX users. It has considerably more features than traditional bibtex and supports UTF-8
zma-c-137/VarGFaceNet
SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
jitinnair1/autoCV
clean CV LaTex template with GitHub Actions that compile and publish new changes
zhenghuatan/rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
AI4LAM/awesome-ai4lam
A list of awesome AI in libraries, archives, and museum collections from around the world 🕶️
fdbtrs/mixfacenets
Official repository for MixFaceNets: Extremely Efficient Face Recognition Networks
ardaillon/FCN-f0
Fully-Convolutional Network for Pitch Estimation of Speech Signals
MontrealCorpusTools/PolyglotDB
Language data store and linguistic query API
Mr-TalhaIlyas/Tensorflow-Keras-Model-Profiler
Tensorflow-Keras Model Profiler: Tells you model's memory requirement, no. of parameters, flops etc.
ina-foss/inaFaceAnalyzer
INA's library with pretrained models for gender and age prediction from faces.
amrta-coder/LFW-emotion-dataset
Datasets released for facial expression recognition with face masks
biboamy/AVASpeech_Music_Labels
pabarbosa/prosody-scripts
caisa-lab/SPINOS-dataset
SPINOS: A Dataset of Subtle Polarity and Intensity Opinion Shifts
borgr/tutEval
Tutorial on LLM Evaluation in LREC
neuro-symbolic-ai/LangVAE
getalp/genderednews
Code of the GenderedNews project
M-Lancien/Praat_Scripts
Praat Scripts for acoustical analysis