Pinned Repositories
3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
acoss-1
acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task
Additive-Margin-Softmax
This is the implementation of paper <Additive Margin Softmax for Face Verification>
Additive-Margin-Softmax-Loss-Pytorch
Additive margin softmax loss in pytorch
AIFaceMakeup
AMSoftmax
A simple yet effective loss function for face verification.
Applying-Face-Makeup
Applying digital makeup on face
AR-Lipstick
ARKit/FirebaseMLVision based virtual lipstick. TRY IT:
Artificial-Eyeliner
Script to apply artificial eyeliner
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
silvadirceu's Repositories
silvadirceu/acoss-1
acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task
silvadirceu/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
silvadirceu/augmented_reality_101
Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.
silvadirceu/Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
silvadirceu/CQTNet
LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020
silvadirceu/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
silvadirceu/ghostvlad-speaker
An tensorflow implementation of ghostvlad for speaker recognition
silvadirceu/IBN-Net
Instance-Batch Normalization Networks (ECCV2018)
silvadirceu/kapre
kapre: Keras Audio Preprocessors
silvadirceu/keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
silvadirceu/Mastering-OpenCV-4-with-Python
Mastering OpenCV 4 with Python, published by Packt
silvadirceu/mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
silvadirceu/musicnn
Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.
silvadirceu/onnx-convert-example
Simple example how to convert an PyTorch model into Tensorflow using ONNX.
silvadirceu/PCV
Open source Python module for computer vision
silvadirceu/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
silvadirceu/re-move
Training and evaluation code for Re-MOVE models with embedding distillation
silvadirceu/rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
silvadirceu/sber-hp
silvadirceu/SCAMP
CPU/GPU Implementation of the SCAMP algorithm for computing the matrix profile
silvadirceu/SHS100K
metadata for SHS100K
silvadirceu/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
silvadirceu/sota-music-tagging-models
silvadirceu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
silvadirceu/TCN
Sequence modeling benchmarks and temporal convolutional networks
silvadirceu/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
silvadirceu/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
silvadirceu/voxceleb_trainer
In defence of metric learning for speaker recognition
silvadirceu/Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
silvadirceu/whisper
Robust Speech Recognition via Large-Scale Weak Supervision