Spectra456's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
afshinea/stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
openai/triton
Development repository for the Triton language and compiler
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
worldveil/dejavu
Audio fingerprinting and recognition in Python
hwdsl2/docker-ipsec-vpn-server
Docker image to run an IPsec VPN server, with IPsec/L2TP, Cisco IPsec and IKEv2
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
AndrewStetsenko/tech-jobs-with-relocation
All-in-one guide to getting a tech job abroad 🌎
facebookresearch/swav
PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
alibaba-damo-academy/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
black-shadows/Cracking-the-Coding-Interview
Learn how to uncover the hints and hidden details in a question, discover how to break down a problem into manageable chunks, develop techniques to unstick yourself when stuck, learn (or re-learn) core computer science concepts, and practice on 189 interview questions and solutions.
markovka17/dla
Deep learning for audio processing
wq2012/SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
akashmjn/tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
MobileTeleSystems/RecTools
RecTools - library to build Recommendation Systems easier and faster than ever before
mindslab-ai/nuwave
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021
Picovoice/cobra
On-device voice activity detection (VAD) powered by deep learning
sovaai/sova-dataset
huangzehao/torch-srgan
torch implementation of srgan
NXTProduct/TUNet
X-LANCE/MSDWILD
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
lucasjinreal/yolovn
Just another yolo variant.
FlorinAndrei/soundspec
Signals processing: spectrum visualizer for audio files; uses the Fourier transform with scipy.
dayyass/pydfs
Distributed File System written in Python