Pinned Repositories
advhat
AdvHat: Real-world adversarial attack on ArcFace Face ID system
apt-cyg
Apt-cyg, an apt-get like tool for Cygwin
asteroid
The PyTorch-based audio source separation toolkit for researchers
D2GAN
Dual Discriminator Generative Adversarial Nets
dsgd
Dual Stochastic Gradient Descent
eNRBM
EMR-driven nonnegative restricted Boltzmann machines
kaggle-galaxy-zoo
male
MAchine LEarning (MALE)
RRF
Reparameterized Random Features
skvm
Sparkling Vector Machines
tund's Repositories
tund/male
MAchine LEarning (MALE)
tund/asteroid
The PyTorch-based audio source separation toolkit for researchers
tund/AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
tund/clearml-server
ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, ML-Ops and Data-Management
tund/DeepLearning-500-questions
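Deep Learning 500 Questions: explains frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06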
tund/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
tund/docs
TensorFlow documentation
tund/EAST
A tensorflow implementation of EAST text detector
tund/espnet
End-to-End Speech Processing Toolkit
tund/face_recognition
tund/FaceSwap-1
3D face swapping implemented in Python
tund/faceswap-GAN
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
tund/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
tund/fiftyone
The open-source tool for building high-quality datasets and computer vision models
tund/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
tund/openpilot
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for over 85 supported car makes and models.
tund/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
tund/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
tund/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
tund/rnnoise
Recurrent neural network for audio noise reduction
tund/rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
tund/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
tund/TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supports languages that can use characters or subwords
tund/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supports English, Korean, Chinese, and German, and is easy to adapt to other languages)
tund/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
tund/text-detection-ctpn
Text detection based mainly on the CTPN (connectionist text proposal network) model in TensorFlow, including ID card detection
tund/transformer-tensorflow
Implementation of Transformer Model in Tensorflow
tund/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
tund/warp-ctc
Fast parallel CTC.
tund/wer_are_we
Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.