tund

Pinned Repositories

advhat
AdvHat: Real-world adversarial attack on ArcFace Face ID system
Language:Python00
apt-cyg
Apt-cyg, an apt-get like tool for Cygwin
Language:Shell0 2 00
asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python0 1 00
D2GAN
Dual Discriminator Generative Adversarial Nets
Language:Python63 2 318
dsgd
Dual Stochastic Gradient Descent
Language:Python1 2 01
eNRBM
EMR-driven nonnegative restricted Boltzmann machines
Language:MATLAB2 2 01
kaggle-galaxy-zoo
Language:Python11 3 07
male
MAchine LEarning (MALE)
Language:Python4 2 211
RRF
Reparameterized Random Features
Language:Python6 2 11
skvm
Sparkling Vector Machines
Language:MATLAB2 3 00

tund's Repositories

tund/male
MAchine LEarning (MALE)
Language:Python4 2 211
tund/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python0 1 00
tund/AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"
Language:Jupyter Notebook1 0
tund/clearml-server
ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, ML-Ops and Data-Management
tund/DeepLearning-500-questions
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06
tund/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language:C++1 0
tund/docs
TensorFlow documentation
Language:Jupyter Notebook1 0
tund/EAST
A tensorflow implementation of EAST text detector
Language:C++1 0
tund/espnet
End-to-End Speech Processing Toolkit
Language:Python
tund/face_recognition
tund/FaceSwap-1
3D face swapping implemented in Python
tund/faceswap-GAN
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
Language:Jupyter Notebook1 0
tund/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python1 0
tund/fiftyone
The open-source tool for building high-quality datasets and computer vision models
tund/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell1 0
tund/openpilot
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for over 85 supported car makes and models.
Language:C++1 0
tund/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
tund/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Language:Jupyter Notebook1 0
tund/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python1 0
tund/rnnoise
Recurrent neural network for audio noise reduction
Language:C1 0
tund/rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Language:Python1 0
tund/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook1 0
tund/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Language:Python1 0
tund/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Easy to adapt for other languages)
tund/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Language:Python1 0
tund/text-detection-ctpn
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
Language:Python1 0
tund/transformer-tensorflow
Implementation of Transformer Model in Tensorflow
Language:Python1 0
tund/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
Language:Python1 0
tund/warp-ctc
Fast parallel CTC.
Language:Cuda1 0
tund/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.