Pinned Repositories
act-tensorflow
Adaptive Computation Time algorithm in Tensorflow
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
aishell4-preprocess
This project is used to generate multi-speaker speech without speaker overlap for AISHELL-4 dataset.
bert
TensorFlow code and pre-trained models for BERT
bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
flywithcloud.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
SEQ-SCD
wav2vec-sid
apply wav2vec 2.0 to speaker identification
zhiyunfan's Repositories
zhiyunfan/SEQ-SCD
zhiyunfan/aishell4-preprocess
This project is used to generate multi-speaker speech without speaker overlap for AISHELL-4 dataset.
zhiyunfan/wav2vec-sid
apply wav2vec 2.0 to speaker identification
zhiyunfan/W2V-SV
This is a projection to apply wav2vec 2.0 to speaker verification.
zhiyunfan/act-tensorflow
Adaptive Computation Time algorithm in Tensorflow
zhiyunfan/AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
zhiyunfan/AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
zhiyunfan/bert
TensorFlow code and pre-trained models for BERT
zhiyunfan/bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
zhiyunfan/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
zhiyunfan/flywithcloud.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
zhiyunfan/GAN_mapping_relationship
Code:Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings
zhiyunfan/git-study
a doce of git study
zhiyunfan/globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
zhiyunfan/hogg2019-icassp-paper
Speaker change detection using fundamental frequency with application to multi-talker segmentation
zhiyunfan/kaldi
This is now the official location of the Kaldi project.
zhiyunfan/learngit
zhiyunfan/LeetcodeTop
汇总各大互联网公司容易考察的高频leetcode题🔥
zhiyunfan/MASS
MAsked Sequence to Sequence (MASS) pre-training for language generation
zhiyunfan/mygit
zhiyunfan/paper_reading
zhiyunfan/RL-SCD
zhiyunfan/SEQ-SCD-DEMO
Demo the SEQ-SCD
zhiyunfan/Speech-Transformer-tf2.0
transformer for ASR-systerm (via tensorflow2.0)
zhiyunfan/speech2vec
Autoencoders for speech
zhiyunfan/Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
zhiyunfan/UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
zhiyunfan/WannaFB
记录2022届各大厂计算机的福报信息(提前批、正式批的网申内推信息)
zhiyunfan/web-speech-recorder
Record audio and save it with flask app