DanielLin94144
Ph.D. @ NTU Speech Processing and Machine Learning Laboratory. Deep Learning for Speech Processing.
National Taiwan University, Taiwan
Pinned Repositories
Algorithm
NTHU EE3980
DanielLin94144.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of the NMSQA dataset and the implementation of the paper "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning".
E2E-ASR-Pytorch
End-to-End-jointCTC-Attention-ASR
StyleTalk
Official release of the StyleTalk dataset.
Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition models using a single utterance. The official implementation of the paper "Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition".
unsupervised_ASR_challenge
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
superb-prosody
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
DanielLin94144's Repositories
DanielLin94144/StyleTalk
Official release of the StyleTalk dataset.
DanielLin94144/DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of the NMSQA dataset and the implementation of the paper "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning".
DanielLin94144/Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition models using a single utterance. The official implementation of the paper "Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition".
DanielLin94144/E2E-ASR-Pytorch
DanielLin94144/End-to-End-jointCTC-Attention-ASR
DanielLin94144/unsupervised_ASR_challenge
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
DanielLin94144/Algorithm
NTHU EE3980
DanielLin94144/DanielLin94144.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
DanielLin94144/FixMatch-pytorch
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
DanielLin94144/Protect-Your-Voice
Official implementation of Meta-StyleSpeech and StyleSpeech
DanielLin94144/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
DanielLin94144/CLUENER2020
CLUENER2020: Chinese Fine-Grained Named Entity Recognition
DanielLin94144/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
DanielLin94144/DM2020-Lab1-Homework1
DanielLin94144/DM2020-Lab2-Homework
DanielLin94144/DM2020-Lab2-Master
DanielLin94144/Emphasized-Talk
Official release of the Emphasized-Talk dataset.
DanielLin94144/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
DanielLin94144/fixmatch
A simple method to perform semi-supervised learning with limited data.
DanielLin94144/GlossBERT
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)
DanielLin94144/hubert-cluster-code
Extract clustering features from HuBERT
DanielLin94144/Linear_Algebra_2023fall_Hw3
DanielLin94144/Meta-TTS
Official repository of https://arxiv.org/abs/2111.04040v1
DanielLin94144/ML-attack-dataset
DanielLin94144/MLSS-2021-Taipei-Collaborative-Notes
DanielLin94144/mulit
DanielLin94144/SpeechMix
Explore different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT)
DanielLin94144/tango
Code and models for the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
DanielLin94144/tent
ICLR21 Tent: Fully Test-Time Adaptation by Entropy Minimization
DanielLin94144/VISinger2
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer