DanielLin94144

Ph.D. @ NTU Speech Processing and Machine Learning Laboratory. Deep Learning for Speech Processing.

National Taiwan UniversityTaiwan

Pinned Repositories

Algorithm
NTHU EE3980
Language:C1 2 00
DanielLin94144.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript1 1 00
DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
Language:Python34 7 211
E2E-ASR-Pytorch
Language:Python7 3 02
End-to-End-jointCTC-Attention-ASR
Language:Jupyter Notebook4 2 10
StyleTalk
Official release of StyleTalk dataset.
51 7 12
Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition" paper.
Language:Python15 3 36
unsupervised_ASR_challenge
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python4 1 00
superb-prosody
Language:Python30 5 03
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Language:Python234 7 2039

DanielLin94144's Repositories

DanielLin94144/StyleTalk
Official release of StyleTalk dataset.
51 7 12
DanielLin94144/DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
Language:Python34 7 211
DanielLin94144/Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition" paper.
Language:Python15 3 36
DanielLin94144/E2E-ASR-Pytorch
Language:Python7 3 02
DanielLin94144/End-to-End-jointCTC-Attention-ASR
Language:Jupyter Notebook4 2 10
DanielLin94144/unsupervised_ASR_challenge
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python4 1 00
DanielLin94144/Algorithm
NTHU EE3980
Language:C1 2 00
DanielLin94144/DanielLin94144.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript1 1 00
DanielLin94144/FixMatch-pytorch
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
Language:Python1 1 00
DanielLin94144/Protect-Your-Voice
Official implementation of Meta-StyleSpeech and StyleSpeech
Language:Python1 2 01
DanielLin94144/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language:Python1 0
DanielLin94144/CLUENER2020
CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition
Language:Python1 0
DanielLin94144/CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
Language:Python1 0
DanielLin94144/DM2020-Lab1-Homework1
Language:Jupyter Notebook1 0
DanielLin94144/DM2020-Lab2-Homework
Language:Jupyter Notebook1 0
DanielLin94144/DM2020-Lab2-Master
Language:Jupyter Notebook1 0
DanielLin94144/Emphasized-Talk
Official release of Emphasized-Talk
DanielLin94144/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1 0
DanielLin94144/fixmatch
A simple method to perform semi-supervised learning with limited data.
Language:Python1 0
DanielLin94144/GlossBERT
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)
Language:Jupyter Notebook1 0
DanielLin94144/hubert-cluster-code
Extract clustering feature from hubert
Language:Python1 01
DanielLin94144/Linear_Algebra_2023fall_Hw3
Language:Python
DanielLin94144/Meta-TTS
Official repository of https://arxiv.org/abs/2111.04040v1
Language:Python1 0
DanielLin94144/ML-attack-dataset
2 0
DanielLin94144/MLSS-2021-Taipei-Collaborative-Notes
2 0
DanielLin94144/mulit
Language:Python2 0
DanielLin94144/SpeechMix
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
Language:Python1 0
DanielLin94144/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
Language:Python0 0
DanielLin94144/tent
ICLR21 Tent: Fully Test-Time Adaptation by Entropy Minimization
Language:Python1 0
DanielLin94144/VISinger2
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer