mrjunjieli/kws_challenge

Python

Conv-TasNet

These files are modified according to
This Conv-TasNet model has been modified to target speaker separation .

Reference

Conv-Tesnet
"TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation".

Usage

Before running, you need to have mixture and target audio data.

Workflow

Workflow of egs/run.sh:

Stage 1: Generating json files including wav path and duration
Stage 2: Training
Stage 3: Evaluate separation performance
Stage 4: Separate speech using Conv-TasNet

How to resume training?

$ bash run.sh --continue_from <model-path>

vad.py

filter out the silence in the signal