test_vctk.meta

Question

test_vctk.meta

jardnzm opened this issue 3 years ago · 5 comments

jardnzm commented 3 years ago

Hi,

I am wondering how the "test_vctk.meta" is created in the demo file?

Thanks!

auspicious3000 commented 3 years ago

Yes.

No.

Answer 1 · 2021-08-07T16:19:02.000Z

Just put the cepstrum and speaker embedding into a list.

Answer 2 · 2021-08-09T15:08:27.000Z

Thanks! By speaker embedding, I guess you mean the one hot representation? And does it indicate that the speaker that we "transfers to" need to be seen in the training data?
One more question, is the output quality restricted by the input length ? The demo audios are all like 1s-2s. How does it performs on 5s-10s audios?

Answer 3 · 2021-08-09T18:58:05.000Z

Just put the cepstrum and speaker embedding into a list.

Hi, is cepstrum computed by one of these three in prepare_train_data.py? If so can you point
me which one is it? If not, are there any codes in the github reflect this computation?
(https://github.com/auspicious3000/AutoPST/blob/main/prepare_train_data.py)
np.save(os.path.join(targetDir_cd, subdir, fileName[:-4]), codes.cpu().numpy(), allow_pickle=False) np.save(os.path.join(targetDir_sp, subdir, fileName[:-4]), S.astype(np.float32), allow_pickle=False) np.save(os.path.join(targetDir_cep, subdir, fileName[:-4]), cc_norm.astype(np.float32), allow_pickle=False)

Answer 4 · 2021-08-10T01:19:35.000Z

cep stands for cepstrum