nlpyang/PreSumm

data preprocessing: empty 'tgt' text

connie-n opened this issue · 1 comments

Hi,
I'm doing data preprocessing with my own data.
I converted my data to json file which has 'src', and 'tgt' dictionary.
and then run data preparation code step 2, 3, 4, 5.

but there is no tgt text. It's empty. all text is input as 'src'.
could anyone help me?

my data shape is below.
스크린샷 2022-06-29 오후 3 19 40

below is the data shape which is done step3&step 4.
all tokenized word is put as 'src'....
스크린샷 2022-06-29 오후 3 23 18

I added '@highlight' before summary sentence and it was cleared. thanks.