HLTCHKUST/VG-GPLMs

Question about the code

Closed this issue · 1 comments

Hi, thanks for your awesome work! I have a question about the code:

self.src = [" ".join(item.split()[1:]) for item in self.src]

Why did you ignore the first word in the src and tgt?
Looking forward to your reply.

Hi,
Actually, the first word is not a word but the sample ID, which is a number, if you download the date from the way I suggested in the readme.