Data Processing Scripts
Closed this issue · 2 comments
t1101675 commented
Nice work!
I'm wondering whether you could also release the scripts for data processing or the processed .src
and .tgt
files. It would be very helpful for reproducing the results and further extension.
yongchanghao commented
Thank you for your interests. The .src and .tgt files are just the normal parallel text files. For example, if you want to replicate the results on DD, you can extract them from the csv files in this repo. There is no additional data processing step.
I will provide the scripts for extracting the csv file along with other improvements for replicating.
t1101675 commented
Thanks!