facebookresearch/CodeGen

Parallel dataset generated by TransCoder-ST

yiqingxyq opened this issue · 0 comments

Hi! I'm just wondering if you could release the parallel dataset generated by TransCoder-ST. It takes a long time to generate and filter out translations from the whole GitHub dataset.

Thanks!

image