facebookresearch/CodeGen

The lack of paralled dataset in Transcoder-IR

AssassinsAlex opened this issue · 0 comments

Hello, I'm trying to run the command for full model training in transcoder-ir.md, and the program is indicating that the cpp-rust parallel dataset is missing. I would like to inquire if it's possible to provide the parallel dataset used for validation during transcoder-ir training (rust<->cpp, java<->), or how I can convert my own parallel dataset for testing purposes.