pial08/SemVulDet

If I want to test the type of vulnerability in a piece of code, how should I preprocess the test data?

zhang-zrq opened this issue · 2 comments

I followed the preprocessing method provided at the link below, but I was unable to obtain the test.jsonl file.
Link: https://github.com/microsoft/CodeXGLUE/tree/main/Code-Code/Defect-detection/dataset

I sincerely hope that you can provide a test.sh for testing.

pial08 commented

Hi,
You can use this file and split it into train, test, and evaluation files.
Alternatively, you can use the train_test_split() function from sklearn to split the file 80:10:10 into train, eval, and test sets.
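
For reference, here is a minimal sketch of the 80:10:10 split with train_test_split(). It is not the repository's own preprocessing script. It assumes the data is a JSON Lines file with one JSON object per line; the helper names, the file names (data.jsonl, train.jsonl, valid.jsonl, test.jsonl), and the fixed seed are hypothetical and should be adapted to your setup.

```python
import json

from sklearn.model_selection import train_test_split


def split_records(records, seed=42):
    """Split a list of records into 80% train, 10% eval, 10% test."""
    # First cut: hold out 20% of the records.
    train, rest = train_test_split(records, test_size=0.2, random_state=seed)
    # Second cut: divide the held-out 20% evenly into eval and test.
    eval_set, test = train_test_split(rest, test_size=0.5, random_state=seed)
    return train, eval_set, test


def write_jsonl(path, records):
    """Write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec) + "\n")


def split_jsonl_file(in_path):
    """Read a JSON Lines file and write the three splits next to it.

    The output file names are assumptions; use whatever names your
    training and evaluation scripts expect.
    """
    with open(in_path, encoding="utf-8") as f:
        records = [json.loads(line) for line in f if line.strip()]
    train, eval_set, test = split_records(records)
    write_jsonl("train.jsonl", train)
    write_jsonl("valid.jsonl", eval_set)
    write_jsonl("test.jsonl", test)
```

Calling split_jsonl_file("data.jsonl") would then produce the three files. Splitting in two steps is needed because train_test_split() only produces two partitions per call, and fixing random_state keeps the split reproducible between runs.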