The dataset poblem
hui09241 opened this issue · 1 comments
hui09241 commented
Hi, since I've also been trying to write a program to classify malicious executable files recently.
However, it's really hard to find about datasets, so I would like to ask if you have a way to share or find datasets.
Please take the time to review this letter.
Thank you.
Best regards,
Annie
0x13enny commented
@hui09241
The malicious samples came from this work which provided 8940 malicious samples and virustotal. However, virustotal samples are not public available, you can issue an academic request here to their mailbox instead. By providing your research purpose, they will share a huge tons of malicious sample about the number of 110k.
On the other hand, the benign samples are collected from cnet.