How to divide our own dataset into train, dev and test sets and assign labels for the fine-tuning process
smruti241 opened this issue · 2 comments
smruti241 commented
Hi @jerryji1993, @Zhihan1996, @project-delphi, @hjgwak, @timlautk,
I read your paper and it's very interesting. I have a dataset that consists of 6-mers only. I want to divide it into train, dev and test sets and assign labels so that I can fine-tune directly (no pre-training required; I will use the pre-trained models). Could you please tell me the procedure, or point me to a script in this tool's folders that does this? Thanks!
Moeinh77 commented
Hi, yes, there is a way to load the models with HuggingFace. I have done it in this repository: https://github.com/Moeinh77/Virus-DNA-Classification
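
For the splitting part of the question, here is a minimal sketch in plain Python. It is not a script from this tool: the function names `split_dataset` and `write_tsv`, the 80/10/10 ratio, the toy records, and the output file names are all assumptions for illustration. It also assumes the fine-tuning script reads tab-separated files with a `sequence` and `label` header row, so please check that against the sample data shipped with the tool before relying on it.

```python
import csv
import random
from collections import defaultdict


def split_dataset(records, dev_frac=0.1, test_frac=0.1, seed=42):
    """Stratified split of (sequence, label) pairs into train/dev/test lists.

    Each label is split separately so that all three sets keep roughly
    the same class balance as the full dataset.
    """
    rng = random.Random(seed)

    # Group the records by label.
    by_label = defaultdict(list)
    for sequence, label in records:
        by_label[label].append((sequence, label))

    train, dev, test = [], [], []
    for label in sorted(by_label):
        rows = by_label[label]
        rng.shuffle(rows)
        n_dev = int(len(rows) * dev_frac)
        n_test = int(len(rows) * test_frac)
        dev.extend(rows[:n_dev])
        test.extend(rows[n_dev:n_dev + n_test])
        train.extend(rows[n_dev + n_test:])

    # Shuffle again so the labels are not grouped together in the output.
    for part in (train, dev, test):
        rng.shuffle(part)
    return train, dev, test


def write_tsv(path, rows):
    """Write rows as a tab-separated file with an assumed header row."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t")
        writer.writerow(["sequence", "label"])  # assumed column names
        writer.writerows(rows)


# Hypothetical toy data: 100 space-separated 6-mer sequences, two classes.
# Replace this with your own (sequence, label) pairs.
toy_rng = random.Random(0)
records = []
for i in range(100):
    dna = "".join(toy_rng.choice("ACGT") for _ in range(20))
    kmers = " ".join(dna[j:j + 6] for j in range(len(dna) - 5))
    records.append((kmers, i % 2))

train, dev, test = split_dataset(records, dev_frac=0.1, test_frac=0.1)

if __name__ == "__main__":
    # Output file names are an assumption; use whatever the script expects.
    write_tsv("train.tsv", train)
    write_tsv("dev.tsv", dev)
    write_tsv("test.tsv", test)
```

The split is done per label so that a small dev or test set does not end up with only one class by chance. Fixing `seed` keeps the split reproducible between runs.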