Link of the paper: https://ieeexplore.ieee.org/abstract/document/8636775
This goal of this project is to find the origin of replication in Genome sequence. We used DNA sequence based features in here. The dataset contains 811 features among which there are 405 are OriC strings and 406 strings are Non_OriC.
The CSV file contains 1364 k-mer features, where k= 1,2,3,4,5.