/journals-classificator

Classification of articles from two different journals (IEEE Transactions on Pattern Analysis and Machine Intelligence vs. IEEE Transactions on Systems, Man, and Cybernetics: Systems) through machine learning

Primary Language: Python

journals-classificator

Classification of articles from two different journals, IEEE Transactions on Pattern Analysis and Machine Intelligence (Pattern file) and IEEE Transactions on Systems, Man, and Cybernetics: Systems (systems file), through machine learning using the bag-of-words method. Workflow:

  1. Read the files and extract the title, paragraph, and keywords of every article separately.
  2. Identify stop words.
  3. Tokenize.
  4. Lemmatize.
  5. Classification. Classification was performed using only the title, only the paragraph, or only the keywords, and also using all of them together, with Sequential Forward Floating Selection (SFFS) used to identify the words that best define each journal.
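
The workflow above can be illustrated with a minimal, standard-library-only sketch. It is not the repository's code: the stop-word list, the toy `lemmatize` function (which only strips a plural "s" and stands in for a real lemmatizer), the nearest-centroid classifier, and the four article titles are all assumptions made up for illustration, and every function name is hypothetical. The `sffs` function follows the usual floating scheme: add the best feature, then remove features for as long as that beats the best smaller subset seen so far. It assumes `k` does not exceed the number of features.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real run would use a much fuller one.
STOP_WORDS = {"a", "an", "the", "of", "for", "and", "in", "on", "with", "to"}

def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def lemmatize(token):
    """Toy stand-in for a real lemmatizer: strips a plural 's' only."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token

def bag_of_words(text):
    """Word counts after stop-word removal and (toy) lemmatization."""
    return Counter(lemmatize(t) for t in tokenize(text) if t not in STOP_WORDS)

def centroid_accuracy(words, bags, labels):
    """Training accuracy of a nearest-centroid classifier restricted to `words`."""
    classes = sorted(set(labels))
    centroids = {}
    for c in classes:
        members = [b for b, y in zip(bags, labels) if y == c]
        centroids[c] = [sum(b[w] for b in members) / len(members) for w in words]
    correct = 0
    for b, y in zip(bags, labels):
        vec = [b[w] for w in words]  # Counter returns 0 for absent words
        pred = min(classes, key=lambda c: sum((v - m) ** 2
                                              for v, m in zip(vec, centroids[c])))
        correct += pred == y
    return correct / len(bags)

def sffs(features, score, k):
    """Sequential Forward Floating Selection of k features maximizing score(subset)."""
    selected, best = [], {}  # best[n] = best score seen for any subset of size n
    while len(selected) < k:
        # Forward step: add the single feature that yields the highest score.
        candidates = [f for f in features if f not in selected]
        selected = selected + [max(candidates, key=lambda f: score(selected + [f]))]
        n = len(selected)
        best[n] = max(score(selected), best.get(n, float("-inf")))
        # Floating step: drop features while that beats the best smaller subset.
        while len(selected) > 2:
            reduced = max(([g for g in selected if g != f] for f in selected),
                          key=score)
            if score(reduced) <= best[len(reduced)]:
                break
            selected, best[len(reduced)] = reduced, score(reduced)
    return selected

# Made-up titles, used only to exercise the functions above.
articles = [
    ("Deep features for image segmentation", "pattern"),
    ("Robust image recognition with kernels", "pattern"),
    ("Adaptive control of networked systems", "systems"),
    ("Fuzzy control for multi-agent systems", "systems"),
]
bags = [bag_of_words(title) for title, _ in articles]
labels = [label for _, label in articles]
vocabulary = sorted(set().union(*bags))

selected = sffs(vocabulary, lambda ws: centroid_accuracy(ws, bags, labels), k=2)
print(selected)
```

On real data the criterion passed to `sffs` would normally be a cross-validated score rather than training accuracy, and the same selection could be run separately on titles, paragraphs, and keywords to compare the three sources.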