A Ripple Down Rules-based (RDR) Part-Of-Speech Tagger implementation based on RDRPOSTagger.
This library assume that you are using Python 3.5 or later.
Assumption:
- Python 3.5
- Only support 1 sentence
- The sentence must be tokenized
Current supported languages:
- Thai
Installation is really straight forward, just:
pip install artagger
Just do a quick example if you want to use this library.
>> from artagger import Tagger
>> tagger = Tagger()
>> words = tagger.tag("ผม รัก คุณ")
The tag()
function will return you an array of Word
.
>> for word in words:
>> print("%s/%s" % (word.word, word.tag))
Run above syntax to get any word and tag inside sentence.
-
Dat Quoc Nguyen, Dai Quoc Nguyen, Dang Duc Pham and Son Bao Pham. RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014, pp. 17-20, 2014. [.PDF] [.bib]
-
Dat Quoc Nguyen, Dai Quoc Nguyen, Dang Duc Pham and Son Bao Pham. A Robust Transformation-Based Learning Approach Using Ripple Down Rules for Part-Of-Speech Tagging. AI Communications (AICom), vol. 29, no. 3, pp. 409-422, 2016. [.PDF] [.bib]