WorksApplications/sudachi.rs

Documentation for the tagset used by the POS tagger of Sudachi

BLKSerene opened this issue · 5 comments

Hi, I'm wondering that if there are any documentation for the tagset used by the POS tagger of Sudachi?

Thanks, but neither the pdf link nor the Python code snippet works now.

If you are using 0.6.2, the following snippet should work

import sudachipy
sudachi_dic = sudachipy.Dictionary()
matcher = sudachi_dic.pos_matcher([()])
for pos_id, pos in enumerate(matcher):
  print(pos_id, ",".join(pos), sep="\t")

It seems that the BCCWJ manual has been moved here.
https://ccd.ninjal.ac.jp/bccwj/doc/manual/BCCWJ_Manual_05.pdf

Thanks!