WorksApplications/Sudachi

User dictionary source File Creation from Token and POS mapped file

Opened this issue · 2 comments

if there any Utility to generate the “User dictionary source File” from a raw file ,which has Sentence and its Tokens and POS Mapping for Each Token .
I mean if we have Token and POS mapping , if there any easy way to generate the “User dictionary source File”

For Example , if we have a file as below , or any similar format, can we generate the “User dictionary source File”

image

Do you want to implement an analyzer for Thai(?) only, or use it to analyzer mixed Thai(?)-Japanese data?

I want only for Japanese , the Earlier Example what I given by mistake Thai language , below is the Japanese Example .
i can have in any format , but i can have below parameter

  1. Raw Sentence and its Tokens and POS mapping

image