tshatrov/ichiran

[feature request] ichiran-cli segment sentence into dictionary forms

Sakusuke opened this issue · 2 comments

Requesting a flag which turns the input of a sentence like: "昨日すき焼きを食べました" into "昨日;すき焼き;を;食べる".
Needing this for automation with a command line dictionary. -i and -f already kinda do this but getting the dictionary forms from their output is a bit of a hassle and they take quite a bit longer to execute than a normal 'simple-segment' because of the included definitions (atleast that's what I'm guessing). Also this seems like a common enough usecase to warrent a flag.

This operation is not well defined, i.e. きました -> くる(来る) or きる(着る). Also seems pretty difficult to implement in general, but I haven't thought about it very hard.

Very good point. The perfect solution would be analyzing context, but since that's way out of scope resorting to frequency would be good enough.

this sentence wouldn't work correctly without the space, but it merely serves as an example of how the flag's output could look like

$ ichiran-cli -foobar 彼は きました
彼
は
来る;着る