Elasticsearch Analysis Neologd Plugin provides Tokenizer/CharFilter/TokenFilter for Kuromoji with Neologd.
Versions are listed in the Maven Repository and, for 6.5.1 and earlier, in the Old Repository.
For bugs or questions, please file an issue. (A Japanese forum is also available.)
$ $ES_HOME/bin/elasticsearch-plugin install org.codelibs:elasticsearch-analysis-kuromoji-ipadic-neologd:7.1.0
The plugin provides the following char filter, analyzer, tokenizer, and token filters.
name | type |
---|---|
kuromoji_ipadic_neologd_iteration_mark | charfilter |
kuromoji_ipadic_neologd | analyzer |
kuromoji_ipadic_neologd_tokenizer | tokenizer |
kuromoji_ipadic_neologd_baseform | tokenfilter |
kuromoji_ipadic_neologd_part_of_speech | tokenfilter |
kuromoji_ipadic_neologd_readingform | tokenfilter |
kuromoji_ipadic_neologd_stemmer | tokenfilter |
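
As a minimal sketch of how the components in the table might be combined, the index settings below define a custom analyzer. This assumes the standard Elasticsearch custom-analyzer settings layout; the `my_*` names are hypothetical placeholders, and only the `type` values are taken from the table above.

```json
{
  "settings": {
    "analysis": {
      "char_filter": {
        "my_iteration_mark": {
          "type": "kuromoji_ipadic_neologd_iteration_mark"
        }
      },
      "tokenizer": {
        "my_neologd_tokenizer": {
          "type": "kuromoji_ipadic_neologd_tokenizer"
        }
      },
      "filter": {
        "my_baseform": {
          "type": "kuromoji_ipadic_neologd_baseform"
        },
        "my_part_of_speech": {
          "type": "kuromoji_ipadic_neologd_part_of_speech"
        }
      },
      "analyzer": {
        "my_neologd_analyzer": {
          "type": "custom",
          "char_filter": ["my_iteration_mark"],
          "tokenizer": "my_neologd_tokenizer",
          "filter": ["my_baseform", "my_part_of_speech"]
        }
      }
    }
  }
}
```

Settings like these would typically be sent as the body of an index creation request. If no customization is needed, the prebuilt `kuromoji_ipadic_neologd` analyzer can be referenced by name instead.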
To update to the latest Lucene Neologd jar file, download it from https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-ipadic-neologd/ and replace the old file in $ES_HOME/plugins/analysis-kuromoji-ipadic-neologd.
For details about the NEologd dictionary, see mecab-ipadic-NEologd.
To use Lucene Kuromoji for Neologd in an application other than Elasticsearch, use the lucene-analyzers-kuromoji-ipadic-neologd jar file instead of this plugin. To use the jar file, add the following settings to your pom.xml.
...
<repositories>
    <repository>
        <id>codelibs.org</id>
        <name>CodeLibs Repository</name>
        <url>https://maven.codelibs.org/</url>
    </repository>
</repositories>
...
<dependencies>
    <dependency>
        <groupId>org.codelibs</groupId>
        <artifactId>lucene-analyzers-kuromoji-ipadic-neologd</artifactId>
        <version>6.4.0-20180927</version>
        <!-- https://maven.codelibs.org/org/codelibs/lucene-analyzers-kuromoji-ipadic-neologd/ -->
    </dependency>
...