Python Analyzer for lucene
mustafa0x opened this issue · 0 comments
mustafa0x commented
I'm using classical solr/lucene. I'd like to write a python analyzer (part-of-speech NLP library). I couldn't find a good way to do this using solr/lucene. I came across this file which makes me think lupyne might be a solution:
https://github.com/coady/lupyne/blob/main/lupyne/engine/analyzers.py
Or should I instead process the text before inserting into lucene? Eg replace <word>
with <pos>|<word>
, then tell lucene to split at |
?