coady/lupyne

Python Analyzer for lucene

mustafa0x opened this issue · 0 comments

I'm using classical solr/lucene. I'd like to write a python analyzer (part-of-speech NLP library). I couldn't find a good way to do this using solr/lucene. I came across this file which makes me think lupyne might be a solution:

https://github.com/coady/lupyne/blob/main/lupyne/engine/analyzers.py

Or should I instead process the text before inserting into lucene? Eg replace <word> with <pos>|<word>, then tell lucene to split at |?