Spell corrector in other languages
al-jwarizmi opened this issue · 0 comments
al-jwarizmi commented
I understand that following Peter Norvig's approach for spelling correction it could be relatively easy to have the corrector work for any given language if a big (and reliable) corpus is available.
I know that you can set the corpus to either "english" or "twitter" and the function ekphrasis.utils.read_stats()
will load the corresponding corpus file, I just don't know where do I have to store a new corpus file for it to be used for spelling corrections and what to set as corrector
argument when instantiating a TextPreProcessor
, for instance.