cbaziotis/ekphrasis

Spell corrector in other languages

al-jwarizmi opened this issue · 0 comments

I understand that, following Peter Norvig's approach to spelling correction, it should be relatively easy to make the corrector work for any given language, provided a big (and reliable) corpus is available.
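To make clear what I mean, here is a minimal sketch of the Norvig-style idea as I understand it. This is not ekphrasis code: the function names (`build_counts`, `edits1`, `correct`) and the tiny Spanish sample text are made up for illustration, and it only considers candidates one edit away. The point is that nothing in it is language-specific except the corpus, since the alphabet is derived from the corpus itself.

```python
import re
from collections import Counter


def build_counts(corpus_text):
    """Count word frequencies in a raw text corpus (any language)."""
    return Counter(re.findall(r"\w+", corpus_text.lower()))


def edits1(word, alphabet):
    """All strings that are one edit (delete, transpose, replace, insert) away."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [left + right[1] + right[0] + right[2:]
                  for left, right in splits if len(right) > 1]
    replaces = [left + c + right[1:] for left, right in splits if right
                for c in alphabet]
    inserts = [left + c + right for left, right in splits for c in alphabet]
    return set(deletes + transposes + replaces + inserts)


def correct(word, counts, alphabet):
    """Return the most frequent known word within one edit, else the word itself."""
    if word in counts:
        return word
    candidates = [w for w in edits1(word, alphabet) if w in counts]
    return max(candidates, key=counts.get) if candidates else word


# Hypothetical toy corpus; a real one would need to be far larger.
counts = build_counts("el niño come una manzana y el niño bebe agua")
# Derive the alphabet from the corpus so accented letters are covered.
alphabet = set("".join(counts))

print(correct("nino", counts, alphabet))
```

So with a big enough word-count file for another language, the same logic should carry over unchanged.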

I know that you can set the corpus to either "english" or "twitter", and that the function ekphrasis.utils.read_stats() will load the corresponding corpus file. What I don't know is where I have to store a new corpus file so that it is used for spelling correction, and what to set as the corrector argument when instantiating a TextPreProcessor, for instance.