jonsafari/perstem

Receiving Invalid Results

thedamnedrhino opened this issue · 1 comments

Hi,
I'm trying to use your stemmer on a Linux machine. I run it with perl perstem.pl input > output, where input is the name of an input file containing this single line:

کتابهاصفحه ها صفحهی صفحه ی

However, after running the script, I get the following in the output file:

+�+C+z+A+�+z+N+++F+� +�+z +N+++F+��� +N+++F+� ��

As far as I can tell, the output is UTF-8 encoded, but the characters are somehow distorted.

I would really appreciate it if you could get back to me on this as soon as you can. I could also fix it myself in case there really is a problem and you point me in the right direction.

My mistake: it should have been called using perl perstem.pl < input > output. (I thought < input > was a placeholder for the input file.)
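
For anyone who hits the same confusion, here is a minimal sketch of what the two operators in that command do. It does not run perstem itself: stdin_filter is a hypothetical stand-in for any program that reads its text from standard input, and the temporary directory and sample text are assumptions made only for illustration.

```shell
# Hypothetical stand-in for a program that reads only from standard input.
stdin_filter() { cat; }

# Scratch directory so the sketch does not overwrite real files.
workdir=$(mktemp -d)
printf '%s\n' 'sample line' > "$workdir/input"

# `<` connects the file to the command's standard input, and
# `>` writes the command's standard output to a file.
# Both are literal shell operators, not placeholder brackets.
stdin_filter < "$workdir/input" > "$workdir/output"

cat "$workdir/output"
```

With the real script, the same shape is the command quoted in the comment above: the input file follows <, and the output file follows >.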