Receiving Invalid Results
thedamnedrhino opened this issue · 1 comments
Hi,
I'm trying to use your stemmer on a Linux machine. I run it with perl perstem.pl input > output, where input is the name of the input file, which contains this single line:
کتابهاصفحه ها صفحهی صفحه ی
However, after running the script, I get the following in the output file:
+�+C+z+A+�+z+N+++F+� +�+z +N+++F+��� +N+++F+� ��
As far as I can tell, the output is UTF-8 encoded, but the characters are somehow garbled.
I would really appreciate it if you could get back to me on this as soon as you can. I could also fix it myself if there really is a problem and you point me in the right direction.
My mistake: it should have been called using perl perstem.pl < input > output. (I thought < input > was a placeholder for the input.)
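
For anyone else who hits this, here is a minimal sketch of what the redirection form does. Note that stdin_filter is a hypothetical stand-in, not perstem.pl itself: it represents any script that reads its text from standard input. The temporary directory and the sample text are likewise just for illustration.

```shell
# Work in a throwaway directory so no existing files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical stand-in for a script that reads only from standard input.
stdin_filter() {
    tr 'a-z' 'A-Z'
}

# Create a one-line input file.
printf 'abc\n' > input

# "< input" makes the shell connect the file to the filter's standard input;
# "> output" sends the filter's standard output to the file "output".
stdin_filter < input > output

cat output
```

In this form the angle brackets are shell redirection operators and are typed literally, so the script itself never sees the file names.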