/process-arabic-text

Pre-process arabic text (remove diacritics, punctuations and repeating characters)

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Pre-process Arabic Text

Pre-process arabic text (remove diacritics, punctuations, and repeating characters)

Usage:

Usage: clean_arabic_text.py [-h] -i INFILE -o OUTFILE

Pre-process arabic text (remove diacritics, punctuations, and repeating
characters).

optional arguments:
  -h, --help            show this help message and exit
  -i INFILE, --infile INFILE
                        input file.
  -o OUTFILE, --outfile OUTFILE
                        out file.

Example

python clean_arabic_text.py -i infile.txt -o outfile.txt

How to contribute

Your contributions to improve the code are welcomed. Please follow the steps below.

  1. Fork the project.
  2. Modify the code, test it, make sure that it works fine.
  3. Make a pull request.

Please consult github help to get help.