- Clone the repository: git clone https://github.com/Ghonem22/Measure-Similarity-Between-Arabic-Sentences-using-TF-IDF.git
- Change directory to the project folder: cd Measure-Similarity-Between-Arabic-Sentences-using-TF-IDF
- Create a virtual environment: conda create -n similarity python=3.9
- Activate the virtual environment: conda activate similarity
- Install the requirements: pip install -r requirements.txt
- Edit the keywords in the YAML file
- Run in a terminal: python get_similarity.py
- The code will generate a CSV file with new columns (one new column per keyword)
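
To give a rough idea of what "one new column per keyword" can look like, here is a minimal, standard-library-only sketch of TF-IDF cosine similarity between sentences and keywords. It is not the code in get_similarity.py: the function names (`tfidf_vectors`, `add_keyword_columns`, `to_csv`), the `text` column name, the whitespace tokenizer, the smoothed IDF formula, and the sample sentences are all assumptions made for illustration, and the keywords are hard-coded here rather than read from the YAML file.

```python
import csv
import io
import math
from collections import Counter


def tokenize(text):
    # Assumption: plain whitespace splitting. Real Arabic text would usually
    # also need normalization (diacritics, letter variants) before this step.
    return text.split()


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using a smoothed IDF."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {term: math.log((1 + n_docs) / (1 + count)) + 1
           for term, count in df.items()}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({term: tf[term] * idf[term] for term in tf})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def add_keyword_columns(sentences, keywords):
    """Build one row per sentence with one similarity column per keyword."""
    vectors = tfidf_vectors(sentences + keywords)
    sentence_vecs = vectors[:len(sentences)]
    keyword_vecs = vectors[len(sentences):]
    rows = []
    for sentence, s_vec in zip(sentences, sentence_vecs):
        row = {"text": sentence}
        for keyword, k_vec in zip(keywords, keyword_vecs):
            row[keyword] = round(cosine(s_vec, k_vec), 3)
        rows.append(row)
    return rows


def to_csv(rows):
    """Serialize the rows to CSV text, using the first row's keys as header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical sample data, not taken from the project's test set.
sentences = ["الطقس جميل اليوم", "أحب قراءة الكتب"]
keywords = ["الطقس", "الكتب"]
rows = add_keyword_columns(sentences, keywords)
print(to_csv(rows))
```

Each sentence gets a nonzero score only for keywords whose terms it shares, so the first sample sentence scores above zero for the first keyword and zero for the second.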
The data used for testing ("1.text") is available here:
https://drive.google.com/drive/u/0/folders/1KWS7me9LYnpwfYywF5bn1dzt-omThrUP