Measure-Similarity-Between-Arabic-Sentences-using-TF-IDF

Compute the TF-IDF similarity between tweets and a set of keywords defined in a YAML file.

Clone the repository: git clone https://github.com/Ghonem22/Measure-Similarity-Between-Arabic-Sentences-using-TF-IDF.git
Change directory to the project folder
Create a virtual environment: conda create -n similarity python=3.9
Activate the virtual environment: conda activate similarity
Install the requirements: pip install -r requirements.txt
Edit the keywords in the YAML file
Run in a terminal: python get_similarity.py
The code will generate a CSV file with new columns (one new column per keyword)
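
To illustrate what "one new column per keyword" means, here is a minimal pure-Python sketch of TF-IDF cosine similarity between tweets and keywords. It is not the code in get_similarity.py: the function names (tokenize, tfidf_vectors, cosine, keyword_similarity), the whitespace tokenizer, the smoothed IDF formula, and the sample tweets and keywords are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: plain whitespace tokenization. Real Arabic text usually
    # also needs normalization (diacritics, letter variants) before this.
    return text.split()


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using a smoothed IDF."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def keyword_similarity(tweets, keywords):
    """Return one row per tweet, with one similarity column per keyword."""
    # Fit TF-IDF on tweets and keywords together so they share one vocabulary.
    vectors = tfidf_vectors(tweets + keywords)
    tweet_vecs = vectors[:len(tweets)]
    keyword_vecs = vectors[len(tweets):]
    rows = []
    for tweet, t_vec in zip(tweets, tweet_vecs):
        row = {"tweet": tweet}
        for kw, k_vec in zip(keywords, keyword_vecs):
            row[kw] = cosine(t_vec, k_vec)
        rows.append(row)
    return rows


# Hypothetical sample data, not taken from the project's test file.
tweets = ["الطقس جميل اليوم", "مباراة كرة القدم الليلة"]
keywords = ["الطقس", "كرة القدم"]
rows = keyword_similarity(tweets, keywords)
```

Each entry in rows is a dict that maps "tweet" and every keyword to a value, so the list could be written to a CSV file with csv.DictWriter from the standard library, giving one column per keyword.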

You can find the data used for testing here: "1.text"