This Repo is the building blocs for the first ever Fake news and sentiment analysis corpus in the algerian dialect, it contains all the codes from the data analysis to training and all.
This Python script performs analysis on Arabic language text data, including counting Modern Standard Arabic (MSA) words, determining vocabulary size, and counting Latin characters.
- Ensure you have Python installed on your system.
- Install the required libraries using pip:
pip install pandasgit clone https://github.com/your_username/arabic-language-analysis.git
cd arabic-language-analysispath = "path/to/your/msa_dataset.csv"
path2 = "path/to/your/latin_dataset.csv"python analyze_arabic.pyReplace "path/to/your/msa_dataset.csv" and "path/to/your/latin_dataset.csv" with the actual file paths to your MSA and Latin datasets. Also, make sure to name your Python script accordingly (e.g., analyze_arabic.py).
@article{abdedaiem2023fake,
title={Fake News Detection in Low Resource Languages using SetFit Framework},
author={Abdedaiem, Amin and Dahou, Abdelhalim Hafedh and Cheragui, Mohamed Amine},
journal={Inteligencia Artificial},
volume={26},
number={72},
pages={178--201},
year={2023}
}