In this paper, we proposed a sequential hybrid model based on a transformer to summarize Arabic articles. We used two approaches of summarization to make our model. The First is the extractive approach which depends on the most important sentences from the articles to be the summary, so we used Deep Learning techniques specifically transformers such as AraBert to make our summary, The second is abstractive, and this approach is similar to human summarization, which means that it can use some words which have the same meaning but different from the original text. We apply this kind of summary using MT5 Arabic pre-trained transformer model. We sequentially applied these two summarization approaches to building our A3SUT hybrid model. The output of the extractive module is fed into the abstractive module. We enhanced the summary’s quality to be closer to the human summary by applying this approach. We add some features to our summary to make it more understandable by applying the metadata generation task "data about data" and classification. By applying metadata generation, we add facilities to our summary, identification, and summary organization.
IEEE Paper Link : IEEE Paper
- EASC ( Extractive Summarization Dataset )
- NADA Corpus ( Text Classification Dataset )
- Classical SVM Model with a Grid Search Optimzation Approach
- Extractive Bert Model : Model Source
- Abstractive mt5 Model : Model Source
- Meta Data Extraction Model : Model Source
- Run the TextClassificationModel.ipynb notebook
We run this notebook ( only one time ) to train the Text Classification model and convert it to a pickle file to ensure not to train the model each time we use it.
- Run the MainProject.ipynb notebook
This notebook is the main notebook which loads all of the models (Text Classification model, Extractive Bert model, Abstractive mt5 model, A3SUT (our hybrid model) and Meta Data Extraction model). All outputs are in the Models output section (the last section in MainProject.ipynb).
Contributer Name | GitHub | ||
---|---|---|---|
Mohamed Ehab | Send Email | ||
Ameen Reda | Send Email | ||
Ismail Ahmed | Send Email | ||
John Adel | Send Email | ||
Mostafa Kafafy | Send Email | ||
Nada Salah | Send Email | ||
Assoc. Prof. Ghada Khoriba | Send Email | ||
Assoc. Prof. Enssaf Hussien | Send Email |