Code for "Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization" The codes are also used for "A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification"
- Tensorflow r1.0.1
- Python 3.5
- CUDA 8.0 (For GPU)
- ROUGE
The dataset in the paper is Large Scale Chinese Short Text Summarization (LCSTS). To preprocess the data, please split the sentences into characters, and transform the characters into numbers (ids).
python3 MleTrain.py
If you use this code for your research, please cite the paper this code is based on: Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization:
@inproceedings{MaEA2017,
author = {Shuming Ma and Xu Sun and Jingjing Xu and Houfeng Wang and Wenjie Li and Qi Su},
title = {Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational
Linguistics, {ACL} 2017, Vancouver, Canada, July 30 - August 4, Volume
2: Short Papers},
pages = {635--640},
year = {2017}
}