/text-summarizer

An extractive text summarizer

Primary LanguagePythonMIT LicenseMIT

An extractive text summarizer

This is a project for the University of Lethbridge graduate level CPSC 5310 course, Studies in Computation Intelligence: Advanced Data Processing. The strategies implemented are based on the work of Gong and Liu.

Yihong Gong and Xin Liu. Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 19–25. ACM, 2001.

Running

The simple way to run the program is to open a shell and run:

$ python3 run.py

which is equivalent to:

$ python3 runp.py --prep --summarize --eval

All of the generated data can be removed with:

$ python3 run.py --clean

Dependencies

  • Python 3
    • numpy
    • matplotlib

Programs

The following programs are included in the root directory:

  • eval.py: evaluate summarization results.
  • lsa.py: perform a summarization using latent semantic analysis.
  • prep.py: preprocess the corpus into usable data.
  • rel.py: perform a summarization using the relevance measure.
  • run.py: run all of the other scripts in the correct order.