PacSum
This code is for paper Sentence Centrality Revisited for Unsupervised Summarization ACL 2019
Some codes are borrowed from pytorch_pretrained_bert and gensim
Dependencies
Python3.6, pytorch >= 1.0, numpy, gensim, pyrouge
Data used in the paper:
Download https://drive.google.com/open?id=1gNKWkZG4dVr5XrOeQBVicy1fdnpH2d5l
Bert models fine-tuned using the approach in the paper:
Download https://drive.google.com/file/d/1wbMlLmnbD_0j7Qs8YY8cSCh935WKKdsP/view?usp=sharing
Tuning the hyperparamters and test the performance using TfIdf or BERT representation
python run.py --rep tfidf --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data
python run.py --rep bert --mode tune --tune_data_file path/to/validation/data --test_data_file path/to/test/data --bert_model_file path/to/model --bert_config_file path/to/config --bert_vocab_file path/to/vocab