/Dialogue_Summarization_Reproduce

Reproduce the result of different dialogue summarization models.

Primary LanguagePython

Dialogue Summarization Reproduce

Reproduce summarization methods and experiments via pytorch & Transformers, mainly focusing on dialogue summarization.

Available Resource

Model

Extractive

  • LEAD3
  • TextRank

Abstractive

  • Bart-Large
  • Peugsus

Dataset

  • CNNDM3.0
  • XSUM
  • SAMSUM

Experiment Results

Abstractive Methods

CNNDM3.0/Abs ROUGE1 ROUGE2 ROUGEL ROUGEW
Transformer
Bart-Large
Xsum/Abs ROUGE1 ROUGE2 ROUGEL ROUGEW
Bart-Large - - - -
Transformer
Samsum/Abs ROUGE1 ROUGE2 ROUGEL ROUGEW
Bart-Large 48.34 25.65 41.03 -
Peugsus 50.62 26.37 42.28 -
Transformer
Ricos/Abs ROUGE1 ROUGE2 ROUGEL ROUGEW
Bart-Large - - - -
Pointer+Coverage
Transformer
Model Parameters:
BART_Large:
    max_input_length = 512
    max_target_length = 128
    learning_rate=3e-5,
    per_device_train_batch_size=8,
    weight_decay=0.01,
    num_train_epochs=5,
    fp16=True,
    warmup_steps=200
Pegasus:
    max_input_length = 512
    max_target_length = 128
    learning_rate=3e-5,
    per_device_train_batch_size=8,
    weight_decay=0.01,
    num_train_epochs=5,
    fp16=True,
    warmup_steps=200

Extractive Methods

CNNDM3.0/Ext ROUGE1 ROUGE2 ROUGEL ROUGEW
LEAD3 40.52 17.60 31.53 13.63
TextRank
Xsum/Ext ROUGE1 ROUGE2 ROUGEL ROUGEW
LEAD3 19.78 2.71 17.85 8.32
TextRank
Samsum/Ext ROUGE1 ROUGE2 ROUGEL ROUGEW
LEAD3 31.47 8.83 30.41 15.43
MID3 28.15 6.78 27.25 13.64
LAST3 26.38 5.91 25.79 12.75
RANDOM3 28.29 7.36 27.25 13.75
TextRank 27.59 6.93 27.06 13.55
Ricos/Ext ROUGE1 ROUGE2 ROUGEL ROUGEW
LEAD3 27.15 6.83 26.78 13.71
MID3 23.30 4.65 22.84 11.48
LAST3 23.04 4.81 22.39 11.25
RANDOM3 24.39 5.65 23.16 11.77
TextRank 24.41 5.80 23.62 12.03