Quartz14/NLP_2022_TST

Jupyter Notebook

NLP_2022_TST

scoring_generated_sent has rough code for analysis of generated text
function to remove special tokens
- pattern = re.compile("")
- def clean_text(s):
  - new_s = pattern.split(s)[0]
  - new_s = re.sub(r'<.*?>', '', new_s)
  - return new_s
The cells in the notebook may be out of order, will correct it