google-research-datasets/wiki-atomic-edits
A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
Issues
- 0
What about editions?
#7 opened by callzhang - 1
- 1
corpus contents
#3 opened by BonnieLWebber - 1
Regarding pre-processing tools used
#4 opened by ajaynagesh - 2
deletions.tsv have insertion examples
#2 opened by pcyin - 2