google-research-datasets/wiki-split
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stargazers
- aagohary (Microsoft)
- albertoseabra
- alexrigler (@aldaleri)
- aminorex (Gedanken Laboratories)
- bkamber (@KareOfis @SuitSoftware)
- buaahsh (Microsoft Research)
- BusbyActual (Los Angeles)
- crazyofapple (Shenzhen)
- donglixp (Microsoft Research)
- dongyuguoai1
- easonnie
- EricAugust
- fly51fly (PRIS)
- getao (Microsoft)
- gyn0806
- Imagist-Shuo (Beihang University)
- j-min (UNC Chapel Hill)
- j6mes (KAIST)
- JoGreen (Rome)
- JohannesTK (@galadriel-ai)
- jusjosgra (Healx)
- magic282
- MarkWuNLP (Microsoft Research)
- massanishi (Toronto)
- miweru
- nzw0301 (Preferred Networks, Inc. / Preferred Elements, Inc.)
- pcyin (California)
- psyguy (Utrecht University)
- SeanLee97 (@4AI @mixedbread-ai @whereisai)
- shkarupa-alex
- todd-cook (Burlingame)
- tonydamage (Bratislava, Slovakia)
- ubuntu733
- XingLuxi (Beijing, China)
- xuanhan863 (Los Angeles, USA)
- ZeweiChu (Google)