google-research-datasets/paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification.
PythonNOASSERTION
Stargazers
- AdrienBenamira
- APodolskiyMoscow
- BinHeRunningTencent
- BusbyActualLos Angeles
- clungztaTraversal Labs
- DevSinghSachanMontreal
- f-dx
- fabiofumarolaDataToKnowledge
- fly51flyPRIS
- gabrielStanovskyAI2 and UW-NLP
- hhexiy
- hmishra2250Byjus
- huazai5201995fusidic
- ink-pad
- jennhuHarvard University
- jiangfeng1124AWS AI Lab; MIT-CSAIL
- josephkirkVietNam
- kk-machine-learning
- manojsukhavasiBangalore
- mhany90
- nazmiasri95@MoneyLion
- nth-attemptNew York
- OneplusAlibaba DAMO Academy
- SmeritySan Francisco, California
- sopankhosla
- spandanagellaUniversity of Edinburgh
- stefan-itBavarian Oberland, Germany
- sumehtaSan Francisco, CA
- szhaAmazon AGI
- tahakucukkatirciIstanbul
- xgk
- xuanhan863Los Angeles, USA
- yuanzhGoogle AI, Language
- yyht
- zhangmeishan16:27:ac:a5:76:28:2d:36:63:1b:56:4d:eb:df:a6:48
- zxybazhOctoML