Classification & Sentiment, Non-Commercial
PAWS: Paraphrase Adversaries from Word Scrambling
by google-research-datasets
43.0K downloads
38 likes
100K<n<1M
Description
Dataset Card for PAWS: Paraphrase Adversaries from Word Scrambling
Dataset Summary
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification. The dataset has two subsets, one based on Wikipedia and the other one based on the Quora Question Pairs (QQP) dataset.
For further… See the full description on the dataset page: https://huggingface.co/datasets/google-research-datasets/paws.
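To make the point about word order concrete, here is a minimal sketch showing that an order-insensitive bag-of-words similarity cannot separate a word-scrambled pair. The sentence pair and the `bow_cosine` helper are illustrative assumptions written for this example; they are not taken from the dataset files or from any PAWS tooling.

```python
import math
from collections import Counter


def bow_cosine(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors (ignores word order)."""
    ca = Counter(a.lower().split())
    cb = Counter(b.lower().split())
    dot = sum(ca[w] * cb[w] for w in ca)
    # Product of the squared norms, square-rooted once.
    norm = math.sqrt(sum(v * v for v in ca.values()) * sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


# Hypothetical pair: same words, different order, different meaning.
s1 = "Flights from New York to Florida"
s2 = "Flights from Florida to New York"

score = bow_cosine(s1, s2)
same_order = s1.lower().split() == s2.lower().split()

print(f"bag-of-words cosine: {score:.2f}, same word order: {same_order}")
```

The two sentences share every word, so the bag-of-words score treats them as identical even though they are not paraphrases. A model has to use structure and word order to label such pairs correctly.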
Tags
task_categories:text-classification
task_ids:semantic-similarity-classification
task_ids:semantic-similarity-scoring
task_ids:text-scoring
task_ids:multi-input-text-classification
annotations_creators:expert-generated
annotations_creators:machine-generated
language_creators:machine-generated
multilinguality:monolingual
source_datasets:original
language:en
license:other
size_categories:100K<n<1M
format:parquet
modality:text
library:datasets
library:pandas
library:mlcroissant
library:polars
arxiv:1904.01130