Classification & SentimentNon-Commercial

PAWS: Paraphrase Adversaries from Word Scrambling

by google-research-datasets

Silver51
43.0Kdownloads
38likes
100K<n<1M

Description

Dataset Card for PAWS: Paraphrase Adversaries from Word Scrambling Dataset Summary PAWS: Paraphrase Adversaries from Word Scrambling This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification. The dataset has two subsets, one based on Wikipedia and the other one based on the Quora Question Pairs (QQP) dataset. For further… See the full description on the dataset page: https://huggingface.co/datasets/google-research-datasets/paws.

What can I do with this?

Tags

task_categories:text-classificationtask_ids:semantic-similarity-classificationtask_ids:semantic-similarity-scoringtask_ids:text-scoringtask_ids:multi-input-text-classificationannotations_creators:expert-generatedannotations_creators:machine-generatedlanguage_creators:machine-generatedmultilinguality:monolingualsource_datasets:originallanguage:enlicense:othersize_categories:100K<n<1Mformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsarxiv:1904.01130