Question AnsweringHuman AnnotatedCopyleft

SQuAD2.0

by rajpurkar

Silver55
33.7Kdownloads
244likes
100K<n<1M

Description

Dataset Card for SQuAD 2.0 Dataset Summary Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable. SQuAD 2.0 combines the 100,000 questions in SQuAD1.1 with over 50,000 unanswerable questions written adversarially by crowdworkers… See the full description on the dataset page: https://huggingface.co/datasets/rajpurkar/squad_v2.

What can I do with this?

Tags

task_categories:question-answeringtask_ids:open-domain-qatask_ids:extractive-qaannotations_creators:crowdsourcedlanguage_creators:crowdsourcedmultilinguality:monolingualsource_datasets:originallanguage:enlicense:cc-by-sa-4.0size_categories:100K<n<1Mformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsarxiv:1806.03822arxiv:1606.05250region:us