Question Answering, Human Annotated, Copyleft
SQuAD2.0
by rajpurkar
33.7K downloads
244 likes
100K<n<1M
Description
Dataset Card for SQuAD 2.0
Dataset Summary
The Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset consisting of questions posed by crowdworkers on a set of Wikipedia articles. The answer to each question is either a segment of text (a span) from the corresponding reading passage, or the question is unanswerable.
SQuAD 2.0 combines the 100,000 questions in SQuAD1.1 with over 50,000 unanswerable questions written adversarially by crowdworkers… See the full description on the dataset page: https://huggingface.co/datasets/rajpurkar/squad_v2.
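A minimal sketch of how the two kinds of question described above can be told apart in code. It assumes the record layout used by the Hugging Face `datasets` version of this dataset (`context`, `question`, and an `answers` dict with parallel `text` and `answer_start` lists, both empty for unanswerable questions). The two records and the helper `extract_answer` are hypothetical, written by hand for illustration and not taken from the dataset.

```python
from typing import Optional

# Hand-written passage for illustration only (not from SQuAD).
CONTEXT = "Paris is the capital of France. It lies on the Seine."

# Hypothetical answerable record: the answer is a span of CONTEXT,
# located by its character offset.
answerable = {
    "context": CONTEXT,
    "question": "Which river does Paris lie on?",
    "answers": {
        "text": ["the Seine"],
        "answer_start": [CONTEXT.index("the Seine")],
    },
}

# Hypothetical unanswerable record: the passage does not say,
# so both answer lists are assumed to be empty.
unanswerable = {
    "context": CONTEXT,
    "question": "What is the population of Paris?",
    "answers": {"text": [], "answer_start": []},
}


def extract_answer(example: dict) -> Optional[str]:
    """Return the first answer span, or None if the question is unanswerable."""
    answers = example["answers"]
    if not answers["text"]:
        return None
    text = answers["text"][0]
    start = answers["answer_start"][0]
    # Slice the span out of the passage and check it matches the stored text.
    span = example["context"][start:start + len(text)]
    if span != text:
        raise ValueError("answer_start does not point at the answer text")
    return span


if __name__ == "__main__":
    for record in (answerable, unanswerable):
        print(record["question"], "->", extract_answer(record))
```

With network access, real records could presumably be fetched with `datasets.load_dataset("rajpurkar/squad_v2")` and passed to the same helper, provided the fields match the layout assumed here.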
Tags
task_categories:question-answering
task_ids:open-domain-qa
task_ids:extractive-qa
annotations_creators:crowdsourced
language_creators:crowdsourced
multilinguality:monolingual
source_datasets:original
language:en
license:cc-by-sa-4.0
size_categories:100K<n<1M
format:parquet
modality:text
library:datasets
library:pandas
library:mlcroissant
library:polars
arxiv:1806.03822
arxiv:1606.05250
region:us