Benchmarks & Evaluation
PPO, Human Annotated
Copyleft
SQuAD
by rajpurkar
120.8K downloads
359 likes
10K<n<100K
Description
Dataset Card for SQuAD
Dataset Summary
Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable.
SQuAD 1.1 contains 100,000+ question-answer pairs on 500+ articles.
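Since every answer is a span of its passage, a record can be checked by slicing the context at the stored character offset. The sketch below is illustrative only. It assumes the field layout commonly seen for this dataset (`context`, `question`, and an `answers` dict holding parallel `text` and `answer_start` lists). The record itself is hand-written for this example, not taken from SQuAD, and `answer_spans` is a hypothetical helper name.

```python
from typing import Dict, List, Tuple

# Hypothetical record written for illustration; it is NOT a real SQuAD row.
# The field names are an assumption about the dataset's schema.
_context = "The Amazon rainforest covers much of the Amazon basin of South America."
_answer = "South America"

record: Dict = {
    "context": _context,
    "question": "Which continent contains the Amazon basin?",
    "answers": {
        "text": [_answer],
        # Character offset of the answer inside the context.
        "answer_start": [_context.index(_answer)],
    },
}


def answer_spans(rec: Dict) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for each answer, verifying it is a span of the context."""
    context = rec["context"]
    spans = []
    for text, start in zip(rec["answers"]["text"], rec["answers"]["answer_start"]):
        end = start + len(text)
        # An extractive answer must match the passage slice exactly.
        if context[start:end] != text:
            raise ValueError(f"answer {text!r} does not match context[{start}:{end}]")
        spans.append((start, end, text))
    return spans


spans = answer_spans(record)
for start, end, text in spans:
    print(f"[{start}:{end}] {text}")
```

A record whose `answers` lists are empty yields no spans, which is one way an unanswerable question could be represented under the same layout.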
Supported Tasks and Leaderboards
Question Answering.…
See the full description on the dataset page: https://huggingface.co/datasets/rajpurkar/squad
Tags
task_categories:question-answering
task_ids:extractive-qa
annotations_creators:crowdsourced
language_creators:crowdsourced
language_creators:found
multilinguality:monolingual
source_datasets:extended|wikipedia
language:en
license:cc-by-sa-4.0
size_categories:10K<n<100K
format:parquet
modality:text
library:datasets
library:pandas
library:mlcroissant
library:polars
arxiv:1606.05250
region:us