Science & ResearchCommercial OK
GPQA
by Idavidrein
114.2Kdownloads
407likes
n<1KDescription
Dataset Card for GPQA
GPQA is a multiple-choice, Q&A dataset of very hard questions written and validated by experts in biology, physics, and chemistry. When attempting questions out of their own domain (e.g., a physicist answers a chemistry question), these experts get only 34% accuracy, despite spending >30m with full access to Google.
We request that you do not reveal examples from this dataset in plain text or images online, to reduce the risk of leakage into foundation model… See the full description on the dataset page: https://huggingface.co/datasets/Idavidrein/gpqa.
What can I do with this?
Tags
benchmark:officialbenchmark:eval-yamltask_categories:question-answeringtask_categories:text-generationlanguage:enlicense:cc-by-4.0size_categories:1K<n<10Kformat:csvmodality:tabularmodality:textlibrary:datasetslibrary:pandaslibrary:polarslibrary:mlcroissantarxiv:2311.12022region:usopen-domain-qaopen-book-qamultiple-choice-qa