Science & ResearchCommercial OK

GPQA

by Idavidrein

Silver60
114.2Kdownloads
407likes
n<1K

Description

Dataset Card for GPQA GPQA is a multiple-choice, Q&A dataset of very hard questions written and validated by experts in biology, physics, and chemistry. When attempting questions out of their own domain (e.g., a physicist answers a chemistry question), these experts get only 34% accuracy, despite spending >30m with full access to Google. We request that you do not reveal examples from this dataset in plain text or images online, to reduce the risk of leakage into foundation model… See the full description on the dataset page: https://huggingface.co/datasets/Idavidrein/gpqa.

What can I do with this?

Tags

benchmark:officialbenchmark:eval-yamltask_categories:question-answeringtask_categories:text-generationlanguage:enlicense:cc-by-4.0size_categories:1K<n<10Kformat:csvmodality:tabularmodality:textlibrary:datasetslibrary:pandaslibrary:polarslibrary:mlcroissantarxiv:2311.12022region:usopen-domain-qaopen-book-qamultiple-choice-qa