Description
[!NOTE]
IMPORTANT: Please help us protect the integrity of this benchmark by not publicly sharing, re-uploading, or distributing the dataset.
Humanity's Last Exam
๐ Website | ๐ Paper | GitHub
Center for AI Safety & Scale AI
Humanity's Last Exam (HLE) is a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. Humanity's Last Exam consists of 2,500 questions across dozens ofโฆ See the full description on the dataset page: https://huggingface.co/datasets/cais/hle.
What can I do with this?
Tags
benchmark:officiallicense:mitsize_categories:1K<n<10Kformat:parquetmodality:imagemodality:textlibrary:datasetslibrary:pandaslibrary:polarslibrary:mlcroissantregion:us