Benchmarks & Evaluation | PPO | Commercial OK
DarijaMMLU
by MBZUAI-Paris
4.3K downloads
6 likes
Size: 10K<n<100K
Description
Dataset Card for DarijaMMLU
Dataset Summary
DarijaMMLU is an evaluation benchmark for assessing the performance of large language models (LLMs) in Moroccan Darija, a variety of Arabic. It consists of 22,027 multiple-choice questions covering 44 subjects, translated into Darija from selected subsets of the Massive Multitask Language Understanding (MMLU) and ArabicMMLU benchmarks.
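Because the questions are spread over 44 subjects, results on a benchmark like this are often reported both per subject and as an overall average. The following is a minimal sketch of that bookkeeping, not the scoring code used by the dataset authors. The record layout `(subject, predicted_index, gold_index)` and the function names are assumptions made for illustration.

```python
from collections import defaultdict


def accuracy_by_subject(records):
    """Return {subject: accuracy} for (subject, predicted, gold) tuples.

    The tuple layout is a hypothetical convention for this sketch,
    not a format defined by the dataset.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for subject, predicted, gold in records:
        total[subject] += 1
        correct[subject] += int(predicted == gold)
    return {subject: correct[subject] / total[subject] for subject in total}


def summarize(records):
    """Compute per-subject, micro-averaged, and macro-averaged accuracy."""
    records = list(records)
    if not records:
        raise ValueError("no records to score")
    per_subject = accuracy_by_subject(records)
    # Micro average: every question counts equally.
    micro = sum(int(p == g) for _, p, g in records) / len(records)
    # Macro average: every subject counts equally, whatever its size.
    macro = sum(per_subject.values()) / len(per_subject)
    return {"per_subject": per_subject, "micro": micro, "macro": macro}
```

The two averages differ when subjects have unequal numbers of questions, so it helps to state which one a reported score uses.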
Supported Tasks
Task Category: Multiple-choice question… See the full description on the dataset page: https://huggingface.co/datasets/MBZUAI-Paris/DarijaMMLU.
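A multiple-choice evaluation typically turns each row into a lettered prompt and maps the model's reply back to a choice index. The sketch below shows one way to do that. The prompt layout, the helper names, and the `subject` and `split` arguments to the loader are assumptions; the actual column names, configuration names, and splits should be checked on the dataset page. Only the repository ID `MBZUAI-Paris/DarijaMMLU` comes from the listing above.

```python
import string


def format_question(question, choices):
    """Render a question and its options as a lettered prompt (A., B., ...)."""
    if not 2 <= len(choices) <= len(string.ascii_uppercase):
        raise ValueError("expected between 2 and 26 choices")
    lines = [question.strip()]
    for letter, choice in zip(string.ascii_uppercase, choices):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer:")
    return "\n".join(lines)


def parse_answer(reply, num_choices):
    """Map a reply that starts with a valid letter to a 0-based index.

    Returns None when the first non-space character is not a valid letter.
    """
    valid = string.ascii_uppercase[:num_choices]
    stripped = reply.strip().upper()
    if stripped and stripped[0] in valid:
        return valid.index(stripped[0])
    return None


def load_subject(subject, split):
    """Load one subject with the Hugging Face `datasets` library.

    Assumption: subjects are exposed as configurations and `split`
    names an existing split; verify both on the dataset page.
    """
    from datasets import load_dataset  # imported lazily; needs network access

    return load_dataset("MBZUAI-Paris/DarijaMMLU", subject, split=split)
```

Accepting only a leading letter keeps parsing strict: a reply such as "The answer is B" is treated as unparsed rather than guessed at.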
Tags
task_categories:question-answering
task_ids:multiple-choice-qa
annotations_creators:machine-generated
language_creators:machine-translated
multilinguality:monolingual
source_datasets:mmlu
source_datasets:arabicmmlu
language:ma
license:mit
size_categories:10K<n<100K
format:parquet
modality:text
library:datasets
library:pandas
library:mlcroissant
library:polars
arxiv:2409.17912
arxiv:2402.12840
region:us