Benchmarks & EvaluationPPOCommercial OK

DarijaMMLU

by MBZUAI-Paris

Bronze40
4.3Kdownloads
6likes
10K<n<100K

Description

Dataset Card for DarijaMMLU Dataset Summary DarijaMMLU is an evaluation benchmark designed to assess large language models' (LLM) performance in Moroccan Darija, a variety of Arabic. It consists of 22,027 multiple-choice questions, translated from selected subsets of the Massive Multitask Language Understanding (MMLU) and ArabicMMLU benchmarks to measure model performance on 44 subjects in Darija. Supported Tasks Task Category: Multiple-choice question… See the full description on the dataset page: https://huggingface.co/datasets/MBZUAI-Paris/DarijaMMLU.

What can I do with this?

Tags

task_categories:question-answeringtask_ids:multiple-choice-qaannotations_creators:machine-generatedlanguage_creators:machine-translatedmultilinguality:monolingualsource_datasets:mmlusource_datasets:arabicmmlulanguage:malicense:mitsize_categories:10K<n<100Kformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsarxiv:2409.17912arxiv:2402.12840region:us