Benchmarks & EvaluationPPOCommercial OK

DarijaMMLU

by MBZUAI-Paris

Bronze40

4.3Kdownloads

6likes

10K<n<100K

Description

Dataset Card for DarijaMMLU Dataset Summary DarijaMMLU is an evaluation benchmark designed to assess large language models' (LLM) performance in Moroccan Darija, a variety of Arabic. It consists of 22,027 multiple-choice questions, translated from selected subsets of the Massive Multitask Language Understanding (MMLU) and ArabicMMLU benchmarks to measure model performance on 44 subjects in Darija. Supported Tasks Task Category: Multiple-choice question… See the full description on the dataset page: https://huggingface.co/datasets/MBZUAI-Paris/DarijaMMLU.

DarijaMMLU

Description

What can I do with this?

Tags