Benchmarks & Evaluation · PPO · Non-Commercial

GLUE (General Language Understanding Evaluation benchmark)

by nyu-mll

Silver · 63
379.2K downloads
481 likes
10K < n < 100K

Description

Dataset Card for GLUE

Dataset Summary

GLUE, the General Language Understanding Evaluation benchmark (https://gluebenchmark.com/), is a collection of resources for training, evaluating, and analyzing natural language understanding systems.

Supported Tasks and Leaderboards

The leaderboard for the GLUE benchmark can be found at this address. It comprises the following tasks: ax, a manually-curated evaluation dataset for fine-grained analysis of system… See the full description on the dataset page: https://huggingface.co/datasets/nyu-mll/glue.

What can I do with this?
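The dataset is distributed as Parquet and tagged as loadable with the Hugging Face `datasets` library (see the Tags section below). A minimal sketch, assuming a Python environment with `datasets` installed; the `mrpc` configuration is used purely as an example, since GLUE bundles several tasks (e.g. cola, sst2, mrpc, qqp, stsb, mnli, qnli, rte, wnli, ax):

```python
from datasets import load_dataset

# Load one GLUE configuration; "mrpc" is an illustrative choice,
# any of the GLUE task names listed above would work the same way.
glue_mrpc = load_dataset("nyu-mll/glue", "mrpc")

# Inspect the available splits and a single labeled example.
print(glue_mrpc)                     # DatasetDict with train/validation/test splits
print(glue_mrpc["train"][0])         # one sentence pair with its label
print(glue_mrpc["train"].features)   # column names and label classes
```

From there, each task can be used for training or evaluating a text-classification or text-scoring model, matching the task categories listed in the tags.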

Tags

task_categories: text-classification
task_ids: acceptability-classification
task_ids: natural-language-inference
task_ids: semantic-similarity-scoring
task_ids: sentiment-classification
task_ids: text-scoring
annotations_creators: other
language_creators: other
multilinguality: monolingual
source_datasets: original
language: en
license: other
size_categories: 1M<n<10M
format: parquet
modality: tabular
modality: text
library: datasets
library: pandas
library: mlcroissant
library: polars