CodePretraining, Synthetic DataUnknown

LongBench

by zai-org

Silver57

91.8Kdownloads

175likes

1K<n<10K

Description

LongBench is a comprehensive benchmark for multilingual and multi-task purposes, with the goal to fully measure and evaluate the ability of pre-trained language models to understand long text. This dataset consists of twenty different tasks, covering key long-text application scenarios such as multi-document QA, single-document QA, summarization, few-shot learning, synthetic tasks, and code completion.

LongBench

Description

What can I do with this?

Tags