Math & ReasoningORPO, PretrainingNon-Commercial

Natural Reasoning

by facebook

Silver50

1.5Kdownloads

555likes

1M<n<10M

Description

NaturalReasoning is a large-scale dataset for general reasoning tasks. It consists of high-quality challenging reasoning questions backtranslated from pretraining corpora DCLM and FineMath. The questions have been deduplicated and decontaminated from popular reasoning benchmarks including MATH, GPQA, MMLU-Pro, MMLU-STEM. For each question, we extract the reference final answer from the original document from the pretraining corpora if possible. We also provide a model-generated response from… See the full description on the dataset page: https://huggingface.co/datasets/facebook/natural_reasoning.

Natural Reasoning

Description

What can I do with this?

Tags