Medical & HealthcareCommercial OK

TruthfulQA

by truthfulqa

Silver58
83.4Kdownloads
278likes
n<1K

Description

Dataset Card for truthful_qa Dataset Summary TruthfulQA is a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. Questions are crafted so that some humans would answer falsely due to a false belief or misconception. To perform well, models must avoid generating false answers learned from imitating human texts.… See the full description on the dataset page: https://huggingface.co/datasets/truthfulqa/truthful_qa.

What can I do with this?

Tags

task_categories:multiple-choicetask_categories:text-generationtask_categories:question-answeringtask_ids:multiple-choice-qatask_ids:language-modelingtask_ids:open-domain-qaannotations_creators:expert-generatedlanguage_creators:expert-generatedmultilinguality:monolingualsource_datasets:originallanguage:enlicense:apache-2.0size_categories:1K<n<10Kformat:parquetmodality:textlibrary:datasetslibrary:pandaslibrary:mlcroissantlibrary:polarsarxiv:2109.07958