Medical & Healthcare
Commercial OK
TruthfulQA
by truthfulqa
83.4K downloads
278 likes
n<1K
Description
Dataset Card for truthful_qa
Dataset Summary
TruthfulQA is a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. Questions are crafted so that some humans would answer falsely due to a false belief or misconception. To perform well, models must avoid generating false answers learned from imitating human texts.… See the full description on the dataset page: https://huggingface.co/datasets/truthfulqa/truthful_qa.
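As a rough illustration of working with the benchmark described above, the sketch below loads it with the `datasets` library (listed in the tags) and tallies questions per category. The repository id comes from the dataset page URL. The config name `generation`, the split name `validation`, and the column name `category` are assumptions not stated in this card, so check them against the dataset page before relying on them. The helper names `load_truthfulqa`, `count_by_category`, and `main` are hypothetical.

```python
from collections import Counter


def load_truthfulqa(config="generation", split="validation"):
    """Load TruthfulQA from the Hugging Face Hub (requires network access).

    The config and split names are assumptions, not taken from the card text.
    """
    # Imported lazily so count_by_category works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset("truthfulqa/truthful_qa", config, split=split)


def count_by_category(rows, field="category"):
    """Tally rows per category; `field` is an assumed column name."""
    return Counter(row[field] for row in rows)


def main():
    ds = load_truthfulqa()
    # The card reports 817 questions across 38 categories.
    print(len(ds), "questions")
    print(len(count_by_category(ds)), "categories")
```

Calling `main()` downloads the data and prints the two counts, which can be compared with the figures in the summary. `count_by_category` accepts any iterable of dict-like rows, so it can also be tried on a small hand-written sample without network access.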
Tags
task_categories:multiple-choice
task_categories:text-generation
task_categories:question-answering
task_ids:multiple-choice-qa
task_ids:language-modeling
task_ids:open-domain-qa
annotations_creators:expert-generated
language_creators:expert-generated
multilinguality:monolingual
source_datasets:original
language:en
license:apache-2.0
size_categories:1K<n<10K
format:parquet
modality:text
library:datasets
library:pandas
library:mlcroissant
library:polars
arxiv:2109.07958