Image Generation | PPO, Synthetic Data | Commercial OK

DiffusionDB

by poloclub

Silver 55
11.0K downloads
601 likes
n>1T

Description

DiffusionDB is the first large-scale text-to-image prompt dataset. It contains 2 million images generated by Stable Diffusion using prompts and hyperparameters specified by real users. The unprecedented scale and diversity of this human-actuated dataset provide exciting research opportunities in understanding the interplay between prompts and generative models, detecting deepfakes, and designing human-AI interaction tools to help users more easily use these models.

Tags

task_categories:text-to-image, task_categories:image-to-text, task_ids:image-captioning, annotations_creators:no-annotation, language_creators:found, multilinguality:multilingual, source_datasets:original, language:en, license:cc0-1.0, size_categories:n>1T, arxiv:2210.14896, region:us, stable diffusion, prompt engineering, prompts, research paper
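
Most of the tags for this dataset follow a `namespace:value` pattern (for example `license:cc0-1.0`), while a few are free-form keywords. As a minimal sketch of how such a tag list might be organized for filtering or search, the Python snippet below groups tags by namespace. The function name `group_tags` and the `"other"` bucket for free-form tags are illustrative assumptions, not part of any official tooling.

```python
from collections import defaultdict


def group_tags(tags):
    """Group 'namespace:value' tags by namespace.

    Tags without a colon are collected under the key 'other'
    (a hypothetical bucket name chosen for this sketch).
    """
    grouped = defaultdict(list)
    for tag in tags:
        namespace, sep, value = tag.partition(":")
        if sep:
            grouped[namespace].append(value)
        else:
            grouped["other"].append(tag)
    return dict(grouped)


# Tags copied from the list for this dataset.
tags = [
    "task_categories:text-to-image",
    "task_categories:image-to-text",
    "task_ids:image-captioning",
    "annotations_creators:no-annotation",
    "language_creators:found",
    "multilinguality:multilingual",
    "source_datasets:original",
    "language:en",
    "license:cc0-1.0",
    "size_categories:n>1T",
    "arxiv:2210.14896",
    "region:us",
    "stable diffusion",
    "prompt engineering",
    "prompts",
    "research paper",
]

grouped = group_tags(tags)
print(grouped["task_categories"])
print(grouped["license"])
print(grouped["other"])
```

Splitting on the first colon only (via `str.partition`) keeps values intact even if they contain further colons.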