Image GenerationPPO, Synthetic DataCommercial OK
DiffusionDB
by poloclub
11.0Kdownloads
601likes
n>1TDescription
DiffusionDB is the first large-scale text-to-image prompt dataset. It contains 2
million images generated by Stable Diffusion using prompts and hyperparameters
specified by real users. The unprecedented scale and diversity of this
human-actuated dataset provide exciting research opportunities in understanding
the interplay between prompts and generative models, detecting deepfakes, and
designing human-AI interaction tools to help users more easily use these models.
What can I do with this?
Tags
task_categories:text-to-imagetask_categories:image-to-texttask_ids:image-captioningannotations_creators:no-annotationlanguage_creators:foundmultilinguality:multilingualsource_datasets:originallanguage:enlicense:cc0-1.0size_categories:n>1Tarxiv:2210.14896region:usstable diffusionprompt engineeringpromptsresearch paper