Image Recognition · Unknown
the_cauldron
by HuggingFaceM4
58.1K downloads
522 likes
Description
Dataset Card for The Cauldron
Dataset description
The Cauldron is part of the Idefics2 release.
It is a collection of 50 vision-language datasets (training sets only) that were used to fine-tune the vision-language model Idefics2.
Load the dataset
To load the dataset, install the datasets library with pip install datasets. Then,
from datasets import load_dataset
ds = load_dataset("HuggingFaceM4/the_cauldron", "ai2d")
to download and load the…
See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceM4/the_cauldron.
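The loading call above handles one subset at a time. As a minimal sketch, the same call can be wrapped to fetch several subsets and report how many rows each one has. The helper names load_subsets and row_counts and the loader parameter are hypothetical and exist only for this illustration. The split name "train" is an assumption based on the card's statement that only training sets are included.

```python
from typing import Any, Callable, Dict, Iterable, Optional

REPO_ID = "HuggingFaceM4/the_cauldron"


def load_subsets(
    names: Iterable[str],
    loader: Optional[Callable[..., Any]] = None,
    split: str = "train",  # assumption: every subset exposes a "train" split
) -> Dict[str, Any]:
    """Load each named subset (config) of The Cauldron into a dict.

    `loader` is a hypothetical hook so the function can be exercised
    without network access; by default it is datasets.load_dataset.
    """
    if loader is None:
        # Imported lazily so the module can be imported without `datasets`.
        from datasets import load_dataset
        loader = load_dataset
    return {name: loader(REPO_ID, name, split=split) for name in names}


def row_counts(subsets: Dict[str, Any]) -> Dict[str, int]:
    """Return the number of rows in each loaded subset."""
    return {name: len(ds) for name, ds in subsets.items()}


# Example usage (downloads data, so it is left commented out):
# subsets = load_subsets(["ai2d"])
# print(row_counts(subsets))
```

Only the ai2d subset name is taken from the card; any other subset name passed to load_subsets should be checked against the dataset page first.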
Tags
size_categories:1M<n<10M
format:parquet
modality:image
modality:text
library:datasets
library:dask
library:mlcroissant
library:polars
arxiv:1603.07396
arxiv:2206.01718
arxiv:2208.05358
arxiv:1612.06890
arxiv:2310.00367
arxiv:1710.07300
arxiv:2312.12241
arxiv:1912.03098
arxiv:2211.08545
arxiv:2306.05425
arxiv:1709.00103
arxiv:2003.12462