Instruction Following | SFT, Pretraining | Commercial OK
LLaVA-OneVision-1.5-Instruct-Data
by mvp-lab
211.1K downloads
68 likes
Description
LLaVA-OneVision-1.5 Instruction Data
Paper | Code
Introduction
This dataset, LLaVA-OneVision-1.5-Instruct, was collected and integrated during the development of LLaVA-OneVision-1.5. LLaVA-OneVision-1.5 is a novel family of Large Multimodal Models (LMMs) that achieve state-of-the-art performance with significantly reduced computational and financial costs. This meticulously curated 22M instruction dataset (LLaVA-OneVision-1.5-Instruct) is part of a comprehensive and… See the full description on the dataset page: https://huggingface.co/datasets/mvp-lab/LLaVA-OneVision-1.5-Instruct-Data.
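A minimal sketch of how one might inspect a few records without downloading all 22M samples, assuming the Hugging Face `datasets` package is installed and network access is available. The repository id is taken from the dataset URL above; the helper name `peek`, and the choice to look at the first listed config and its first split, are illustrative assumptions, since this page does not list the dataset's configs or splits.

```python
# Repository id taken from the dataset page URL above.
REPO_ID = "mvp-lab/LLaVA-OneVision-1.5-Instruct-Data"


def peek(repo_id: str = REPO_ID, n: int = 2):
    """Stream the first n records from the first config and split found.

    Hypothetical helper for illustration only; which config and split
    are "first" depends on how the repository is organised.
    """
    # Imported lazily so this module loads even without `datasets` installed.
    from datasets import get_dataset_config_names, load_dataset

    # Discover the available configs instead of guessing their names.
    configs = get_dataset_config_names(repo_id)
    config = configs[0]  # assumption: any config will do for a quick look

    # streaming=True reads records lazily instead of downloading everything.
    splits = load_dataset(repo_id, config, streaming=True)
    split_name, stream = next(iter(splits.items()))

    return config, split_name, list(stream.take(n))
```

Calling `peek()` returns the chosen config name, the split name, and a short list of records; printing the keys of the first record is a quick way to see the schema before committing to a full download.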
Tags
task_categories:image-text-to-text, language:en, license:apache-2.0, size_categories:10M<n<100M, modality:image, modality:text, arxiv:2509.23661, region:us, multimodal, vision-language-model, lmm, instruction-tuning, pretraining, dataset-collection, vqa, image-captioning, large-language-model