Instruction FollowingSFT, PretrainingCommercial OK

LLaVA-OneVision-1.5-Instruct-Data

by mvp-lab

Silver56
211.1Kdownloads
68likes

Description

LLaVA-OneVision-1.5 Instruction Data Paper | Code πŸ“Œ Introduction This dataset, LLaVA-OneVision-1.5-Instruct, was collected and integrated during the development of LLaVA-OneVision-1.5. LLaVA-OneVision-1.5 is a novel family of Large Multimodal Models (LMMs) that achieve state-of-the-art performance with significantly reduced computational and financial costs. This meticulously curated 22M instruction dataset (LLaVA-OneVision-1.5-Instruct) is part of a comprehensive and… See the full description on the dataset page: https://huggingface.co/datasets/mvp-lab/LLaVA-OneVision-1.5-Instruct-Data.

What can I do with this?

Tags

task_categories:image-text-to-textlanguage:enlicense:apache-2.0size_categories:10M<n<100Mmodality:imagemodality:textarxiv:2509.23661region:usmultimodalvision-language-modellmminstruction-tuningpretrainingdataset-collectionvqaimage-captioninglarge-language-model