Role-Play & CharactersCommercial OK
Openpdf-Analysis-Recognition
by prithivMLmods
1.6Kdownloads
4likes
1K<n<10KDescription
Openpdf-Analysis-Recognition
The Openpdf-Analysis-Recognition dataset is curated for tasks related to image-to-text recognition, particularly for scanned document images and OCR (Optical Character Recognition) use cases. It contains over 6,900 images in a structured imagefolder format suitable for training models on document parsing, PDF image understanding, and layout/text extraction tasks.
Attribute
Value
Task
Image-to-Text
Modality
Image
Format
ImageFolder… See the full description on the dataset page: https://huggingface.co/datasets/prithivMLmods/Openpdf-Analysis-Recognition.
What can I do with this?
Tags
task_categories:image-to-textlanguage:enlicense:apache-2.0size_categories:1K<n<10Kformat:imagefoldermodality:imagemodality:documentmodality:textlibrary:datasetslibrary:mlcroissantregion:usdocumentcodeRAW-PDFsocrpdftextdocfinancedocvl