Description
TextPecker-1.5M: A Dataset for Training and Evaluating TextPecker
This repository contains the TextPecker-1.5M dataset, a new benchmark proposed in the paper "TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering".
Code and Project Page
The official implementation and project details for TextPecker and the TextPecker-1.5M dataset can be found in the GitHub repository:
https://github.com/CIawevy/TextPecker
Sample Usage
You… See the full description on the dataset page: https://huggingface.co/datasets/CIawevy/TextPecker-1.5M.
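Since the usage text above is truncated here, the following is only a minimal sketch of one way to inspect the data, not the authors' documented procedure. It assumes the Hugging Face `datasets` library is installed (the page tags list it as a supported library) and that a split named "train" exists and no configuration name is required; both are assumptions, so check the dataset page for the actual splits and configurations. The helper names `peek` and `open_stream` are hypothetical.

```python
from itertools import islice

# Dataset ID taken from the dataset page URL.
REPO_ID = "CIawevy/TextPecker-1.5M"


def peek(stream, n=3):
    """Return the first n items of any iterable as a list."""
    return list(islice(stream, n))


def open_stream(repo_id=REPO_ID, split="train"):
    """Open the dataset in streaming mode.

    The split name "train" is an assumption, not confirmed by the source.
    """
    # Imported lazily so that `peek` stays usable without `datasets` installed.
    from datasets import load_dataset

    # streaming=True iterates over the parquet shards on demand instead of
    # downloading the whole dataset (1M to 10M rows) up front.
    return load_dataset(repo_id, split=split, streaming=True)
```

Calling `peek(open_stream())` would then return the first three records as dictionaries, and printing their keys shows the column names without assuming them in advance. Streaming is chosen here because the dataset falls in the 1M to 10M size category, so a full download is a poor first step for a quick look.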
Tags
task_categories:image-to-text, license:apache-2.0, size_categories:1M<n<10M, format:parquet, format:optimized-parquet, modality:image, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, arxiv:2602.20903, region:us