AI & ML interests
None defined yet.
Recent Activity
Papers
GutenOCR: A Grounded Vision-Language Front-End for Documents
PubMed-OCR: PMC Open Access OCR Annotations
Data and models for optical character recognition
-
PubMed-OCR: PMC Open Access OCR Annotations
Paper • 2601.11425 • Published • 11 -
GutenOCR: A Grounded Vision-Language Front-End for Documents
Paper • 2601.14490 • Published • 36 -
rootsautomation/TABMEpp
Viewer • Updated • 122k • 182 • 5 -
rootsautomation/pubmed-ocr
Viewer • Updated • 1.55M • 2.63k • 61
Data and models for optical character recognition
-
PubMed-OCR: PMC Open Access OCR Annotations
Paper • 2601.11425 • Published • 11 -
GutenOCR: A Grounded Vision-Language Front-End for Documents
Paper • 2601.14490 • Published • 36 -
rootsautomation/TABMEpp
Viewer • Updated • 122k • 182 • 5 -
rootsautomation/pubmed-ocr
Viewer • Updated • 1.55M • 2.63k • 61
A collection of RICO screenshot-based datasets for training and evaluation. We've attempted to compile all surrounding metadata for the relevant tasks