PDFTriage: Question Answering over Long, Structured Documents Paper • 2309.08872 • Published Sep 16, 2023 • 54
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20, 2025 • 22
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 5 days ago • 29
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 5 days ago • 29
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 5 days ago • 29