💡HF Papers Live 6: OCR - a AI-Insight Collection

AI-Insight 's Collections

💡HF Papers Live 1: Reinforcement Learning

💡HF Papers Live 2: Code Bench

💡HF Papers Live 3: AI for Science

💡HF Papers Live 4: Multi Modal models

💡HF Papers Live 5: Omni-Modal models

💡HF Papers Live 6: OCR

💡HF Papers Live 6: OCR

updated 21 days ago

tencent/HunyuanOCR

Image-Text-to-Text • 1.0B • Updated about 17 hours ago • 855k • 687
HunyuanOCR Technical Report

Paper • 2511.19575 • Published about 1 month ago • 21
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 13 days ago • 18.5k • 1.42k
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 108
Running on L40S

510

MinerU OCR

📚

510

A data extraction tool to convert PDF to Markdown and JSON
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 139
opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.1M • 300