UVDoc

Introduction

The main purpose of text image correction is to carry out geometric transformation on the image to correct the document distortion, inclination, perspective deformation and other problems in the image, so that the subsequent text recognition can be more accurate.

Model CER
UVDoc 0.179

Note: Test data set: docunet benchmark data set.

Model Usage

Install Dependencies

pip install -U paddleocr
pip install -U onnxruntime-gpu

CLI Usage

paddleocr text_image_unwarping -i ./demo.jpg --model_name UVDoc --engine onnxruntime

Python API Usage

from paddleocr import TextImageUnwarping

model = TextImageUnwarping(
    model_name="UVDoc",
    engine="onnxruntime",
)
output = model.predict("./demo.jpg", batch_size=1)
for res in output:
    res.print()
    res.save_to_img(save_path="./output/")
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support