How to load non-default PaddleOCR detection model in a python script?

1.5k Views Asked by At

PaddleOCR seems to support different algorithms/models for text detection, as written in their algorithms overview. However, when trying to load a different model from "DB" (for example "SAST" or "EAST") an error is encountered (I get the following error: ppocr ERROR: det_algorithm must in ['DB']). From the code it seems that only the "DB" model is supported - see here. I have downloaded the models from the links posted in the documentation, is there a way to use them in a python script? I don't intend to use them as illustrated in the tutorial, for example running python3 tools/infer/predict_det.py --image_dir="./doc/imgs_en/img_10.jpg" --det_model_dir="./inference/det_r50_east/" --det_algorithm="EAST".

I have tried something like the following:

from paddleocr import PaddleOCR

ocr = PaddleOCR(use_angle_cls=True, lang="en", det_algorithm="DB")
result = ocr.ocr(img, cls=True)
1

There are 1 best solutions below

0
On

This looks like you have installed the paddleOCR using the whl package. The whl package currently only supports the DB algorithm. For predictions of other models, it is recommended to clone the entire project. You can view this discussion here