Tesseract train specific characters

288 Views Asked by At

Is it possible to increase the accuracy of specific characters in an existing traineddata model?

For example:

  • The number 3 will often be detected as 5.
  • The number 5 will often be detected as 8.
  • The W will often be detected as V.

Makes it sense to fine-tuning the eng.traineddata only with these characters, to increase the accuracy of these specific characters?

I tried to train the eng.traineddata model with line based image data and the accuracy for the characters in the example increased significant. But with the new model, letters that were well recognized in the previous model are recognized incorrectly. (Overfitting?)

0

There are 0 best solutions below