I am having trouble extracting text from image using tesseract-ocr-setup-3.02.02.exe in .NET I have used simple yatt class (yatt / tesseract-ocr-class.cs) from here
I have Downloaded and installed esseract-ocr-setup-3.02.02.exe from here Then use the yatt class in C# like this
TesseractOCR ocr = new TesseractOCR(@"C:\Program Files (x86)\Tesseract-OCR\tesseract.exe");
using (Bitmap bmp = new Bitmap(@"C:\ocr\cap.jpg"))
divOCRText.InnerHtml = ocr.OCRFromBitmap(bmp);
Its extracting text from image but there are many-2 issues. Extracted text has many spelling mistakes. Can somebody guide me what am I doing wrong?
Here is the Image to OCR
Extracted Text (Screenshot)
Here is the testdata install in my PC