Tesseract installation on Windows


I am currently working on an optical character recognition (OCR) project using Python 2.7 and OpenCV on Windows. I learned that this can be done with Tesseract, but I have not been able to install it on Windows. I searched a lot but could not find a solution. Can anyone tell me whether there is a way to install it on Windows, or whether OCR can be done without it?


There are 3 best solutions below

0
On

UB Mannheim provides pre-built binaries for the latest versions of Tesseract.

From the Tesseract GitHub wiki:

Windows

An unofficial installer for windows for Tesseract 3.05-dev and Tesseract 4.00-dev is available from Tesseract at UB Mannheim. This includes the training tools.

...

To run Tesseract-OCR from any location, you may have to add the directory that contains the Tesseract-OCR binaries to the PATH environment variable. This is probably C:\Program Files\Tesseract-OCR.
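
As a quick check of whether that directory is already on PATH, a minimal sketch along these lines can help. It assumes Python 3; the function name find_tesseract and the fallback folders are illustrative choices, not part of Tesseract or the wiki text.

```python
import os
import shutil

# Assumed default install folders; adjust to wherever the installer put Tesseract.
DEFAULT_DIRS = (
    r"C:\Program Files\Tesseract-OCR",
    r"C:\Program Files (x86)\Tesseract-OCR",
)


def find_tesseract(executable="tesseract", extra_dirs=DEFAULT_DIRS):
    """Return the full path to the Tesseract executable, or None if not found."""
    # shutil.which searches the directories listed in the PATH variable.
    found = shutil.which(executable)
    if found:
        return found
    # Fall back to looking inside the assumed install folders directly.
    for directory in extra_dirs:
        candidate = os.path.join(directory, executable + ".exe")
        if os.path.isfile(candidate):
            return candidate
    return None


print(find_tesseract() or "tesseract not found; add its folder to PATH")
```

If the executable is only found through one of the fallback folders, that folder is the one to add to PATH.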

2
On

Simple steps for installing Tesseract on Windows:

  1. Download the Tesseract installer (.exe) from https://github.com/UB-Mannheim/tesseract/wiki.

  2. Run the installer and install to C:\Program Files (x86)\Tesseract-OCR.

  3. Open a Command Prompt (for example, inside your virtual environment) or an Anaconda Prompt.

  4. Run pip install pytesseract

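  5. To test the installation, type the following at the Python prompt. Printing the module only confirms that the pytesseract package imports; get_tesseract_version() also confirms that the Tesseract executable itself can be found:

    import pytesseract

    print(pytesseract)
    print(pytesseract.get_tesseract_version())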
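
The Tesseract program can also be checked independently of pytesseract by running tesseract --version. The following is a minimal sketch of that check, assuming Python 3.7 or later; the helper name tesseract_version is made up for illustration.

```python
import subprocess


def tesseract_version(command="tesseract"):
    """Return the first line printed by `<command> --version`, or None if it cannot be run."""
    try:
        result = subprocess.run(
            [command, "--version"],
            capture_output=True,
            text=True,
            check=False,
        )
    except OSError:
        # Raised when the executable is missing or its folder is not on PATH.
        return None
    # Check both streams, in case the version is printed to stderr.
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None


print(tesseract_version() or "Tesseract could not be started")
```

If this prints the fallback message while the import test succeeds, the Python package is installed but the Tesseract program is not reachable, which usually means its install folder still has to be added to PATH.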

2
On

To do OCR with Python on Windows, you will need Python and OpenCV, which you already have, as well as Tesseract and the pytesseract Python package.

To install Tesseract OCR for Windows:

  1. Run the installer from UB Mannheim
  2. Configure your installation (choose the installation path and the language data to include)
  3. Add the Tesseract OCR installation directory to your PATH environment variable

To install and use Pytesseract on Windows:

  1. Simply run pip install pytesseract
  2. You will also need Pillow to use pytesseract. Install it with pip install Pillow and import it in your Python script with from PIL import Image.
  3. You will need to add the following line in your code in order to be able to call pytesseract on your machine: pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files\\Tesseract-OCR\\tesseract.exe'
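
Put together, the steps above might look like the following sketch. It assumes Python 3 and the install path quoted in step 3; the helper names tesseract_exe and ocr_image and the file name sample.png are hypothetical, while tesseract_cmd and image_to_string are the pytesseract attributes and functions used for this purpose.

```python
from pathlib import PureWindowsPath

# Assumed install folder, taken from the path in step 3; change it if you installed elsewhere.
INSTALL_DIR = r"C:\Program Files\Tesseract-OCR"


def tesseract_exe(install_dir=INSTALL_DIR):
    """Build the full Windows path to tesseract.exe inside install_dir."""
    return str(PureWindowsPath(install_dir) / "tesseract.exe")


def ocr_image(image_path, install_dir=INSTALL_DIR):
    """Return the text Tesseract recognizes in the image at image_path."""
    # Imported here so the path helper above works even before the packages are installed.
    import pytesseract
    from PIL import Image

    # Tell pytesseract where the Tesseract executable lives.
    pytesseract.pytesseract.tesseract_cmd = tesseract_exe(install_dir)
    with Image.open(image_path) as image:
        return pytesseract.image_to_string(image)
```

Calling print(ocr_image("sample.png")) would then print the recognized text for an image of that name in the current folder. Setting tesseract_cmd is only needed when the Tesseract folder is not on PATH.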

I've given a detailed walkthrough of how to install Tesseract OCR for Windows here if you would like further guidance.