Extracting total price from a shopping bill

488 Views Asked by Krishna Jayachandran At 19 December 2016 at 03:15

I am working on an application where I need to get the net price displayed in any shopping bill from its picture. I have already retrieved the editable text from the bill images using "tesseract ocr" API. Now I need to print only the "grand total amount" from the text. How do I extract only that part( total price) from a whole bill having the item name, quantity and price?

Original Q&A

There are 1 best solutions below

Pang Ho Ming On 19 December 2016 at 03:31

Short answer, I don't think there is a quick/handy method you can call directly.

You need to look into the .hocr file returned from Tesseract(You can google hocr for more info first). The .hocr includes all the bounding box of the text(x, y, width, height, language etc.) then make use of these values, you can determine if words are on the same line (The word 'Total' and the total amount are very likely printed on the same line).

From here you can shortlist the words, add some logical operations (maybe remove all characters/words), then you can get the total value.

ps: My company is working on a similar stuff, but we decided not to use Tesseract, as it is kind of slow and not easy to train (we're dealing with receipts in several languages). We are using Google Vision API.

Hope my answer helps :D

Extracting total price from a shopping bill

There are 1 best solutions below

Related Questions in ALGORITHM

Related Questions in OCR

Related Questions in TESSERACT

Related Questions in IMAGE-RECOGNITION

Trending Questions

Popular # Hahtags

Popular Questions