I am working on an application where I need to get the net price displayed in any shopping bill from its picture. I have already retrieved the editable text from the bill images using "tesseract ocr" API. Now I need to print only the "grand total amount" from the text. How do I extract only that part( total price) from a whole bill having the item name, quantity and price?
Extracting total price from a shopping bill
482 Views Asked by Krishna Jayachandran At
1
There are 1 best solutions below
Related Questions in ALGORITHM
- Two different numbers in an array which their sum equals to a given value
- Given two arrays of positive numbers, re-arrange them to form a resulting array, resulting array contains the elements in the same given sequence
- Time complexity of the algorithm?
- Find a MST in O(V+E) Time in a Graph
- Why k and l for LSH used for approximate nearest neighbours?
- How to count the number of ways of choosing of k equal substrings from a List L(the list of All Substrings)
- Issues with reversing the linkedlist
- Finding first non-repeating number in integer array
- Finding average of an array
- How to check for duplicates with less time in a list over 9000 elements by python
- How to pick a number based on probability?
- Insertion Sort help in javascript -- Khan Academy
- Developing a Checkers (Draughts) engine, how to begin?
- Can Bellman-Ford algorithm be used to find shorthest path on a graph with only positive edges?
- What is the function for the KMP Failure Algorithm?
Related Questions in OCR
- Tesseract - The specified module could not be found
- Linux OCR of LCD characters
- Calculating equation from image in Java
- Python Tesseract OCR training to a specific list of words
- How correct send encoded base64 image to nodeJS and get response in Java
- OCR serial number CRC, check algorithm
- How to extract a specific text from an image
- Can Tesseract be set to OCR only (no image modification) when producing a PDF?
- OCR on text stamped into metal plate
- Arabic number recognization
- Tesseract Assert failed trainingsampleset.cpp line 622 with mftraining
- Camera Preview and OCR
- Getting the ocrad.js demo to work?
- What is the image type in MNIST dataset?
- Issue reading Bold fonts with Tesseract API / Tess4j
Related Questions in TESSERACT
- Tesseract - The specified module could not be found
- CIDetector to filter rectangle and get cropped image
- Python Tesseract OCR training to a specific list of words
- How to extract a specific text from an image
- Can Tesseract be set to OCR only (no image modification) when producing a PDF?
- OCR on text stamped into metal plate
- Tesseract adaptive training
- Tesseract Assert failed trainingsampleset.cpp line 622 with mftraining
- Camera Preview and OCR
- [python]has no attribute 'TessBaseAPI'
- Tesseract: An alternative to building a source in linux remote host?
- Issue reading Bold fonts with Tesseract API / Tess4j
- Convert hOCR to HTML table
- After succesfully installing tesseract_ocr in Ubuntu it shows no mudule named tesseract_ocr
- Javacpp: liblept.4.dylib library not loaded
Related Questions in IMAGE-RECOGNITION
- Calculating equation from image in Java
- Shazam for video
- How to recognize the shape of traffic sign?
- What is the image type in MNIST dataset?
- Unexpected Image Display Behavior in Ipython Notebook After Manipulating with Numpy
- Quickly recognize different scanned forms with OpenCV and find homography transformation
- Tesseract: Recognition simple numbers in C#
- Extracting total price from a shopping bill
- Tesseract - How to extract text from the image for the input coordinates?
- Detecting a baby's face using OpenCV
- Training model to recognize one specific object (or scene)
- iOs Image Recognition, Categorizing and matching pattern
- How to make unrecognized faces in opencv2 be labeled "unknown"?
- Running a TensorFlow Image Recognition API to search for an object
- FindFailed with error message implying find success in SikuliX?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Short answer, I don't think there is a quick/handy method you can call directly.
You need to look into the
.hocrfile returned fromTesseract(You can google hocr for more info first). The.hocrincludes all the bounding box of the text(x, y, width, height, languageetc.) then make use of these values, you can determine if words are on the same line (The word 'Total' and the total amount are very likely printed on the same line).From here you can shortlist the words, add some logical operations (maybe remove all characters/words), then you can get the total value.
ps: My company is working on a similar stuff, but we decided not to use Tesseract, as it is kind of slow and not easy to train (we're dealing with receipts in several languages). We are using Google Vision API.
Hope my answer helps :D