I keep getting this error while using Tabula on python.
I've gone through EVERY stackoverflow question related to this and blogs as well.
My JDK JRE is up to date.
java version "1.8.0_161" Java(TM) SE Runtime Environment (build 1.8.0_161-b12) Java HotSpot(TM) 64-Bit Server VM (build 25.161-b12, mixed mode)
My path is correctly defined in Environment variables.
Python version running on Anaconda.
Python 3.6.5 |Anaconda, Inc
df = tabula.read_pdf("C:\XXXXX\PDFExtractor\Test.pdf")
I've tried with encoding as well.
Tabula CalledProcessError: Command '['java', '-jar', 'C:\\Users\\xxxxx\\AppData\\Local\\Continuum\\anaconda3\\lib\\site-packages\\tabula\\tabula-1.0.1-jar-with-dependencies.jar', '--pages', '1', '--guess', 'C:\\Users\\xxxxxx\\PDFExtractor\\Test.pdf']' returned non-zero exit status 2.
Appreciate the help.
You need to escape backslashes or use a raw string:
or
otherwise your file is seen as
C:XXXXXPDFExtractorTest.pdf