Issue with apache Tika Extraction for Tabular Column Data in PDF

110 Views Asked by At

I extracted a PDF that has tabular column data using apache Tika, in the result the row data from different columns are getting merged

Before Extracting | Column A | Column B | | -------- | -------- | | 1 | saikiran | | 2 | pavan |

The above Tabular column data is what i am trying to extract

After Extracting the below is the result

saikiran1 pavan2

I am expecting the result to be like i have mentioned down below 1 saikiran 2 pavan

0

There are 0 best solutions below