Tika Parser is treating .pptx text content as embedded image

152 Views Asked by At

I am using tika parser to validate the content of various file types like .docx, .txt, .pptx and many more others. It seems that even for a normal text content pptx file, when running tika parser on it, its responding saying embedded image in file. Same Autodetect parser is working fine with .docx and other file extensions. Any special changes needs to be done for pptx file here? Thanks

0

There are 0 best solutions below