PdfReader with Itext library, so:
PdfReader reader = new PdfReader();
reader.getPageContent(4)
my page(4) content a image, getPageContent return byte[]
this image is empty in result
PdfReader with Itext library, so:
PdfReader reader = new PdfReader();
reader.getPageContent(4)
my page(4) content a image, getPageContent return byte[]
this image is empty in result
Copyright © 2021 Jogjafile Inc.
When you do
reader.getPageContent(4)
, you get abyte[]
containing PDF syntax. For instance:In no way this is an image. In no way this is content that can be used as a standalone object. For instance:
/F1
refers to a resource, more specifically to a font. Without looking at the/Resources
of the pages of which we extracted the PDF syntax, we have no idea what the PDF string(Hello World)
looks like.The title of your question get image byte in page is wrong. You say my page(4) content a image but that isn't English. Let's assume you mean to say my page 4 contains an image. In that case, the
byte[]
returned bygetPageContent()
will look somewhat like this:In this syntax
q
andQ
save and restore the state. Thecm
operator defines the size and the position of the image: it will be 20 by 20 user units and positioned at x = 36 and y = 786. The actual image is stored in the resources of the page dictionary as an Image XObject. It is added to the page using theDo
operator.If you do not understand a word of what I'm saying in this answer, you should start reading ISO-32000-1 or why not start reading the iText documentation?
See for instance: