Hi I want to check fully embedding and subset embedding of fonts in PDF using PDFBOX. I have tried using the following logic to check:
private boolean IsEmbedded(Map<String, PDFont> fontsMap, Set<String> keys) {
for(String key:keys) {
PDFont font = fontsMap.get(key);
PDFontDescriptor fontDescriptor = font.getFontDescriptor();
if(null != fontDescriptor && fontDescriptor instanceof PDFontDescriptorDictionary){
PDFontDescriptorDictionary fontDescriptorDictionary = (PDFontDescriptorDictionary)fontDescriptor;
if(null == fontDescriptorDictionary.getFontFile() && null == fontDescriptorDictionary.getFontFile2() && null == fontDescriptorDictionary.getFontFile3())
return false;
}
}
return true;
}
But seems I could not able to find out how to differentiate between Fully Embedding or sub-set embedding. Can anyone please give me the answer?
To quote the PDF specification ISO 32000-1 on font subsets (section 9.6.4):
In a PDF following up to this requirement ("shall", so it really is a requirement) you, therefore, can recognize subset fonts by their name.
Keep in mind, though, that outside of PDFs you can derive a font from another one by including only selected glyphs. This essentially creates a subset font but a PDF creating software making use of it may not notice that fact and name it as a fully embedded font. So in essence you can never know for sure.