Apache POI's Word to PDF - Error when converting: Bigger paragraph spacing than source

620 Views Asked by At

I am trying to converting a word document to PDF by using Apache's POI.

However, it is converting the files with bigger line heights / paragraph spacing.

Extract from Word file:

Word file

Exact same extract from Converted PDF:

Converted to PDF

As you can see, each paragraph has a much bigger spacing than originally intended, even causing it to have a page jump.

The code I am using is the following:

        try {
        InputStream doc = new FileInputStream(new File("random.docx"));
        XWPFDocument document = new XWPFDocument(doc);
        PdfOptions options = PdfOptions.create();
        OutputStream out = new FileOutputStream(new File("random.pdf"));
        PdfConverter.getInstance().convert(document, out, options);
        System.out.println("Done");
    } catch (FileNotFoundException ex) {
        System.out.println(ex.getMessage());
    } catch (IOException ex) {

        System.out.println(ex.getMessage());
    }

This is the input file I am using.

Dependencies include the following:

Dependencies

I need a way to eliminate this spacing, or at best reduce it as close as possible to the original way.

Your input (whatever it is) is very much appreciated.

1

There are 1 best solutions below

0
On

I had the exact same problem and found a little work-around. I created a new .docx-file and set the paragraph spacing to 0. Then I just copied my file contents into the new document and somehow it works now.