XLST transform from XML to HTML results "HTML 1.0 version not supported" error

574 Views Asked by At

I'm trying to convert XML to HTMl using Java Saxon library with Jcabi-XML.

XSLT:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">
  <xsl:output method="html" version="5.0"/>

  <xsl:template match="/content">
    <html lang="en">
      <head></head>
      <body></body>
    </html>
  </xsl:template>

</xsl:stylesheet>

XML:

<?xml version="1.0"?>
<content>This text results error</content>

Java:

package com.test;

import java.io.File;
import java.io.FileNotFoundException;

import com.jcabi.xml.XMLDocument;
import com.jcabi.xml.XSLDocument;

public class Main {
    public static void main(String... args) {
        try {
            final XMLDocument xml = new XMLDocument(
                new File("test.xml")
            );
            final XSLDocument xlst = new XSLDocument(
                new File("test.xslt")
            );
            xlst.transform(xml);
        } catch (final FileNotFoundException ex) {
            System.out.println(ex.getMessage());
        }
    }
}

This code results java.lang.IllegalArgumentException: net.sf.saxon.trans.XPathException: Unsupported HTML version: 1.0.

I explicitly set HTML5 version, but it doesn't affect to result. How to avoid this error?

Libraries versions:

  • Jcabi-XML: 0.23.2
  • Saxon-HE: 9.8.0-15
2

There are 2 best solutions below

1
On BEST ANSWER

As far as I can tell from the stack trace (showing

Exception in thread "main" java.lang.IllegalArgumentException: net.sf.saxon.trans.XPathException: Unsupported HTML version: 1.0
    at com.jcabi.xml.XMLDocument.asString(XMLDocument.java

the exception is triggered by the line https://github.com/jcabi/jcabi-xml/blob/0.23.2/src/main/java/com/jcabi/xml/XMLDocument.java#L505 doing trans.setOutputProperty(OutputKeys.VERSION, "1.0");, that seems to be the culprit.

So basically it seems that the library uses Saxon with the JAXP Transformer API but at that line tries to override the version as 1.0 and Saxon indeed doesn't support it for the HTML output it is trying to do based on the html output method being set in the stylesheet.

As you want to use Saxon 9.8 (or perhaps later, given that the supported versions are 10 and 11), I would suggest to look at Saxon's API documented at https://www.saxonica.com/documentation11/index.html#!using-xsl/embedding/s9api-transformation and just use that API to perform the XSLT transformation.

   Processor processor = new Processor(false);
   XsltCompiler compiler = processor.newXsltCompiler();
   XsltExecutable stylesheet = compiler.compile(new StreamSource(new File("test.xslt")));
   Serializer out = processor.newSerializer(new File("result.html"));
   Xslt30Transformer transformer = stylesheet.load30();
   transformer.transform(new StreamSource(new File("test.xml")), out);
9
On

Adding xmlns="http://www.w3.org/1999/xhtml" into xsl:stylesheet tag fixes this error:

Full XSLT file after change:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
  xmlns="http://www.w3.org/1999/xhtml"
  version="2.0">
  <xsl:output method="html" version="5.0"/>

  <xsl:template match="/content">
    <html lang="en">
      <head></head>
      <body></body>
    </html>
  </xsl:template>

</xsl:stylesheet>