如何使用 Java 和 Xerces 解析符合 1.1 规范的 XML?

2023-01-13Java开发问题

本文介绍了如何使用 Java 和 Xerces 解析符合 1.1 规范的 XML?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

我正在尝试解析包含符合 XML 1.1 规范的 XML 内容的字符串.XML 包含在 XML 1.0 规范中不允许但在 XML 1.1 规范中允许的字符引用(字符引用转换为 U+0001–U+001F 范围内的 Unicode 字符).

I'm trying to parse a String which contains XML content which conforms to the XML 1.1 spec. The XML contains character references which are not allowed in the XML 1.0 spec but which are allowed in the XML 1.1 spec (character references which translate to Unicode characters in the range U+0001–U+001F).

根据 Xerces2 网站，Xerces2 解析器支持解析 XML 1.1 文档.但是，我不知道如何告诉它我们尝试解析的 XML 包含符合 1.1 的 XML.

According the Xerces2 website, the Xerces2 parser supports parsing XML 1.1 documents. However, I cannot figure out how to tell it the XML we are trying to parse contains 1.1-compliant XML.

我正在使用 DocumentBuilder 来解析 XML(类似这样):

I'm using a DocumentBuilder to parse the XML (something like this):

public Element parseString(String xmlString) {
    try {
          DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
          DocumentBuilder documentBuilder = dbf.newDocumentBuilder();

          InputSource source = new InputSource(new StringReader(xmlString));

      // Throws org.xml.sax.SAXParseException becuase of the invalid character refs
          Document doc = documentBuilder.parse(source);

          return doc.getDocumentElement();

    } catch (ParserConfigurationException pce) {
          // Handle the error
    } catch (SAXException se) {
          // Handle the error
    } catch (IOException ioe) {
          // Handle the error
    }
}

我已尝试设置 XML 标头以指示 XML 符合 1.1 规范...

I've tried setting the XML header to indicate the XML conforms to the 1.1 spec...

xmlString = "<?xml version="1.1" encoding="UTF-8" ?>" + xmlString;

...但仍被解析为 1.0 XML(仍会生成无效字符引用异常).

...but it is still parsed as 1.0 XML (still generates the invalid character reference exceptions).

如何配置 Xerces 解析器以将 XML 解析为 XML 1.1?是否有其他解析器可以为 XML 1.1 提供更好的支持?

How can I configure the Xerces parser to parse the XML as XML 1.1? Is there an alternative parser which provides better support for XML 1.1?

相关推荐

如何使用 JAVA 向 COM PORT 发送数据?

如何使报表页面方向更改为“rtl"?

在 Eclipse 项目中使用西里尔文 .properties 文件

有没有办法在 Java 中检测 RTL 语言?

如何在 Java 中从 DB 加载资源包消息?

如何更改 Java 中的默认语言环境设置以使其保持一致?

热门文章

热门精品源码

最新VIP资源