FactoryAdapter loadDocument() modifies XMLReader and breaks it
stevedlawrence opened this issue · 4 comments
We have an XMLReader
(via Xerces used for validation) that breaks if setEntityResolver()
is called. Instead, the EntityResolver
must be set by calling setProperty(..., ...)
with the resolver object as the value. If setEntityResolver
is called, this XMLReader
sees the loaded XML as invalid and fails to parse. Note that in this XMLReader
, getEntityResolver() returns null when the entity resolver is set via
setProperty`.
Unfortunately, the loadDocument
function in the FactoryAdapter
sets an entity resolver if getEnitytResolver
returns null:
This causes our XMLReader to fail when used with a FactoryAdapter
. It looks like this was added in commit 1cc9ea8 and moved to the FactoryAdapter in subsequent changes, so wasn't an issue prior to v2.2.0 as best I can tell.
Can setting the entity resolver (and maybe other XMLReader
changes made in loadDocument
) be made configurable so we can disable it if needed?
Or maybe alternatively, change the def reader
in XMLLoader.scala so that it makes these changes only to the XMLReader it creates? For example
def reader: XMLReader = {
val r = parser.getXMLReader
r.setContentHandler(adapter)
r.setDTDHandler(adapter)
r.setEnityResolver(adapter)
r.setProperty("http://xml.org/sax/properties/lexical-handler", adapter)
...
r
}
This way if we override the reader to provide a custom one we avoid potentially breaking changes, and have full control over the reader, with the understanding that we may need to call some or all of the other functions ourselves to work with a FactoryAdapter.
We have an
XMLReader
(via Xerces used for validation) that breaks ifsetEntityResolver()
is called.
Can setting the entity resolver (and maybe otherXMLReader
changes made inloadDocument
) be made configurable so we can disable it if needed?
Or maybe alternatively, change the def reader in XMLLoader.scala so that it makes these changes only to the XMLReader it creates?
I do not think that making setting various SAX handlers on the XMLReader configurable or optional, let alone disabling it altogether and relying on the caller to figure out what methods need to be called and actually calling them is a good idea: Scala XML should be able to install the SAX handlers it needs (or, in the case of the EntityResolver
, might need in the future :)).
Now that the def XMLLoader.reader: XMLReader
extension point is available, it can be used to supply an XMLReader
that uses the one you need but does not break when yours does: wrap org.xml.sax.helpers.XMLFilterImpl
around yours and override set/getEntityResolver()
as appropriate...
It looks like XMLFilterImpl
calls setEntityResolver
right before parser under the hood and still causes issues:
However, I was able to do as you suggest but with a custom XMLReader
wraper. That seems to have solved the problem and allows things to work with v2.2.0. Thanks!
(XMLFilterImpl.parse() can be overwritten ;))
things ... work with v2.2.0. Thanks!
Glad it worked out!
I had originally tried that and it didn't seem to work and I didn't look too much into it. But I now realize that if parse()
is overridden to remove the setupParse call, then I also need to override the set/get functions for ErrorHandler
/ContentHandler
/DTDHandler
to pass through to the parent as well. With those overrides, the XMLFilterImpl
approach also works.