How to not load the comments while parsing XML in lxml
Matthew Martinez
I try to parse XML file in Python using lxml like this:
objectify.parse(xmlPath, parserWithSchema)but XML file may contains comments in strange places:
<root> <text>Sam<!--comment-->ple text</text> <!--comment--> <float>1.2<!--comment-->3456</float>
</root>It is a way to not load or delete comments before parsing?
1 Answer
Set remove_comments=True on the parser (documentation):
from lxml import etree, objectify
parser = etree.XMLParser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)Or, using the makeparser() method:
parser = objectify.makeparser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)Hope that helps.
2