Velvet Star Monitor

Standout celebrity highlights with iconic style.

updates

How to not load the comments while parsing XML in lxml

Writer Matthew Martinez

I try to parse XML file in Python using lxml like this:

objectify.parse(xmlPath, parserWithSchema)

but XML file may contains comments in strange places:

<root> <text>Sam<!--comment-->ple text</text> <!--comment--> <float>1.2<!--comment-->3456</float>
</root>

It is a way to not load or delete comments before parsing?

1 Answer

Set remove_comments=True on the parser (documentation):

from lxml import etree, objectify
parser = etree.XMLParser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)

Or, using the makeparser() method:

parser = objectify.makeparser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)

Hope that helps.

2

Your Answer

Sign up or log in

Sign up using Google Sign up using Facebook Sign up using Email and Password

Post as a guest

By clicking “Post Your Answer”, you agree to our terms of service, privacy policy and cookie policy