How do I save an Element Tree to a list based on an attribute in a child tag using Python’s LXML module?

Question

I have an xml document that I have to parse. I&#8217;m using python 3.8 and the lxml module. The XML contains Titles which has other child element tags like the xml below. I need to only find the &#8220;change&#8221; events and keep that &#8220;Title&#8221; in a list. I would like to save all of the tags of t…

Accepted Answer

If txt is your XML snippet from the question, then you can do this to extract tags which contain <Event type="change">:from lxml import etree, htmlroot = etree.fromstring(txt)for title in root.xpath('.//Title[.//Event[@type="change"]]'): print(html.tostring(title).decode('utf-8')) print('-' * 80)Prints:<Title ref="111111"> <Events> <Event type="change"></Event> </Events> <tag1>John</tag1> <tag2>A.</tag2> <tag3>Smith</tag3> -------------------------------------------------------------------------------- <Events> <Event type="change"></Event> </Events> <tag1>Julie</tag1> <tag2>A.</tag2> <tag3>Moore</tag3> --------------------------------------------------------------------------------

Advertisement

Answer