XML parsing in python issue using elementTree

Question

I need to parse a soap response and convert to a text file. I am trying to parse the values as detailed below. I am using ElementTree in python I have the below xml response which I need to parse I need to use the below code snippet. The issue is that The below code is not able to find

Accepted Answer

Your document declares a default namespace of alu.v1:...Any attribute without an explicit namespace is in the alu.v1 namespace. You need to qualify your attribute name appropriately:vendorExtensions = device.find("{alu.v1}vendorExtensions")While the above is a real problem with your code that needs to be corrected (the Wikipedia entry on XML namespaces may be useful reading if you’re unfamiliar with how namespaces work), there are also some logic problems with your code.Let’s drop the big list of conditionals from the code and see if it’s actually doing what we think it’s doing. If we run this:from xml.etree import ElementTreeparser = ElementTree.parse("data.xml")root = parser.getroot()queryObjectData = root.find(".//{alu.v1}queryObjectData")for queryObject in queryObjectData: for device in queryObject: print(device.tag)Then using your sample data (once it has been corrected to be syntactically valid), we see as output:{alu.v1}name{alu.v1}vendorExtensionsYour search for the {alu.v1}vendorExtensions element will never succeed before the thing on which you’re trying to search (the device variable) is the thing you’re trying to find.Additionally, the conditional in your loop…if (device.tag.split("}")[1]) == "me":…will never match (there is no element in the entire document for which tag.split("}")[1] == "me" is True).I’m not entirely clear what you’re trying to do, but here’s are some thoughts:Given your example data, you probably don’t want that for device in inventoryObject: loopWe can drastically simplify your code by replacing that long block of conditionals with a list of attributes in which we are interested and then a for loop to extract them.Rather than assigning a bunch of individual variables, we can build up a dictionary with the data from the queryObjectThat might look like:from xml.etree import ElementTreeimport jsonattributeNames = [ "mdNm", "meNm", "userLabel", "resourceState", "location", "manufacturer", "productName", "version",]parser = ElementTree.parse("data.xml")root = parser.getroot()queryObjectData = root.find(".//{alu.v1}queryObjectData")for queryObject in queryObjectData: device = {} for name in attributeNames: if (value := queryObject.find(f".//{{tmf854.v1}}{name}")) is not None: device[name] = value.text vendorExtensions = queryObject.find("{alu.v1}vendorExtensions") extensionMap = {} for extension in vendorExtensions.findall(".//{alu.v1}NameAndStringValue"): extname = extension.find("{tmf854.v1}name").text extvalue = extension.find("{tmf854.v1}value").text extensionMap[extname] = extvalue device["vendorExtensions"] = extensionMap print(json.dumps(device, indent=2))Given your example data, this outputs:{ "mdNm": "AMS", "meNm": "CHEERLAVANCHA_281743", "vendorExtensions": { "hubSubtendedStatus": "NONE", "productAndRelease": "DF.6.1", "adminUserName": "isadmin" }}An alternate approach, in which we just transform each queryObject into a dictionary, might look like this:from xml.etree import ElementTreeimport jsondef localName(ele): return ele.tag.split("}")[1]def etree_to_dict(t): if list(t): d = {} for child in t: if localName(child) == "NameAndStringValue": d.update(dict([[x.text.strip() for x in child]])) else: d.update({localName(child): etree_to_dict(child) for child in t}) return d else: return t.text.strip()parser = ElementTree.parse("data.xml")root = parser.getroot()queryObjectData = root.find(".//{alu.v1}queryObjectData") or []for queryObject in queryObjectData: d = etree_to_dict(queryObject) print(json.dumps(d, indent=2))This will output:{ "name": { "mdNm": "AMS", "meNm": "CHEERLAVANCHA_281743", "ptpNm": "/type=NE/CHEERLAVANCHA_281743" }, "vendorExtensions": { "package": { "hubSubtendedStatus": "NONE", "productAndRelease": "DF.6.1", "adminUserName": "isadmin" } }}That may or may not be appropriate depending on the structure of your real data and exactly what you’re trying to accomplish.

Advertisement

Answer