a bytes-like object is required, not ‘str’ while parsing XML files

Question

I am trying to parse an xml that looks like this. I want to extract information regarding the katagorie i.e ID, parent ID etc: I am trying this but I get this error: even though I am already using encode(&#8216;utf-8&#8217;) in my code. How can I get rid of this error? Answer EDIT 2 If want to find regarding …

Accepted Answer

EDIT 2If want to find regarding nested attributes or sub-classes, there are two ways:You can use a nested loop:for elem in root: for child in elem: print([(child.attrib.get(name) or 'c') for name in fields])Output:[‘1’, ‘0’]Here, it can also return for classes which have id and parent_id but not the name kategorie.If you want to perform the task with a bit more performance and less memory:for elem in root.iter('kategorie'): print([(elem.attrib.get(name) or 'c') for name in fields])Output:[‘1’, ‘0’]For this method, it will return for every class and sub-class named kategorie.EDIT 1: For the issue in comments: Sprache For the above xml file, the code seems to work perfectly:fields = ['id', 'parent_id']for elem in tree.getroot(): print([(elem.attrib.get(name) or 'c') for name in fields])Output:[‘1’, ‘0’]Original Answer:Looks like you are looking at the wrong location for the error. The error is actually occurring atwriter.writerow(fields)fields is a list containing str and not byte, that is why it is giving you the error. I would have recommended you to change the write type from wb to w, but looking at the rest of the code, it looks like you want to write in byte.writer.writerow([x.encode('utf-8') for x in fields])encode() just converts your data to byte form.

Advertisement

Answer