I’m using Python’s ElementTree to parse xml files. I have a findall to find

Question

0

Asked: May 26, 20262026-05-26T11:03:24+00:00 2026-05-26T11:03:24+00:00

I’m using Python’s ElementTree to parse xml files. I have a findall to find

0

I’m using Python’s ElementTree to parse xml files. I have a “findall” to find all “revision” subelements, but when I iterate through the result, they are not in document order. What can I be doing wrong?

Here’s my code:

allrevisions = page.findall('{http://www.mediawiki.org/xml/export-0.5/}revision')
for rev in allrevisions:
    print rev
    print rev.find('{http://www.mediawiki.org/xml/export-0.5/}timestamp').text

Here’s a link to the document I’m parsing: http://pastie.org/2780983

Thanks,
bsg

-Oops. By going through my code and running it piece by piece, I worked out the problem – I had stuck in a reverse() on the elements list in the wrong place, which was causing all the trouble. Thank you so much for your help – I’m sorry it was such a silly issue.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-26T11:03:25+00:00

The documentation for ElementTree says that findall returns the elements in document order.

A quick test shows the correct behaviour:

import xml.etree.ElementTree as et

xmltext = """
<root>
    <number>1</number>
    <number>2</number>
    <number>3</number>
    <number>4</number>
</root>
"""

tree = et.fromstring(xmltext)

for number in tree.findall('number'):
    print number.text

Result:

It would be helpful to see the document you are parsing.

Update:

Using the source data you provided:

from __future__ import with_statement
import xml.etree.ElementTree as et

with open('xmldata.xml', 'r') as f:
    xmldata = f.read()

tree = et.fromstring(xmldata)

for revision in tree.findall('.//{http://www.mediawiki.org/xml/export-0.5/}revision'):
    print revision.find('{http://www.mediawiki.org/xml/export-0.5/}text').text[0:10].encode('utf8')

Result:

‘The Mind 
{{db-spam}
‘The Mind 
'''The Min
<!-- Pleas

The same order as they appear in the document.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m using Python’s ElementTree to parse xml files. I have a findall to find

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply