I’m learning lxml (after using ElementTree) and I’m baffled why .fromstring and .tostring do

Question

0

Asked: May 28, 20262026-05-28T18:28:47+00:00 2026-05-28T18:28:47+00:00

I’m learning lxml (after using ElementTree) and I’m baffled why .fromstring and .tostring do

0

I’m learning lxml (after using ElementTree) and I’m baffled why .fromstring and .tostring do not appear to be reversible. Here’s my example:

import lxml.etree as ET
f = open('somefile.xml','r')
data = f.read()
tree_in = ET.fromstring(data)
tree_out = ET.tostring(tree_in)
f2 = open('samefile.xml','w')
f2.write(tree_out)
f2.close

‘somefile.xml’ was 132 KB.
‘samefile.xml’ – the output – was 113 KB, and it is missing the end of the file at some arbirtrary point. The closing tags of the overall tree and a few of the pieces of the final element are just gone.

Is there something wrong with my code, or must there be something wrong with the nesting in the original XML file? If so, am I forced to use BeautifulSoup of ElementTree again (without xpath)?

One note: The text inside many elements had a bunch of crap that was converted to text, but is that what’s causing this problem?

Example:

<QuestionIndex Id="Perm"><Answer><![CDATA[confirm]]></Answer><Answer><![CDATA[NotConfirm]]></Answer></QuestionIndex>
<QuestionIndex Id="Actor"><Answer><![CDATA[GirlLt16]]></Answer><Answer><![CDATA[Fem17to25]]></Answer><Answer><![CDATA[BoyLt16]]></Answer><Answer><![CDATA[Mal17to25]]></Answer><Answer><![CDATA[Moth]]></Answer><Answer><![CDATA[Fath]]></Answer><Answer><![CDATA[Elder]]></Answer><Answer><![CDATA[RelLead]]></Answer><Answer><![CDATA[Auth]]></Answer><Answer><![CDATA[Teach]]></Answer><Answer><![CDATA[Oth]]></Answer></QuestionIndex>

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-28T18:28:48+00:00

Editorial Team

2026-05-28T18:28:48+00:00Added an answer on May 28, 2026 at 6:28 pm

This problem turned out to be way simpler than it appears, and the answer is hidden in the code I provided.

f.close

should have been

f.close()

The difference is the remaining buffer of a few dozen characters that never made it into the notepad++ file I was checking results in. Closing the file for real made all the difference, and the code works.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m learning lxml (after using ElementTree) and I’m baffled why .fromstring and .tostring do

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply