Possible Duplicate:
Convert XML/HTML Entities into Unicode String in Python
I am reading an Excel XML document using Python. I end up with a lot of characters such as
&#233;
that represent various accented letters (and the like). Is there an easy way to convert these characters to UTF-8?
If you just want to parse the HTML entity to its Unicode equivalent:
This is for Python 2.x; for 3.x the import is
import html.parser
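For Python 3.4 and later, a minimal sketch of the same idea can use the standard library's `html.unescape`, which decodes both numeric references such as `&#233;` and named ones such as `&egrave;`. The helper name `decode_entities` and the sample string are made up for illustration and are not from the original answer. Note that the `unescape` method on `HTMLParser` was removed in Python 3.9, so `html.unescape` is the safer choice on current versions.

```python
import html


def decode_entities(text):
    """Replace HTML/XML character references in text with the characters they name.

    Hypothetical helper for illustration; it only wraps html.unescape.
    """
    return html.unescape(text)


# Sample input (an assumption, standing in for text read from the Excel XML file).
decoded = decode_entities("caf&#233; and cr&egrave;me")

# The result is a Unicode str; encode it only if UTF-8 bytes are actually needed.
utf8_bytes = decoded.encode("utf-8")

print(decoded)
print(utf8_bytes)
```

In Python 3 the decoded value is already a Unicode string, so the explicit `.encode("utf-8")` step is only needed when writing bytes to a file or socket.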