Possible Duplicate:
Decode HTML entities in Python string?
I have a malformed string in Python:
Muhammad Ali's fight with Larry Holmes
where ' is a apostrophe.
Firstly what representation is this: '? Secondly, how can I parse the string in python so that it replaces ' with '
The Python Standard Library’s
HTMLParseris able to decode HTML entities in strings.A range of solutions are described here: http://fredericiana.com/2010/10/08/decoding-html-entities-to-text-in-python/