Asked: May 11, 2026

Does anyone know an easy way in Python to convert a string with HTML entity codes (e.g. &lt; &amp;) to a normal string (e.g. < &)?

cgi.escape() will escape strings (poorly), but there is no unescape().


1 Answer

score 0 · Answer 1 · 2026-05-11T09:35:08+00:00

HTMLParser provides this functionality in the standard library. In Python 2 the method is, unfortunately, undocumented:

(Python 2 Docs)

>>> import HTMLParser
>>> h = HTMLParser.HTMLParser()
>>> h.unescape('alpha &lt; &beta;')
u'alpha < \u03b2'

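(Python 3 Docs)

In Python 3.4 and later, use the documented html.unescape() function instead. HTMLParser.unescape() was deprecated in Python 3.4 and removed in Python 3.9:

>>> import html
>>> html.unescape('alpha &lt; &beta;')
'alpha < β'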

htmlentitydefs (html.entities in Python 3) is documented, but requires you to do a lot of the work yourself.
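
To give a sense of the work involved, here is a minimal sketch of doing it by hand in Python 3 with html.entities.name2codepoint. The helper name unescape_entities and the regular expression are illustrative assumptions, not part of the standard library, and the sketch only covers the HTML 4 named entities plus decimal and hexadecimal numeric references.

```python
import re
from html.entities import name2codepoint  # htmlentitydefs in Python 2

# Matches &name;, &#123; and &#x1F; style references.
_ENTITY_RE = re.compile(r"&(#[xX][0-9a-fA-F]+|#[0-9]+|[A-Za-z][A-Za-z0-9]*);")


def unescape_entities(text):
    """Hypothetical helper: replace entity references with their characters."""

    def replace(match):
        body = match.group(1)
        try:
            if body[0] == "#":
                if body[1] in "xX":
                    # Hexadecimal numeric reference, e.g. &#x42;
                    return chr(int(body[2:], 16))
                # Decimal numeric reference, e.g. &#65;
                return chr(int(body[1:]))
            # Named entity, e.g. &lt; or &beta;
            return chr(name2codepoint[body])
        except (KeyError, ValueError, OverflowError):
            # Unknown name or out-of-range code point: leave the text as-is.
            return match.group(0)

    return _ENTITY_RE.sub(replace, text)
```

Because the substitution is a single pass, a double-escaped input such as &amp;lt; comes back as &lt; rather than <.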

If you only need the XML predefined entities (lt, gt, amp, quot, apos), you could use minidom to parse them. If you only need the predefined entities and no numeric character references, you could even just use a plain old string replace for speed.
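
The plain string replace approach might look like the sketch below. The function name unescape_predefined is a hypothetical helper, and it assumes the input contains only the five predefined entities and no numeric character references.

```python
def unescape_predefined(text):
    """Hypothetical helper: unescape only the five XML predefined entities."""
    replacements = (
        ("&lt;", "<"),
        ("&gt;", ">"),
        ("&quot;", '"'),
        ("&apos;", "'"),
    )
    for entity, char in replacements:
        text = text.replace(entity, char)
    # Replace &amp; last so that "&amp;lt;" becomes "&lt;" rather than "<".
    return text.replace("&amp;", "&")
```

The order matters: handling &amp; first would turn double-escaped text into a second round of entities that the later replacements would then wrongly decode.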
