I’m trying to parse a malformed XHTML page in Python. I just want to

Question

0

Asked: May 27, 20262026-05-27T16:23:03+00:00 2026-05-27T16:23:03+00:00

I’m trying to parse a malformed XHTML page in Python. I just want to

0

I’m trying to parse a malformed XHTML page in Python. I just want to get a few tags of the same type from it, but it seems impossible. Normal XHTML parsers doesn’t like the malformedness, and BeautifulSoup won’t work because of syntax errors in its code. What would be the best way to parse malformed XHTML and get the content of a couple of tags of the same type?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-27T16:23:03+00:00

Editorial Team

2026-05-27T16:23:03+00:00Added an answer on May 27, 2026 at 4:23 pm

Thanks for the help! “Unfortunately” I solved it myself by using this parser and setting html.parser.HTMLParser(strict=False). That made it read malformed XHTML quite well.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m trying to parse a malformed XHTML page in Python. I just want to

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply