An XML document can only have a single root element.…

Question

0

Asked: May 10, 20262026-05-10T21:00:04+00:00 2026-05-10T21:00:04+00:00

In .Net, I found this great library, HtmlAgilityPack that allows you to easily parse

0

In .Net, I found this great library, HtmlAgilityPack that allows you to easily parse non-well-formed HTML using XPath. I’ve used this for a couple years in my .Net sites, but I’ve had to settle for more painful libraries for my Python, Ruby and other projects. Is anyone aware of similar libraries for other languages?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

score 0 · Answer 1 · 2026-05-10T21:00:04+00:00

2026-05-10T21:00:04+00:00Added an answer on May 10, 2026 at 9:00 pm

In python, ElementTidy parses tag soup and produces an element tree, which allows querying using XPath:

>>> from elementtidy.TidyHTMLTreeBuilder import TidyHTMLTreeBuilder as TB >>> tb = TB() >>> tb.feed('<p>Hello world') >>> e= tb.close() >>> e.find('.//{http://www.w3.org/1999/xhtml}p') <Element {http://www.w3.org/1999/xhtml}p at 264eb8>

0

Reply
Share
Share

- Report

How to approach applying for a job at a company ...

How to handle personal stress caused by utterly incompetent and ...

What is a programmer’s life like?

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions