I have some non well-formed xml (HTML) data in JAVA, I used JAXP Dom, but It complains.
The Question is :Is there any way to
use JAXP to parse such documents ??
I have a file containing data such as :
<employee>
<name value="ahmed" > <!-- note, this element is not closed, So it is not well-formed xml-->
</employee>
Not really. JAXP wants well-formed markup. Have you considered the Cyberneko HTML Parser? We’ve been very successful with it at our shop.
EDIT: I see you are wanting to parse XML too. Hrmm…. Cyberneko works well for HTML but I don’t know about others. It has a tag balancer that would close some tags off, but I don’t know if you can train it to recognize tags that are not HTML.