Create a QActionGroup and let it be the parent of…

Question

0

Asked: May 14, 20262026-05-14T23:11:19+00:00 2026-05-14T23:11:19+00:00

I am working with XPATH, Java and want to extract some text out of

0

I am working with XPATH, Java and want to extract some text out of one html page.
The text is located under some div with some whitespace characters in between, like   <br> etc.
I want these to be converted into ‘space’ and ‘newline’ respectively while extracting.
The method I am using to extract text is Element.getTextContent() which does not respect whitespace characters.

Could somebody tell me if there is a way to extract text with whitespace normalization
OR
Extract whole html markup under the ‘Node’ so that i could replace it by myself.
Thanks
Nayn

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-14T23:11:20+00:00

Editorial Team

2026-05-14T23:11:20+00:00Added an answer on May 14, 2026 at 11:11 pm

<br> isn’t text content, it’s an element. I’m not sure what you’re looking for. Try just visiting all the text nodes underneath the element (remembering to recursively check element children) and calling getNodeValue();

0

Reply
Share
Share

- Report

How to approach applying for a job at a company ...

What is a programmer’s life like?

How to handle personal stress caused by utterly incompetent and ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions