I’m working on this html snippet: <p class=pageSelector> <a href=/BlaBla>< Prev</a>  <a href=/BlaBla>1</a>  <a

Question

0

Asked: May 20, 20262026-05-20T03:18:50+00:00 2026-05-20T03:18:50+00:00

I’m working on this html snippet: <p class=pageSelector> <a href=/BlaBla>< Prev</a>  <a href=/BlaBla>1</a>  <a

0

I’m working on this html snippet:

<p class="pageSelector">
    <a href="/BlaBla">&lt; Prev</a>&nbsp;
    <a href="/BlaBla">1</a>&nbsp;
    <a href="/BlaBla">2</a>&nbsp;
    <a href="/BlaBla">3</a>&nbsp;
    4&nbsp;
    <a href="/BlaBla">5</a>&nbsp;
    <a href="/BlaBla">6</a>&nbsp;
    <a href="/BlaBla">Next &gt;</a>&nbsp;
</p>

rendered (more or less) as < Prev 1 2 3 4 5 6 Next > .

I want to select the “4” because I need to discover the ‘current’ page. Using

//p[@class='pageSelector']/text()[normalize-space()]

(tested with Firefox XPath Ckecker) I thougth I’d solved but no, because I obtained 7 matches.

Anyone please could tell me where I’m wrong?
Thank you

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-20T03:18:51+00:00

normalize-space removes whitespace, but the no-break-space character (despite its visual appearance) is not considered to be whitespace for this purpose. So I would do

text()[translate(., '&#x20;&#x09;&#x0a;&#x0d;&#xa0;', '')]

which will return you those child text nodes that contain a character other than whitespace or no-break-space; you may then need to process this further to extract the part of the content you want.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m working on this html snippet: <p class=pageSelector> <a href=/BlaBla>&lt; Prev</a>&nbsp; <a href=/BlaBla>1</a>&nbsp; <a

Leave an answerCancel reply

1 Answer

I’m working on this html snippet: <p class=pageSelector> <a href=/BlaBla>< Prev</a> <a href=/BlaBla>1</a> <a

Leave an answer
Cancel reply