Question

Asked: May 13, 2026 (2026-05-13T22:45:27+00:00)

I’m trying to find a single regular expression that I can use to parse a block of HTML to find some specific text, but only if that text is not part of an existing hyperlink. I want to turn the non-links into links, which is easy, but identifying the non-linked ones with a single expression seems more troublesome. In the following example:

  This problem is a result of BugID 12.
  If you want more information, refer to <a href="/bug.aspx?id=12">BugID 12</a>.

I want a single expression to find “BugID 12” so I can link it, but I don’t want to match the second one because it’s already linked.

In case it matters, I’m using .NET’s regular expressions.

1 Answer

Editorial Team · Answer 1 · 2026-05-13T22:45:28+00:00

.NET supports negative lookaheads, so you can use:

(BugID 12)(?!</a>)  // match BugID 12 if it is not followed by a closing anchor tag.

However, this still matches when BugID 12 sits inside an anchor together with other text, like:

<a href="...">Something BugID 12 Something</a>
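To make that limitation concrete, here is a minimal sketch of the first pattern run against three inputs. It is written in Python only so it can be run quickly; the lookahead syntax is the same in .NET. The variable names are made up for illustration.

```python
import re

# First pattern from the answer: skip a match that is directly followed by </a>.
simple = re.compile(r"(BugID 12)(?!</a>)")

plain = "This problem is a result of BugID 12."
linked = 'refer to <a href="/bug.aspx?id=12">BugID 12</a>.'
wrapped = '<a href="...">Something BugID 12 Something</a>'

plain_hit = simple.search(plain) is not None      # unlinked text is found
linked_hit = simple.search(linked) is not None    # already-linked text is skipped
wrapped_hit = simple.search(wrapped) is not None  # false positive: text is inside a link
```

The third case is the problem: the text is inside an anchor, but because `</a>` does not come immediately after it, the lookahead does not reject it.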

But you can mostly overcome this with

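(BugID 12)(?![\s\w]*</a>)  // [\s\w]* matches any word characters or spaces between the string and the end tag. A single character class accepts the same text as (?:\s*\w*)* but avoids the heavy backtracking that nested quantifiers can cause.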
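Since the goal is to turn the unlinked text into links, here is a rough sketch of the replacement step. It makes some assumptions that are not in the question: bug references always look like `BugID <digits>`, and the link target follows the `/bug.aspx?id=<digits>` shape from the example. The lookahead is written as a single character class, `[\s\w]*`, which allows only word characters and whitespace before `</a>`. `link_bugs` is a hypothetical helper name. In .NET the same idea would go through `Regex.Replace`, with `$1` instead of `\1` in the replacement string.

```python
import re

# Assumption: references look like "BugID <digits>" and links use /bug.aspx?id=<digits>.
UNLINKED_BUG = re.compile(r"BugID (\d+)(?![\s\w]*</a>)")

def link_bugs(html: str) -> str:
    """Wrap unlinked BugID references in anchors (hypothetical helper)."""
    return UNLINKED_BUG.sub(r'<a href="/bug.aspx?id=\1">BugID \1</a>', html)

text = (
    "This problem is a result of BugID 12.\n"
    'If you want more information, refer to <a href="/bug.aspx?id=12">BugID 12</a>.'
)
result = link_bugs(text)
```

Running `link_bugs` on its own output should change nothing, because every reference is then followed by `</a>`.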

Disclaimer: Parsing HTML with regex is not reliable and should only be done as a last resort, or in the simplest of cases. I’m sure there are plenty of inputs where the above expression does not behave as desired. For example, it wrongly matches BugID 12</span></a>, where another closing tag sits between the text and the </a>.
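If a single regex turns out to be too fragile, one alternative is to let an HTML parser track whether the current text is inside an anchor, and only rewrite text that is not. The sketch below uses Python's standard `html.parser` purely to illustrate the idea; in .NET you would need an HTML parsing library instead. The class name, the `BugID <digits>` format and the link shape are all assumptions. The sketch does not handle doctypes or other declarations.

```python
import re
from html.parser import HTMLParser

BUG = re.compile(r"BugID (\d+)")  # assumed reference format

class BugLinker(HTMLParser):
    """Re-emit HTML, linking BugID references found outside <a> elements."""

    def __init__(self):
        # Keep entities such as &amp; unconverted so they can be echoed verbatim.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.anchor_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.anchor_depth += 1
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are echoed once, unchanged.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag == "a" and self.anchor_depth > 0:
            self.anchor_depth -= 1
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        # Only text outside any anchor is rewritten.
        if self.anchor_depth == 0:
            data = BUG.sub(r'<a href="/bug.aspx?id=\1">BugID \1</a>', data)
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

def link_bugs_parsed(html: str) -> str:
    parser = BugLinker()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)

sample = '<a href="/bug.aspx?id=12"><span>BugID 12</span></a> and BugID 7.'
linked = link_bugs_parsed(sample)
```

Because the decision depends on anchor depth and not on what follows the text, the `</span></a>` case from the disclaimer is left alone, while the bare reference after it is linked.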
