I am dealing a problem to write a python regex ‘not’to identify a certain

Question

0

Asked: May 23, 20262026-05-23T17:33:23+00:00 2026-05-23T17:33:23+00:00

I am dealing a problem to write a python regex ‘not’to identify a certain

0

I am dealing a problem to write a python regex ‘not’to identify a certain pattern within href tags.

My aim is to replace all occurrences of DSS[a-z]{2}[0-9]{2} with a href link as shown below,but without replacing the same pattern occurring inside href tags

Present Regex:

replaced = re.sub("[^http://*/s](DSS[a-z]{2}[0-9]{2})", "<a href=\"http://test.com=\\1\">\\1</a>", input)

I need to add this new regex using an OR operator to the existing one I have

EDIT:

I am trying to use regex just for a simple operation. I want to replace the occurrences of the pattern anywhere in the html using a regex except occurring within<a><\a>.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-23T17:33:23+00:00

Editorial Team

2026-05-23T17:33:23+00:00Added an answer on May 23, 2026 at 5:33 pm

The answer to any question having regexp and HTML in the same sentence is here.

In Python, the best HTML parser is indeed Beautilf Soup.

If you want to persist with regexp, you can try a negative lookbehind to avoid anything precessed by a ". At your own risk.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am dealing a problem to write a python regex ‘not’to identify a certain

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply