I am trying to come up with a way to match content that does

Question

Asked: May 12, 2026 (2026-05-12T13:38:23+00:00)

I am trying to come up with a way to match content that is not inside any XML or HTML tags. I’ve read that regular expressions are fundamentally a poor fit for parsing XML/HTML, and I’m open to any solution that solves my problem, but if a regex works too, all the better.

Here’s an example of what I’m looking for:

the lazy fox jumped <span>over</span> the brown fence.

What I want back is

the lazy fox jumped  the brown fence

Any ideas?

1 Answer

Editorial Team · Answer 1 · 2026-05-12T13:38:23+00:00

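It’s probably a naive technique, but my first instinct would be to run the regular expression, find the text it matches within your parent string, and remove that text from the string, returning the remainder. Be aware that a pattern like this still breaks on nested elements of the same name. A rough C# sketch:

using System.Text.RegularExpressions;

string input = "the lazy fox jumped <span>over</span> the brown fence.";
// Lazy quantifiers plus a backreference (\1) stop the match at the first
// closing tag that pairs with the opening tag.
foreach (Match m in Regex.Matches(input, @"<(\w+)[^>]*>.*?</\1>"))
{
    // String.Remove takes an index, so use Replace to drop the matched text.
    input = input.Replace(m.Value, "");
}
// input is now "the lazy fox jumped  the brown fence."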
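
Since the question is open to non-regex solutions, here is a minimal sketch of a parser-based alternative. It is written in Python with the standard library's `html.parser` module. The class name `OutsideTagText`, the helper `text_outside_tags`, and the short `VOID_TAGS` list are illustrative names and assumptions, not part of the original answer. The idea is to track how many elements are currently open and keep only the text seen while that depth is zero.

```python
from html.parser import HTMLParser

# Assumption: a small, incomplete set of tags that never have a closing tag,
# so they must not change the nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class OutsideTagText(HTMLParser):
    """Collects only the text that is not nested inside any element."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # number of elements currently open
        self.chunks = []  # text seen while depth == 0

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth == 0:
            self.chunks.append(data)


def text_outside_tags(markup):
    """Return the parts of markup that sit outside every element."""
    parser = OutsideTagText()
    parser.feed(markup)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.chunks)


print(text_outside_tags("the lazy fox jumped <span>over</span> the brown fence."))
```

Unlike the regex loop, this copes with nested elements, because the depth counter only returns to zero once every open element has been closed. It does assume reasonably well-formed markup: an unclosed non-void tag would hide all the text that follows it.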
