How to clean HTML fromany special tag via Regex in C#? Here is a

Question

0

Editorial Team

Asked: June 11, 20262026-06-11T07:04:57+00:00 2026-06-11T07:04:57+00:00

How to clean HTML fromany special tag via Regex in C#? Here is a

0

How to clean HTML fromany special tag via Regex in C#?

Here is a sample HTML where Ineed to delete <font size="-2">

R&amp;usg=AFQjCNFYiDC6u3xOGn4JpO-GF83PjdSbtw&amp;url=http://online.wsj.com/article/SB10000872396390444426404577647060576633348.html"><img src="//nt2.ggpht.com/news/tbn/bm6jvTMtF-PpnM/6.jpg" alt="" border="1" width="80" height="80" /><br /><font size="-2">Wall Street Journal</font></a></font>
            </td>

I know we have to use somehow Regex, but I cannot figure out how we can use it.

I have tried to adjust this method but it cleans ALL tags.

public string Strip(string text) 
{ 
   return Regex.Replace(text, @”<(.|\n)*?>”, string.Empty); 
}

In fact I am looking to some approach to do like this

public string Strip(string text, HTMLTags.Font)
{

}

where HTMLTags.Font is a enum of some of the HTML tags

enum HTMLTags
{
    Font,
    Div,
    Td
    ...
}

Thank you for any clue!!!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-11T07:04:59+00:00

Editorial Team

2026-06-11T07:04:59+00:00Added an answer on June 11, 2026 at 7:04 am

use HtmlAgilityPack to parse html

HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.LoadHtml(html);

foreach (var font in doc.DocumentNode.Descendants("font").ToArray())
{
    font.Remove();
}

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

How to clean HTML fromany special tag via Regex in C#? Here is a

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply