I need to grab inline script tags inside html pages. The regex will eventually

Question

Asked: June 1, 2026 (2026-06-01T02:31:50+00:00)

I need to grab inline script tags inside HTML pages.
The regex will eventually be driven from C#.
For now I am using Expresso for testing.

This is the best I have so far:

.*<script.*\r\n(.*\r\n)*\s*</script>

That is:

.*<script       matches the opening script tag
.*\r\n          matches anything up to the end of that line
(.*\r\n)*       matches the remaining lines of the script
\s*</script>    matches the closing tag, with any indentation before it

It grabs everything from the first opening tag to the last closing tag, including HTML and other script tags.

1 Answer

Editorial Team · Answer 1 · 2026-06-01T02:31:51+00:00

Two scripts on the same line will break your regex. Try it on the source of the page with your question.
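To make the failure concrete, here is a minimal sketch using only System.Text.RegularExpressions. The class name ScriptRegexDemo, both method names, and the simplified patterns are illustrative assumptions, not the exact pattern from the question. It compares a greedy quantifier with a lazy one on input that has two scripts on one line.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, for illustration only.
public static class ScriptRegexDemo
{
    // Greedy: .* runs on to the LAST </script>, so everything in between
    // (HTML and other scripts) ends up inside a single match.
    public static int GreedyCount(string html) =>
        Regex.Matches(html, @"<script.*</script>", RegexOptions.Singleline).Count;

    // Lazy: .*? stops at the FIRST </script> after each opening tag.
    // Singleline lets '.' cross line breaks, so \r\n needs no special handling.
    public static int LazyCount(string html) =>
        Regex.Matches(html, @"<script\b[^>]*>.*?</script>",
                      RegexOptions.Singleline | RegexOptions.IgnoreCase).Count;

    public static void Main()
    {
        const string html = "<script>a()</script><p>text</p><script>b()</script>";
        Console.WriteLine(GreedyCount(html)); // 1
        Console.WriteLine(LazyCount(html));   // 2
    }
}
```

The lazy version handles this input, but it still trips on commented-out markup or on a closing tag written as `</script >`, which is why the advice below is to use a parser.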

Parsing HTML with regex is not a good idea (a link in the comments on your question explains why the <center> cannot hold); use an HTML parser instead.

The following snippet selects the <script> nodes using HtmlAgilityPack:

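using HtmlAgilityPack;

var doc = new HtmlDocument();
doc.LoadHtml(html); // html holds the page markup as a string; use doc.Load(path) to read a file
var scripts = doc.DocumentNode.SelectNodes("//script"); // null when the page has no <script>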

Isn't this simpler than regex?
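Since the question asks for inline scripts specifically, the same approach can be narrowed with an XPath predicate. This is a sketch under assumptions: "inline" is taken to mean a <script> element without a src attribute, the HtmlAgilityPack NuGet package is referenced, and the class name InlineScripts and its Extract method are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using HtmlAgilityPack; // NuGet package: HtmlAgilityPack

// Hypothetical helper, for illustration only.
public static class InlineScripts
{
    // Returns the body of every <script> element that has no src attribute.
    public static List<string> Extract(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html); // parse markup held in a string

        var bodies = new List<string>();

        // SelectNodes returns null, not an empty collection, when nothing matches.
        var nodes = doc.DocumentNode.SelectNodes("//script[not(@src)]");
        if (nodes == null)
        {
            return bodies;
        }

        foreach (var node in nodes)
        {
            bodies.Add(node.InnerHtml); // the script source between the tags
        }
        return bodies;
    }
}
```

Calling InlineScripts.Extract(pageSource) on a page with two inline scripts and one external script should give back the two inline bodies, whether or not the tags share a line.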
