I have an html page with many tables. <html> <table> POINTER_TEXT some other stuff

Question

0

Asked: June 8, 20262026-06-08T11:55:23+00:00 2026-06-08T11:55:23+00:00

I have an html page with many tables. <html> <table> POINTER_TEXT some other stuff

0

I have an html page with many tables.

<html>
<table>
  POINTER_TEXT
  some other stuff
  <table that i want START>
  </table that i want END>
  some other stuff
  <table bad>
  </table bad>
</table>
</html>

I wish to grab a table that comes after a specific text. I am good until this stage.

curl -silent http://xyz.com/1.htm | sed -n '/POINTER_TEXT/,$p'

This gives me

  POINTER_TEXT
  some other stuff
  <table that i want START>
  </table that i want END>
  some other stuff
  <table bad>
  </table bad>
</table>
</html>

Then I add this:

curl -silent http://xyz.com/1.htm | sed -n '/POINTER_TEXT/,$p' | sed -n '/<table*/,/<\/table>/p'

which gives me this:

  <table that i want START>
  </table that i want END>
  <table bad>
  </table bad>

My problem is I just need this:

  <table that i want START>
  </table that i want END>

Help me please guys!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-08T11:55:24+00:00

Editorial Team

2026-06-08T11:55:24+00:00Added an answer on June 8, 2026 at 11:55 am

Add

| sed '\=</table={p;Q}'

at the end. This should throw away everything after the first table end.

But, what will your script do if there are no newlines in the html? It is far more robust to use a real parser to process HTML.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have an html page with many tables. <html> <table> POINTER_TEXT some other stuff

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply