Call the function below: function goToNextId() { var id =…

Question

0

Asked: May 12, 20262026-05-12T15:15:53+00:00 2026-05-12T15:15:53+00:00

I need to grab some content from an HTML (XHTML valid) page. I grab

0

I need to grab some content from an HTML (XHTML valid) page. I grab the page using curl and store it in memory.

I played with the idea of using regex with the PCRE library, but simply I couldn’t find any examples using it with C. Then I moved on to look at HTML parsers and again there is not a good selection. All I could find was a skimpy documented module for libxml called HTMLparser.

Are there any alternatives? If not, then examples for what I found already?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-12T15:15:54+00:00

Editorial Team

2026-05-12T15:15:54+00:00Added an answer on May 12, 2026 at 3:15 pm

You want to use HTML tidy to do this. The Lib curl page has some source code to get you going. Documents traversing the dom tree. You don’t need an xml parser. Doesn’t fail on badly formated html.

http://curl.haxx.se/libcurl/c/htmltidy.html

0

Reply
Share
Share

- Report

How to approach applying for a job at a company ...

What is a programmer’s life like?

How to handle personal stress caused by utterly incompetent and ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions