I have made a crawler, but I can't understand how to go through pagination

Question


Asked: May 26, 2026 (2026-05-26T10:32:37+00:00)


I have made a crawler, but I can't work out how to go through pagination. Can someone please help me with this? Thanks.

Here is my crawler script:


    if(!$fp = fopen("https://market.android.com/details?id=apps_topselling_paid&cat=LIBRARIES_AND_DEMO&start=0&num=24" ,"r" )) {
        return false;
    }
    $content = "";

    while(!feof($fp)) {
        $content .= fgets($fp, 1024);
    }
    fclose($fp);

    if (!preg_match('/error-section/i', $content)) {
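      // NOTE: the original pattern was truncated in extraction. Capturing the id
      // up to the next quote or ampersand is an assumption, not the asker's regex.
      preg_match_all('/details\?id=([^"&]+)/i', $content, $matches, PREG_SET_ORDER);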

      $i=1;
      foreach ($matches as $val) {

          $link = $val[1];

          if(!$fps = fopen("https://market.android.com/details?id=". $link ,"r" )) {
            return false;
          }
          $content_app = "";

          while(!feof($fps)) {
            $content_app .= fgets($fps, 1024);
          }
          fclose($fps);

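          // NOTE: the original pattern was lost in extraction. Matching the page
          // <title> is an assumption standing in for whatever field was captured.
          if (preg_match('/<title>([^<]*)<\/title>/i', $content_app, $regs)) {
            echo $regs[1] . "<br>";
          }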

      }
    }else{
      echo 'Error page not found!';
    }


1 Answer

Editorial Team · Answer 1 · 2026-05-26T10:32:38+00:00


I assume that the pagination is something similar to comment pagination on blogs.

One way is to find the link to the next page, and follow that link. It can be done quite easily with a regex.
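As a rough sketch of the "follow the next link" approach: the snippet below assumes the site marks its next-page link with `rel="next"` and uses absolute URLs, neither of which is confirmed for the site in the question. `find_next_link` and `crawl_by_next_link` are hypothetical helper names, and the fetcher is passed in as a callable so you can swap in `file_get_contents`, cURL, or a stub.

```php
<?php
// Hypothetical helper: returns the href of the "next page" link, or null.
// The rel="next" marker is an assumption; adapt the pattern to the real markup.
function find_next_link(string $html): ?string
{
    // Attribute order varies, so try rel-before-href and href-before-rel.
    if (preg_match('/<a\b[^>]*\brel="next"[^>]*\bhref="([^"]+)"/i', $html, $m)
        || preg_match('/<a\b[^>]*\bhref="([^"]+)"[^>]*\brel="next"/i', $html, $m)) {
        return html_entity_decode($m[1]); // turn &amp; back into &
    }
    return null;
}

// Follows "next" links starting at $url. $fetch is any callable that returns
// the page body as a string, or false on failure.
function crawl_by_next_link(string $url, callable $fetch, int $maxPages = 50): array
{
    $pages = [];
    $seen = [];
    while ($url !== null && count($pages) < $maxPages && !isset($seen[$url])) {
        $seen[$url] = true;           // guard against link loops
        $html = $fetch($url);
        if ($html === false) {
            break;                    // stop on the first failed request
        }
        $pages[$url] = $html;
        $url = find_next_link($html); // null when there is no next page
    }
    return $pages;
}
```

Calling `crawl_by_next_link($startUrl, 'file_get_contents')` would return the fetched pages keyed by URL. The `$maxPages` cap and the `$seen` check are there so a misbehaving site cannot keep the crawler running forever.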

Another way, if you are crawling a single site, is to work out the URL structure of its pagination and then scan pages incrementally until there are no more comments.
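For the URL in the question, the incremental approach might look like the sketch below. It assumes `start` is an item offset and `num` the page size, which the query string suggests but nothing here confirms. `listing_url` and `crawl_by_offset` are hypothetical names, and both the fetcher and the per-page extractor are passed in as callables.

```php
<?php
// Builds the listing URL for a given offset. The base URL and the start/num
// parameters are taken from the question's script.
function listing_url(int $start, int $num = 24): string
{
    return 'https://market.android.com/details?id=apps_topselling_paid'
        . '&cat=LIBRARIES_AND_DEMO&start=' . $start . '&num=' . $num;
}

// Walks the listing page by page. $fetch returns the page body or false;
// $extract returns an array of the items found on one page.
function crawl_by_offset(callable $fetch, callable $extract, int $num = 24, int $maxPages = 100): array
{
    $items = [];
    for ($page = 0; $page < $maxPages; $page++) {
        $html = $fetch(listing_url($page * $num, $num));
        if ($html === false) {
            break; // request failed, e.g. past the last page
        }
        $found = $extract($html);
        if (count($found) === 0) {
            break; // an empty page means we ran out of results
        }
        $items = array_merge($items, $found);
    }
    return $items;
}
```

The loop stops on the first failed request or the first page with no items, so it works whether the site answers an out-of-range offset with an error or with an empty listing. `$maxPages` is only a safety cap.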
