Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7813101
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 2, 20262026-06-02T04:44:11+00:00 2026-06-02T04:44:11+00:00

I am trying to create a web crawler with python and twisted.What happend is

  • 0

I am trying to create a web crawler with python and twisted.What happend is that at the time of calling
reactor.run()

I don’t know all the link to get.
so the code goes like:

def crawl(url):
    d = getPage(url)
    d.addCallback(handlePage)
    reactor.run()

and the handle page has something like:

def handlePage(output):
    urls = getAllUrls(output)

So now I need to apply the crawl() on each of the url in urls.How do I do that?Should I stop the reactor and start again?If I am missing something obvious please tell me.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-02T04:44:13+00:00Added an answer on June 2, 2026 at 4:44 am

    You don’t want to stop the reactor. You just want to download more pages. So you need to refactor your crawl function to not stop or start the reactor.

    def crawl(url):
        d = getPage(url)
        d.addCallback(handlePage)
    
    def handlePage(output):
        urls = getAllUrls(output)
        for url in urls:
            crawl(url)
    
    crawl(url)
    reactor.run()
    

    You may want to look at scrapy instead of building your own from scratch, though.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I am trying to create a simple web crawler using PHP that is capable
I'm trying to create a rails web app that does not use ActiveRecord framework
I am trying to create a small web app that will allow users to
I am trying to create an asp.net web form that allows a user to
I'm trying to create a small web app that is used to remove items
I'm trying to create web page that access the (business) private calendar of the
I am trying create a small web application that allows a user to login
I am trying to create a web app that is querying my oracle database
I'm trying to create a web form that contains checkboxes, among other input elements,
I am trying to create a web-based tool for my company that, in essence,

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.