Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 8769499
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 13, 20262026-06-13T17:21:43+00:00 2026-06-13T17:21:43+00:00

I need to scrape a webpage with javascript and looks like this is resolved

  • 0

I need to scrape a webpage with javascript and looks like this is resolved in the example at http://snipplr.com/view/66996/renderedinteractive-javascript-with-gtkwebkitjswebkit/
referred in question Extracting data from Web code uses webkit downloader class. I understand that I need to invoke process_request function. What do I pass in as parameter in request. I looked through scrapy documentation to see if I have to pass request object created in scrapy but that does not work.

Also, I understand that the spider object is to be passed in process_request as the last parameter. Which object should that be? Sorry I am new to python, scrapy and webkit so maybe asking questins with obvious answers.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-13T17:21:44+00:00Added an answer on June 13, 2026 at 5:21 pm

    You don’t “invoke” process_request manually, you only have to declare it, the engine will invoke it with all the right parameters. Just create a file called middleware.py (or whatever you want to call it) and type in:

    ...
    class WebkitDownloader( object ):
        def process_request( self, request, spider ):
            if not isinstance(request, FormRequest):
                webview = webkit.WebView()
                ...
    

    and all the rest of it, and then in your settings.py file type in:

    DOWNLOADER_MIDDLEWARES = {
        'mybot.middleware.WebkitDownloader': 1000,
        ...
    }
    

    That should get your middleware working.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Dear all,I am now using a webtool http://fiddesktop.cs.northwestern.edu/mmp/scrape?url= to parse a webpage. For example,we
Basically I need to scrape some text that has nested tags. Something like this:
I need to scrape a page who is using javascript. this is why I'm
I need to scrape an xml file from http://feeds.feedburner.com/Torrentfreak for its links and description.
Need to apply a filter to a file like this: TUPAC_0006:1:1:2554:2356#0/1 0 * 0
I am trying to scrape the datas from a webpage, but I get need
I need to scrape some websites, and would like to avoid downloading images from
I need to scrape a simple webpage which has the following text: Value=29 Time=128769
I have this situation where I need to get some data from a webpage
Here is a sample block of code I need to scrape: <p>This paragraph contains

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.