I am writing a Python web app and in it I plan to leverage

Question

0

Asked: May 26, 20262026-05-26T00:02:12+00:00 2026-05-26T00:02:12+00:00

I am writing a Python web app and in it I plan to leverage

0

I am writing a Python web app and in it I plan to leverage Wikipedia. When trying out some URL Fetching code I was able to fetch both Google and Facebook (via Google App Engine services), but when I attempted to fetch wikipedia.org, I received an exception. Can anyone confirm that Wikipedia does not accept these types of page requests? How can Wikipedia distinguish between me and a user?

Code snippet (it’s Python!):

    import os
import urllib2
from google.appengine.ext.webapp import template


class MainHandler(webapp.RequestHandler):
    def get(self):
        url = "http://wikipedia.org"
        try:
          result = urllib2.urlopen(url)
        except urllib2.URLError, e:
          result = 'ahh the sky is falling'
        template_values= {
            'test':result,
        }
        path = os.path.join(os.path.dirname(__file__), 'index.html')
        self.response.out.write(template.render(path, template_values))

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-26T00:02:13+00:00

urllib2 default user-agent is banned from wikipedia and it results in a 403 HTTP response.
You should modify your application user-agent with something like this:

#Option 1
import urllib2
opener = urllib2.build_opener()
opener.addheaders = [('User-agent', 'MyUserAgent')]
res= opener.open('http://whatsmyuseragent.com/')
page = res.read()

#Option 2
import urllib2
req = urllib2.Request('http://whatsmyuseragent.com/')
req.add_header('User-agent', 'MyUserAgent')
urllib2.urlopen(req)

#Option 3
req = urllib2.Request("http://whatsmyuseragent.com/", 
                       headers={"User-agent": "MyUserAgent"})
urllib2.urlopen(req)

Bonus link:
High level Wikipedia Python Clients
http://www.mediawiki.org/wiki/API:Client_code#Python

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am writing a Python web app and in it I plan to leverage

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply