The Archive Base


Editorial Team
Asked: May 11, 2026

Is it possible to easily cap the kbps when using urllib2?

Is it possible to easily cap the kbps when using urllib2? If it is, any code examples or resources you could direct me to would be greatly appreciated.


1 Answer

  1. Added an answer on May 11, 2026 at 2:14 am

    There is the urlretrieve(url, filename=None, reporthook=None, data=None) function in the urllib module. If you implement the reporthook function or object as either a token bucket or a leaky bucket, you have your global rate limit.

    EDIT: Upon closer examination, I see that a global rate limit is not as easy to do with reporthook as I thought. reporthook is only given the downloaded amount (as a block count and a block size) and the total size, which on their own aren't enough information to use with the token bucket. One way around this is to store the last downloaded amount in each rate limiter, but use a single global token bucket.
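The bookkeeping described in this edit can be sketched in isolation. The example below is only an illustration: `DeltaTracker` and `simulate_urlretrieve` are hypothetical names, and the simulation merely imitates the way urlretrieve calls its hook (once before any data, then once per block) without touching the network. It shows why each hook has to remember the previous amount: the arguments are cumulative, while a token bucket needs the size of each new piece.

```python
class DeltaTracker(object):
    """Hypothetical hook: turns cumulative counts into per-call byte deltas."""

    def __init__(self):
        self.last_downloaded = 0
        self.deltas = []

    def __call__(self, block_count, block_size, total_size):
        downloaded = block_count * block_size
        if total_size > 0:
            # The last block is usually short, so cap at the real size.
            downloaded = min(downloaded, total_size)
        self.deltas.append(downloaded - self.last_downloaded)
        self.last_downloaded = downloaded


def simulate_urlretrieve(hook, total_size, block_size=8192):
    """Stand-in for urlretrieve's callback pattern (no network involved)."""
    block_count = 0
    hook(block_count, block_size, total_size)  # initial call, no data yet
    remaining = total_size
    while remaining > 0:
        remaining -= min(block_size, remaining)
        block_count += 1
        hook(block_count, block_size, total_size)


tracker = DeltaTracker()
simulate_urlretrieve(tracker, total_size=20000)
print(tracker.deltas)
```

Each delta is the amount a shared bucket would be asked for on that call, so several trackers can draw from one bucket without double counting.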


    EDIT 2: Combined both codes into one example.

    '''Rate limiters with shared token bucket.'''

    # Python 2 code: print statements and the urllib/urlparse modules.
    import os
    import sys
    import threading
    import time
    import urllib
    import urlparse


    class TokenBucket(object):
        '''An implementation of the token bucket algorithm.
        source: http://code.activestate.com/recipes/511490/

        >>> bucket = TokenBucket(80, 0.5)
        >>> print bucket.consume(10)
        0
        '''
        def __init__(self, tokens, fill_rate):
            '''tokens is the total tokens in the bucket. fill_rate is the
            rate in tokens/second that the bucket will be refilled.'''
            self.capacity = float(tokens)
            self._tokens = float(tokens)
            self.fill_rate = float(fill_rate)
            self.timestamp = time.time()
            self.lock = threading.RLock()

        def consume(self, tokens):
            '''Consume tokens from the bucket. Returns 0 if there were
            sufficient tokens, otherwise the expected time until enough
            tokens become available.'''
            with self.lock:
                # Clamp to the capacity so an oversized request can still be
                # met. (The original used max(tokens, self.tokens), which
                # drained the whole bucket on every successful call.)
                tokens = min(tokens, self.capacity)
                expected_time = (tokens - self.tokens) / self.fill_rate
                if expected_time <= 0:
                    self._tokens -= tokens
            return max(0, expected_time)

        @property
        def tokens(self):
            '''Refill the bucket for the elapsed time and return its level.'''
            with self.lock:
                if self._tokens < self.capacity:
                    now = time.time()
                    delta = self.fill_rate * (now - self.timestamp)
                    self._tokens = min(self.capacity, self._tokens + delta)
                    self.timestamp = now
                return self._tokens


    class RateLimit(object):
        '''Rate limit a url fetch.
        source: http://mail.python.org/pipermail/python-list/2008-January/472859.html
        (but mostly rewritten)
        '''
        def __init__(self, bucket, filename):
            self.bucket = bucket
            self.last_update = 0
            self.last_downloaded_kb = 0

            self.filename = filename
            self.avg_rate = None

        def __call__(self, block_count, block_size, total_size):
            total_kb = total_size / 1024.

            # The hook receives cumulative counts, so keep the previous
            # amount to work out how much arrived since the last call.
            downloaded_kb = (block_count * block_size) / 1024.
            just_downloaded = downloaded_kb - self.last_downloaded_kb
            self.last_downloaded_kb = downloaded_kb

            # Reserve tokens for the next block before it is read.
            predicted_size = block_size / 1024.

            wait_time = self.bucket.consume(predicted_size)
            while wait_time > 0:
                time.sleep(wait_time)
                wait_time = self.bucket.consume(predicted_size)

            now = time.time()
            delta = now - self.last_update
            if self.last_update != 0:
                if delta > 0:
                    rate = just_downloaded / delta
                    if self.avg_rate is not None:
                        # Exponential moving average of the observed rate.
                        rate = 0.9 * self.avg_rate + 0.1 * rate
                    self.avg_rate = rate
                else:
                    rate = self.avg_rate or 0.
                print '%20s: %4.1f%%, %5.1f KiB/s, %.1f/%.1f KiB' % (
                        self.filename, 100. * downloaded_kb / total_kb,
                        rate, downloaded_kb, total_kb,
                    )
            self.last_update = now


    def main():
        '''Fetch the contents of urls'''
        if len(sys.argv) < 4:
            print 'Syntax: %s rate url1 url2 ...' % sys.argv[0]
            raise SystemExit(1)
        rate_limit = float(sys.argv[1])
        urls = sys.argv[2:]
        # One bucket shared by all downloads: this is the global limit.
        bucket = TokenBucket(10 * rate_limit, rate_limit)

        print 'rate limit = %.1f' % (rate_limit,)

        threads = []
        for url in urls:
            path = urlparse.urlparse(url, 'http')[2]
            filename = os.path.basename(path)
            print 'Downloading "%s" to "%s"...' % (url, filename)
            rate_limiter = RateLimit(bucket, filename)
            t = threading.Thread(
                target=urllib.urlretrieve,
                args=(url, filename, rate_limiter))
            t.start()
            threads.append(t)

        for t in threads:
            t.join()

        print 'All downloads finished'


    if __name__ == '__main__':
        main()
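Since the question asks about urllib2 specifically, a different and simpler approach may also be worth sketching: read the response in chunks yourself and sleep between reads. This is not the method used in the answer above. `throttled_copy` and `FakeClock` are hypothetical names introduced here, and the sketch assumes only that the source is a file-like object with a `read()` method, which is what `urllib2.urlopen(url)` returns in Python 2 (`urllib.request.urlopen` in Python 3). The demo uses an in-memory stream and a fake clock so it runs instantly and without a network.

```python
import io
import time


def throttled_copy(src, dst, rate_bytes_per_s, chunk_size=8192,
                   clock=time.time, sleep=time.sleep):
    """Copy src to dst, sleeping so the average rate stays under the cap."""
    start = clock()
    copied = 0
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            break
        dst.write(chunk)
        copied += len(chunk)
        # Earliest moment at which `copied` bytes are allowed to have arrived.
        earliest = start + copied / float(rate_bytes_per_s)
        delay = earliest - clock()
        if delay > 0:
            sleep(delay)
    return copied


class FakeClock(object):
    """Test double: sleeping just advances a counter."""

    def __init__(self):
        self.now = 0.0

    def time(self):
        return self.now

    def sleep(self, seconds):
        self.now += seconds


fake = FakeClock()
source = io.BytesIO(b"x" * 20000)  # stands in for a urlopen() response
sink = io.BytesIO()
total = throttled_copy(source, sink, rate_bytes_per_s=10000,
                       clock=fake.time, sleep=fake.sleep)
print(total, fake.now)
```

With a real response object, the `clock` and `sleep` arguments can be left at their defaults. This caps one download at a time; a limit shared across several threads still needs something like the shared token bucket in the answer.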