Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 110541
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 11, 20262026-05-11T02:14:35+00:00 2026-05-11T02:14:35+00:00

is it possible to easily cap the kbps when using urllib2 ? If it

  • 0

is it possible to easily cap the kbps when using urllib2? If it is, any code examples or resources you could direct me to would be greatly appreciated.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. 2026-05-11T02:14:36+00:00Added an answer on May 11, 2026 at 2:14 am

    There is the urlretrieve(url, filename=None, reporthook=None, data=None) function in the urllib module. If you implement the reporthook-function/object as either a token bucket, or a leaky bucket, you have your global rate-limit.

    EDIT: Upon closer examination I see that it isn’t as easy to do global rate-limit with reporthook as I thought. reporthook is only given the downloaded amount and the total size, which on their own isn’t enough to information to use with the token-bucket. One way to get around it is by storing the last downloaded amount in each rate-limiter, but use a global token-bucket.


    EDIT 2: Combined both codes into one example.

    '''Rate limiters with shared token bucket.'''  import os import sys import threading import time import urllib import urlparse  class TokenBucket(object):     '''An implementation of the token bucket algorithm.     source: http://code.activestate.com/recipes/511490/      >>> bucket = TokenBucket(80, 0.5)     >>> print bucket.consume(10)     True     >>> print bucket.consume(90)     False     '''     def __init__(self, tokens, fill_rate):         '''tokens is the total tokens in the bucket. fill_rate is the         rate in tokens/second that the bucket will be refilled.'''         self.capacity = float(tokens)         self._tokens = float(tokens)         self.fill_rate = float(fill_rate)         self.timestamp = time.time()         self.lock = threading.RLock()      def consume(self, tokens):         '''Consume tokens from the bucket. Returns 0 if there were         sufficient tokens, otherwise the expected time until enough         tokens become available.'''         self.lock.acquire()         tokens = max(tokens,self.tokens)         expected_time = (tokens - self.tokens) / self.fill_rate         if expected_time <= 0:             self._tokens -= tokens         self.lock.release()         return max(0,expected_time)      @property     def tokens(self):         self.lock.acquire()         if self._tokens < self.capacity:             now = time.time()             delta = self.fill_rate * (now - self.timestamp)             self._tokens = min(self.capacity, self._tokens + delta)             self.timestamp = now         value = self._tokens         self.lock.release()         return value  class RateLimit(object):     '''Rate limit a url fetch.     source: http://mail.python.org/pipermail/python-list/2008-January/472859.html     (but mostly rewritten)     '''     def __init__(self, bucket, filename):         self.bucket = bucket         self.last_update = 0         self.last_downloaded_kb = 0          self.filename = filename         self.avg_rate = None      def __call__(self, block_count, block_size, total_size):         total_kb = total_size / 1024.          downloaded_kb = (block_count * block_size) / 1024.         just_downloaded = downloaded_kb - self.last_downloaded_kb         self.last_downloaded_kb = downloaded_kb          predicted_size = block_size/1024.          wait_time = self.bucket.consume(predicted_size)         while wait_time > 0:             time.sleep(wait_time)             wait_time = self.bucket.consume(predicted_size)          now = time.time()         delta = now - self.last_update         if self.last_update != 0:             if delta > 0:                 rate = just_downloaded / delta                 if self.avg_rate is not None:                     rate = 0.9 * self.avg_rate + 0.1 * rate                 self.avg_rate = rate             else:                 rate = self.avg_rate or 0.             print '%20s: %4.1f%%, %5.1f KiB/s, %.1f/%.1f KiB' % (                     self.filename, 100. * downloaded_kb / total_kb,                     rate, downloaded_kb, total_kb,                 )         self.last_update = now   def main():     '''Fetch the contents of urls'''     if len(sys.argv) < 4:         print 'Syntax: %s rate url1 url2 ...' % sys.argv[0]         raise SystemExit(1)     rate_limit  = float(sys.argv[1])     urls = sys.argv[2:]     bucket = TokenBucket(10*rate_limit, rate_limit)      print 'rate limit = %.1f' % (rate_limit,)      threads = []     for url in urls:         path = urlparse.urlparse(url,'http')[2]         filename = os.path.basename(path)         print 'Downloading '%s' to '%s'...' % (url,filename)         rate_limiter = RateLimit(bucket, filename)         t = threading.Thread(             target=urllib.urlretrieve,             args=(url, filename, rate_limiter))         t.start()         threads.append(t)      for t in threads:         t.join()      print 'All downloads finished'  if __name__ == '__main__':     main() 
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Is it possible to easily configure autofac so it will only resolve using non-obsolete
I'm not sure if this is easily possible, but would be really handy if
Is it possible to easily embed ActiveX controls in Java application? Is it worth
Anyone know if it is easily possible to send **kwargs over PyAMF from NetConnection.call()
I am wanting to find out if it is possible to easily debug a
Possible Duplicate: How do I calculate someone's age in C#? Maybe this could be
is it possible to easily and dynamically decorate an object? for example, lets say
Is it possible to easily round a figure up to the nearest 100 (or
OpenOffice ships with HSQLDB. Is it possible to easily import the contents of an
I'll try and explain this as much and as easily as possible. I have

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.