Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7911271
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 3, 20262026-06-03T13:10:47+00:00 2026-06-03T13:10:47+00:00

How can I download multiple links simultaneously? My script below works but only downloads

  • 0

How can I download multiple links simultaneously? My script below works but only downloads one at a time and it is extremely slow. I can’t figure out how to incorporate multithreading in my script.

The Python script:

from BeautifulSoup import BeautifulSoup
import lxml.html as html
import urlparse
import os, sys
import urllib2
import re

print ("downloading and parsing Bibles...")
root = html.parse(open('links.html'))
for link in root.findall('//a'):
  url = link.get('href')
  name = urlparse.urlparse(url).path.split('/')[-1]
  dirname = urlparse.urlparse(url).path.split('.')[-1]
  f = urllib2.urlopen(url)
  s = f.read()
  if (os.path.isdir(dirname) == 0): 
    os.mkdir(dirname)
  soup = BeautifulSoup(s)
  articleTag = soup.html.body.article
  converted = str(articleTag)
  full_path = os.path.join(dirname, name)
  open(full_path, 'w').write(converted)
  print(name)

The HTML file called links.html:

<a href="http://www.youversion.com/bible/gen.1.nmv-fas">http://www.youversion.com/bible/gen.1.nmv-fas</a>

<a href="http://www.youversion.com/bible/gen.2.nmv-fas">http://www.youversion.com/bible/gen.2.nmv-fas</a>

<a href="http://www.youversion.com/bible/gen.3.nmv-fas">http://www.youversion.com/bible/gen.3.nmv-fas</a>

<a href="http://www.youversion.com/bible/gen.4.nmv-fas">http://www.youversion.com/bible/gen.4.nmv-fas</a>
  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-03T13:10:49+00:00Added an answer on June 3, 2026 at 1:10 pm

    It looks to me like the consumer – producer problem – see wikipedia

    You may use

    import Queue, thread
    
    # create a Queue.Queue here
    queue = Queue.Queue()
    
    print ("downloading and parsing Bibles...")
    root = html.parse(open('links.html'))
    for link in root.findall('//a'):
      url = link.get('href')
      queue.put(url) # produce
    
    
    
    
    def thrad():
      url = queue.get() # consume
      name = urlparse.urlparse(url).path.split('/')[-1]
      dirname = urlparse.urlparse(url).path.split('.')[-1]
      f = urllib2.urlopen(url)
      s = f.read()
      if (os.path.isdir(dirname) == 0): 
        os.mkdir(dirname)
      soup = BeautifulSoup(s)
      articleTag = soup.html.body.article
      converted = str(articleTag)
      full_path = os.path.join(dirname, name)
      open(full_path, 'wb').write(converted)
      print(name)
    
    thread.start_new(thrad, ()) # start 1 threads
    
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm trying to write a bash script that will let me download multiple web
If I download multiple JQuery themes how can I give the users of my
How can I download multiple files as a zip-file using php?
I have a program which needs to download multiple files at once. I can
I can download a file from S3 using either of the following methods. s3cmd
I know that: browsers can download two components in parallel per hostname example www.example.com
Can anybody help me, from where I can download LevelDB for Java-Windows because I
is there a dictionary i can download for java? i want to have a
We're writing a flash application that can download a MP3 file, convert it to
I want to ask if I can download files from the blobstore in google

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.