I am trying to scrape the website here: ftp://ftp.sec.gov/edgar/daily-index/ . Using the code as

Question

0

Asked: June 7, 20262026-06-07T10:25:49+00:00 2026-06-07T10:25:49+00:00

I am trying to scrape the website here: ftp://ftp.sec.gov/edgar/daily-index/ . Using the code as

0

I am trying to scrape the website here: ftp://ftp.sec.gov/edgar/daily-index/. Using the code as shown below:

from bs4 import BeautifulSoup  
import urllib.request
html = urllib.request.urlopen("ftp://ftp.sec.gov/edgar/daily-index/")
soup = BeautifulSoup(line, "lxml")
soup.a # or soup.find_all('a') neither of them works
#return None.

Please help, I am really frustrated by this. My suspicion is that the tag is causing the problem. The site’s Html looks well formated (matched tags), so I am lost as to why BeautifulSoup doesn’t find anything. Thanks

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-07T10:25:51+00:00

Editorial Team

2026-06-07T10:25:51+00:00Added an answer on June 7, 2026 at 10:25 am

The ftp://ftp.sec.gov/edgar/daily-index/ URL leads to a FTP directory, not an HTML page.

Your browser could generate HTML based on the FTP directory contents, but the server does not send you HTML when you load that resource with urllib.request.

You probably want to use the ftplib module directly instead to read the directory listing, or inspect the return value of urlopen(...).read() first.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am trying to scrape the website here: ftp://ftp.sec.gov/edgar/daily-index/ . Using the code as

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply