I am writing a script to read a web page, and build a database


Asked: May 22, 2026


I am writing a script to read a web page and build a database of links that match certain criteria. Right now I am stuck with lxml and understanding how to grab all the <a href> values from the HTML…

result = self._openurl(self.mainurl)
content = result.read()
html = lxml.html.fromstring(content)
print lxml.html.find_rel_links(html,'href')


1 Answer

Editorial Team · Answer 1 · 2026-05-22T15:49:33+00:00


Use XPath. Something like (can’t test from here):

urls = html.xpath('//a/@href')
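For context, lxml's find_rel_links() matches on the value of the rel attribute (for example rel="nofollow"), so passing 'href' to it will not return link targets. A slightly fuller sketch of the XPath approach might look like the following. The sample markup, the extract_links helper, and the "/docs/" filter are all made up for illustration; swap in the page content and whatever criteria the script actually needs.

```python
import lxml.html

# Hypothetical sample markup standing in for the fetched page content.
SAMPLE = """
<html><body>
  <a href="/docs/index.html">Docs</a>
  <a href="https://example.com/about">About</a>
  <a name="anchor-without-href">No link</a>
  <a href="/docs/faq.html">FAQ</a>
</body></html>
"""


def extract_links(content, base_url=None, keep=None):
    """Return the href values of all <a> elements, optionally filtered.

    extract_links is an illustrative helper, not part of lxml.
    """
    doc = lxml.html.fromstring(content)
    if base_url is not None:
        # Rewrite relative hrefs into absolute URLs before collecting them.
        doc.make_links_absolute(base_url)
    # '//a/@href' selects the href attribute of every <a> that has one;
    # anchors without an href are skipped automatically.
    urls = [str(u) for u in doc.xpath('//a/@href')]
    if keep is not None:
        urls = [u for u in urls if keep(u)]
    return urls


# Example criterion (an assumption): keep only links under /docs/.
links = extract_links(SAMPLE, base_url="https://example.com/",
                      keep=lambda u: "/docs/" in u)
print(links)
```

In the original script, content would take the place of SAMPLE and self.mainurl could be passed as base_url. The keep callback is where the "certain criteria" would go before writing the results to the database.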
