Is there any way using urllib, urllib2 or BeautifulSoup to extract HTML tag attributes?

Question


Editorial Team

Asked: May 24, 2026


Is there any way to extract HTML tag attributes using urllib, urllib2 or BeautifulSoup?

For example, given:

<a href="xyz" title="xyz">xyz</a>

I would like to get href=xyz, title=xyz.

There is another thread that discusses using regular expressions for this.

Thanks


1 Answer

Editorial Team · Answer 1 · 2026-05-24T23:51:59+00:00


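You could use BeautifulSoup to parse the HTML and, for each <a> tag, read its attributes from tag.attrs. The session below uses BeautifulSoup 3 on Python 2, where tag.attrs is a list of (name, value) tuples: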

In [111]: soup = BeautifulSoup.BeautifulSoup('<a href="xyz" title="xyz">xyz</a>')

In [112]: [tag.attrs for tag in soup.findAll('a')]
Out[112]: [[(u'href', u'xyz'), (u'title', u'xyz')]]
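
If BeautifulSoup is not available, the same extraction can be sketched with only the Python 3 standard library. This is a swapped-in technique (html.parser instead of BeautifulSoup), not the approach shown in the session above. The names AttrCollector and extract_attrs are hypothetical, chosen for illustration.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Collect the attributes of every tag with a given name (hypothetical helper)."""

    def __init__(self, tag_name):
        super().__init__()
        self.tag_name = tag_name
        self.found = []  # one dict of attributes per matching tag

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names and passes attrs as a list
        # of (name, value) tuples; store them as a dict per tag.
        if tag == self.tag_name:
            self.found.append(dict(attrs))


def extract_attrs(html, tag_name="a"):
    """Return a list of attribute dicts, one per <tag_name> element in html."""
    parser = AttrCollector(tag_name)
    parser.feed(html)
    parser.close()
    return parser.found


print(extract_attrs('<a href="xyz" title="xyz">xyz</a>'))
# prints [{'href': 'xyz', 'title': 'xyz'}]
```

Note that with the newer bs4 package (from bs4 import BeautifulSoup), tag.attrs is a dict rather than a list of tuples, so the result has the same shape as the output of this sketch.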
