I’m new to Python and I have been trying to search through html with regular expressions that has been parsed with BeautifulSoup. I haven’t had any success and I think the reason is that I don’t completely understand how to set up the regular expressions properly. I’ve looked at older questions about similar problems but I still haven’t figured it out. If somebody could extract the “/torrent/32726/0/” and “Slackware Linux 13.0 [x86 DVD ISO]” as well as a detailed expression of how the regular expression works, it would be really helpful.
<td class="name">
<a href="/torrent/32726/0/">
Slackware Linux 13.0 [x86 DVD ISO]
</a>
</td>
Edit: What I meant to say is, I am trying to extract “/torrent/32726/0/” and “Slackware Linux 13.0 [x86 DVD ISO]” using BeautifulSoups functions to search the parse tree. I’ve been trying various things after searching and reading the documentation, but I’m still not sure on how to go about it.
BeautifulSoup could also extract node values from your html.
Output: