I am trying to parse webpages to find links to special pages
for instance if we had the below as input
flowers that never end.')" onmouseout="return nd();" href="/flowers/images/download/01d6ac.html"><img src="http://static.rarbg.com/over/01d6acc21110e68af7476bce50dec3c234343032.jpg" border="0
and on an other page had :
flowers that never end')" onmouseout="return nd();" href="/flowers/01d6acc21110e68af7476bce50dec3c234343032.html" src="http://static.rarbg.com/over/01d6acc21110e68af7476bce50dec3c234343032.jpg" border="0
I tried to use the below re to pick up the link:
'href="/flowers/(.+?)"[^>]
but it is still picking up the link from both inputs not just the second one!
can anyone help me?
If for some reason you have to use regex, better use this expression:
However your suffering will continue until you use a parser as you can read in the comments.