I want to match (with a Java regex) all URLs belonging to a certain domain, except the ones that contain a query string.
For example, I want to match
http://www.thehindu.com/arts/music/marrying-keys-to-chips/article4061904.ece
but not
http://www.thehindu.com/arts/music?article=23417
I tried the following, but it accepts both of the URLs above:
+^http://www\.thehindu\.com([^\?=])*
Try this:
Here (^|\s) and (\s|$) are the delimiters you expect between URLs. Add more alternatives to them if you need.
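One likely reason the pattern in the question accepts both URLs is that it has no end anchor: with find-style matching, the prefix up to the `?` already counts as a match. As a minimal sketch of the delimiter idea, assuming the URLs are either checked one at a time or separated by whitespace inside a larger text, something like the following could work. The class and method names (`UrlFilterSketch`, `isPathOnlyUrl`, `findPathOnlyUrls`) are made up for illustration, and the trailing delimiter is written as a lookahead `(?=\s|$)` rather than a consuming `(\s|$)` so that two URLs separated by a single space can both be found.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class, not part of any library.
class UrlFilterSketch {

    // Single URL: the end anchor ($) forces the character class to cover the
    // whole rest of the string, so any '?' or '=' makes the match fail.
    static final Pattern SINGLE_URL =
            Pattern.compile("^http://www\\.thehindu\\.com[^\\s?=]*$");

    // URL inside a longer text: must start at the beginning or after whitespace,
    // and must be followed by whitespace or the end of the input.
    static final Pattern URL_IN_TEXT =
            Pattern.compile("(?:^|\\s)(http://www\\.thehindu\\.com[^\\s?=]*)(?=\\s|$)");

    // Returns true if the string is a URL on the domain without '?' or '='.
    static boolean isPathOnlyUrl(String url) {
        return SINGLE_URL.matcher(url).find();
    }

    // Collects every whitespace-delimited URL on the domain that has no query string.
    static List<String> findPathOnlyUrls(String text) {
        List<String> urls = new ArrayList<>();
        Matcher m = URL_IN_TEXT.matcher(text);
        while (m.find()) {
            urls.add(m.group(1)); // group 1 is the URL without the leading delimiter
        }
        return urls;
    }

    public static void main(String[] args) {
        String wanted = "http://www.thehindu.com/arts/music/marrying-keys-to-chips/article4061904.ece";
        String unwanted = "http://www.thehindu.com/arts/music?article=23417";

        System.out.println(isPathOnlyUrl(wanted));
        System.out.println(isPathOnlyUrl(unwanted));
        System.out.println(findPathOnlyUrls(wanted + " " + unwanted));
    }
}
```

If the URLs can be separated by other characters (quotes, commas, angle brackets), those would need to be added both to the delimiter groups and to the negated character class.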