I have the following problem. I need to mirror password protected site. Sounds like simple task:
wget -m -k -K -E --cookies=on --keep-session-cookies --load-cookies=myCookies.txt http://mysite.com
in myCookies.txt I am keeping proper session cookie. This works until wget come accross logout page – then session is invalidated and, effectively, further mirroring is usless.
W tried to add --reject option, but it works only with file types – I can block only html file download or swf file download, I can’t say
--reject http://mysite.com/*.php?type=Logout*
Any ideas how to skip certain URLs in wget? Maybe there is other tool that can do the job (must work on MS Windows).
What if you first download (or even just
touch) the logout page, and thenThis should skip the logout page, as it has already been downloaded
(Disclaimer: I didn’t try this myself)