I need to migrate our website from a proprietary CMS that uses Active Server Pages (ASP). Is there a tool or technique that will help download the resources from the existing site? I guess I’m looking for a tool that will crawl and scrape the entire site.
An additional challenge is that the site uses SSL and is protected with forms-based authentication. I have the necessary credentials and I can grab the cookie that validates the session, but I’m not sure where to go from here, and I don’t want to reinvent the wheel if existing tools can help me.
EDIT – I’m using Windows OS
wget may be a good tool for this: it can download a site recursively over HTTPS and send your session cookie with each request.
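As a rough sketch of what that could look like, assuming you have already copied the session cookie out of your browser: the URL and the cookie name/value below are placeholders (not from your site), and mirror_site is just a hypothetical wrapper name. The syntax is for a POSIX shell (e.g. Cygwin or Git Bash); in cmd.exe the same wget arguments should work typed on a single line.

```shell
# Placeholders (assumptions): replace with your site's URL and the
# session cookie you copied from the browser after logging in.
SITE_URL="https://www.example.com/"
SESSION_COOKIE="ASPSESSIONID=PLACEHOLDER"

# Hypothetical wrapper around a recursive wget download.
mirror_site() {
    # --mirror           recursive download with timestamping
    # --page-requisites  also fetch the images, CSS and scripts pages need
    # --no-parent        never ascend above the starting directory
    # --header           send the session cookie with every request
    wget --mirror --page-requisites --no-parent \
         --header "Cookie: $SESSION_COOKIE" \
         "$SITE_URL"
}
```

Calling mirror_site starts the download into a directory named after the host. If the session times out partway through, you would need to grab a fresh cookie and run it again.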
Add --convert-links if you want the result to be more suitable as a local archive, rather than something you can re-upload somewhere; it rewrites the links in the downloaded pages to point at the local copies.
A Windows version of wget is available from the GnuWin32 project on sourceforge.net:
http://gnuwin32.sourceforge.net/packages/wget.htm