We are migrating web servers, and it would be nice to have an automated way to check some of the basic site structure to see if the rendered pages are the same on the new server as the old server. I was just wondering if anyone knew of anything to assist in this task?
Get the formatted output of both sites (here we use w3m, but lynx can also work):
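As a rough sketch of this step, the dump can be scripted so the same call runs against both servers. It assumes w3m (or lynx) is installed and on the PATH. The helper names `dump_command` and `rendered_text` and the example hostnames are made up for illustration and are not part of the original answer.

```python
import subprocess


def dump_command(url, browser="w3m"):
    """Build the argv for a plain-text dump; w3m and lynx both accept -dump."""
    return [browser, "-dump", url]


def rendered_text(url, browser="w3m"):
    """Return the page as rendered text by running the text-mode browser."""
    result = subprocess.run(
        dump_command(url, browser),
        capture_output=True,  # collect stdout instead of printing it
        text=True,            # decode the output to str
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    return result.stdout


# Hypothetical usage (placeholder hostnames, needs network access):
# old_page = rendered_text("http://old.example.com/")
# new_page = rendered_text("http://new.example.com/", browser="lynx")
```

Dumping rendered text rather than raw HTML means markup-only differences between the two servers mostly drop out of the comparison.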
Then run wdiff on the two dumps; it can report what percentage of the words the two texts have in common.
The differences can also be easier to see with colordiff.
Excerpt of output:
(it actually served google.com in French… funny)
The "common" percentages show how similar the two texts are. You can also see the differences word by word, rather than line by line, which can get cluttered.
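To make the idea of a word-level "common" percentage concrete, here is a minimal Python sketch built on the standard-library `difflib`. It is a stand-in for illustration, not wdiff itself, and its numbers will not necessarily match wdiff's statistics. The function names are hypothetical; the `[-…-]` and `{+…+}` markers mimic wdiff's default way of showing deleted and inserted words.

```python
from difflib import SequenceMatcher


def common_word_percent(old_text, new_text):
    """Return (old_pct, new_pct): the share of each text's words found in both."""
    old_words = old_text.split()
    new_words = new_text.split()
    matcher = SequenceMatcher(a=old_words, b=new_words, autojunk=False)
    # Total number of words covered by matching runs.
    common = sum(block.size for block in matcher.get_matching_blocks())
    old_pct = 100.0 * common / len(old_words) if old_words else 100.0
    new_pct = 100.0 * common / len(new_words) if new_words else 100.0
    return old_pct, new_pct


def word_diff(old_text, new_text):
    """Render a word-by-word diff using wdiff-style [-old-] {+new+} markers."""
    old_words = old_text.split()
    new_words = new_text.split()
    matcher = SequenceMatcher(a=old_words, b=new_words, autojunk=False)
    pieces = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            pieces.extend(old_words[i1:i2])
            continue
        if i1 != i2:  # words present only in the old text
            pieces.append("[-" + " ".join(old_words[i1:i2]) + "-]")
        if j1 != j2:  # words present only in the new text
            pieces.append("{+" + " ".join(new_words[j1:j2]) + "+}")
    return " ".join(pieces)
```

For a migration check, a script could flag any page whose percentages fall below a chosen threshold and print the `word_diff` output only for those pages.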