I made a script that crawls through a domain, and I want it to determine if there were any unauthorized modifications. For static pages I can simply compare to a pre-set hash value, but for dynamic-length pages, what’s a good way to check if any significant changes were made?
Sorry if it sounds dumb.
You could download the pages locally, then use
diffto check the length of the answer. Eg,which will give you a summary of the length of the changes.