What should I do when I see some IP in my logs scrolling through 100s of pages on my site? I have a wordpress blog, and it seems like this isn’t a real person. This happens almost daily with different IPs.
UPDATE: Oh, i forgot to mention, I’m pretty sure it’s not a search engine spider. The hostname is not a searchengine, but some random person from india (ends in ‘.in’).
What I am concerned with, is if it is a scraper, is there anything I can do? Or could it possibly be something worse than a scraper e.g. hacker?
It’s a spider/crawler. Search engines use these to compile their listings, researchers use them to figure out the structure of the internet, the Internet Archive uses them to download the contents of the Internet for future generations, spammers use them to search for e-mail addresses, and many more such situations.
Checking out the user agent string in your logs may give you more information on what they’re doing. Well-behaved bots will generally indicate who/what they are – Google’s search bots, for example, are called Googlebot.