i have some doubt about PHP based web crawlers,can it run like the java thread based one? i am asking it because, in java the thread can be executed again and again, i dont think, PHP have something like thread function, can you guys please say, which web crawler will be more use full?A PHP Based or A Java Based
Share
Instead of writing your own use one of the following. Btw, Java based web crawlers are preferred. My fav Nutch.
Java based: Nutch, Heritrix, JSpider, JoBo (simple crawler)
PHP based: PHPCrawl