I am bulding my own parser which was not using the search engine API and only returning the URL’s with the search results in the bing environment, using python.
I created the usual HTMLParser class and in the main i created the class instance
parser = MyHTMLParser()
parser.feed(content)
This works for single calls but if I change the content and put this in a loop the feed does not work.
Any help would be appreciated.
Before you feed the parser instance a new document you should reset it.
E.g.
parser.reset()See: http://docs.python.org/library/htmlparser.html