Give permission to the "Approved" workflow state.

Question

Editorial Team

Asked: May 15, 2026

I currently have a spider written in Java (using HtmlUnit) that logs into a supplier website and crawls it.

It keeps the session (cookies) and even lets me enable or disable JavaScript.

I also use htmlparser (Java) to parse the HTML and extract the relevant information.

Does Python have something similar?

Editorial Team · Answer 1 · 2026-05-15T16:37:55+00:00

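Python has urllib2 (urllib.request in Python 3) for fetching pages. It supports password authentication, and it handles cookies when combined with cookielib (http.cookiejar in Python 3).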
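
As a rough sketch of how this might look in Python 3, where urllib2 and cookielib live on as urllib.request and http.cookiejar: the helper below builds an opener that keeps cookies between requests and answers HTTP Basic auth challenges. The function name, the URL, and the credentials are placeholders for illustration, not anything from the question. It also assumes the site uses HTTP Basic auth; a form-based login would instead need the credentials POSTed to the site's login URL through the same opener.

```python
import http.cookiejar
import urllib.request


def build_session_opener(base_url, username, password):
    """Return (opener, cookie_jar) for a cookie-aware, authenticated session.

    Hypothetical helper: base_url, username and password are placeholders.
    """
    # The jar stores cookies set by the server so later requests
    # made through the same opener send them back.
    cookie_jar = http.cookiejar.CookieJar()

    # Register credentials for every realm under base_url (realm=None).
    password_mgr = urllib.request.HTTPPasswordMgrWithDefaultRealm()
    password_mgr.add_password(None, base_url, username, password)

    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(cookie_jar),
        urllib.request.HTTPBasicAuthHandler(password_mgr),
    )
    return opener, cookie_jar
```

Calling opener.open(url) for each page then reuses the same cookies for the whole crawl, which is roughly what the Java spider's session handling does.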

There is also the HTMLParser module for parsing HTML, but some people prefer the more full-featured BeautifulSoup.
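
For illustration, here is a minimal sketch of the standard-library parser (html.parser.HTMLParser in Python 3) pulling link targets out of a page. The class and function names are made up for this example; BeautifulSoup would do the same job with less code.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Hypothetical helper: return all link targets found in an HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

A spider would feed each fetched page to extract_links and queue the results for crawling.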