I need to do a lot of HTML parsing / scraping / search engine / crawling. There are

Question


Editorial Team

Asked: May 23, 2026 (2026-05-23T02:03:02+00:00)


I need to do a lot of HTML parsing, scraping, search-engine work and crawling.

There are many libraries available, such as Scrapy, Beautiful Soup, lxml, lxml2, requests and pyquery.

I don’t want to try each of these before deciding. I would rather pick one, study it in detail, and then use it for most of my work.

Which library should I choose that can perform all the functions mentioned above? There may be different solutions for different problems, but I want one library that can do everything, even if it takes more time to code, as long as it is possible.

Is it possible to do indexing in lxml? Is PyQuery the same as lxml, or is it different?


1 Answer

Editorial Team · Answer 1 · 2026-05-23T02:03:03+00:00


I’m using Beautiful Soup and am very happy with it. So far it has met all my scraping needs. Two main benefits:

- It’s pretty good at handling imperfect HTML. Since browsers are quite lax, many HTML documents aren’t 100% well-formed.
- In addition to its high-level access APIs, it has low-level APIs that make it extensible when a specific scraping need isn’t covered directly.
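
To make those two points concrete, here is a minimal sketch rather than a definitive recipe. It assumes the `bs4` package is installed and uses Python's built-in `html.parser` backend. The sample markup and the helper names `extract_links` and `is_internal_link` are made up for illustration, and the custom filter function is only one possible reading of "low-level" here.

```python
from bs4 import BeautifulSoup

# Deliberately sloppy markup (hypothetical sample): unclosed <p> and <li>
# tags and a missing </ul>, the kind of HTML browsers tolerate.
BROKEN_HTML = """
<html><body>
<p>First paragraph
<p>Second paragraph with a <a href="/docs">link</a>
<p>An external <a href="https://example.com/">site</a> and an <a>anchor without href</a>
<ul><li>one<li>two
</body></html>
"""


def extract_links(soup):
    """High-level access: the href of every <a> tag that has one."""
    return [a["href"] for a in soup.find_all("a", href=True)]


def is_internal_link(tag):
    """Custom filter: an <a> tag whose href starts with a slash."""
    return tag.name == "a" and tag.get("href", "").startswith("/")


soup = BeautifulSoup(BROKEN_HTML, "html.parser")

all_links = extract_links(soup)
# find_all also accepts a function, for cases the built-in filters don't cover.
internal_links = [a["href"] for a in soup.find_all(is_internal_link)]
list_items = soup.find_all("li")

print(all_links)
print(internal_links)
print(len(list_items))
```

The parser still builds a usable tree from the broken input, so the links and list items can be found. How unclosed tags end up nested depends on the parser backend, so text extracted from such tags may differ between `html.parser`, `lxml` and `html5lib`.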
