I'd recommend you to have a look at the NetBeans…

Question

0

Asked: May 13, 20262026-05-13T17:19:21+00:00 2026-05-13T17:19:21+00:00

If I am creating a simple web scraper (from root url, grab all links,

0

If I am creating a simple web scraper (from root url, grab all links, then from those links grab all emails) would it be worthwhile to use HTML Agility Pack? I am not actually looking through HTML tags, I am simply looking to scan for emails within the entire document.

Would it be more efficient to use HTML agility pack?

I am stripping them strictly because it is necessary I have these emails, and there are about 100 links. Only about 500 emails will be scraped. No worries, I’m keeping ethics in mind here.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-13T17:19:21+00:00

Editorial Team

2026-05-13T17:19:21+00:00Added an answer on May 13, 2026 at 5:19 pm

There are many question on SO about this – most of the ones I read say – don’t use regular expressions for web scraping.

On the other hand – if all you want is text parsing regardless of the HTML nature of the text (which you do if I understand you correctly), it may be better to use regular expressions.

0

Reply
Share
Share

- Report

How to approach applying for a job at a company ...

How to handle personal stress caused by utterly incompetent and ...

What is a programmer’s life like?

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions