now before you prepare to right a speech about the perils of HTML parsing with regex, I already know it. This is more just a curiosity question, than actually wanting to know the question for practical usage.
Basically, given a file of HTML in some random, but perfectly valid format, can you parse out the content of <p> tags using a half-sane number of regular expressions? (and also pretending that <p> tags can not be nested or some other minor limitation)
It’s certainly possible to extract all the text between {insert character sequence 1 here} and {insert character sequence 2 here} with regular expressions, so long as those sequences aren’t overlapping. For example:
Of course, it’s terribly brittle and will break horribly if what you’re running it on is even slightly malformed, or contains either character sequence outside the context where it’s meaningful, or any number of other ways. If you oversimplify the problem, then yes you can get away with an oversimplified solution.