Hi I have Python String as shown below:
<html><table border = 1><tr><td>JDICOM</td><td>Thu Sep 16 10:13:34 CDT 2010</td></tr></html>
From above string I am interested in two words
JDICOM
Thu Sep 16 10:13:34 CDT 2010
I tried find, findall, split but it did not help because of multiple regex.
I am quite new to python. If anyone knows please help.
Statutory Warning: don’t use regular expressions to parse (X)HTML. You are much better off using a parser such as BeautifulSoup.
For e.g.
That said, here is a regular expression to do the same thing. Warning: this will stop working if the markup is irregular.