My python env is 2.7
I know this is an old question, but I’ve lost my mind while I was searching and reading other people’s questions and answers. Some of them is really out of date. Like the code below:
import lxml #wrong
import xml #correct
So, since I’m a newbie to python and know nothing whatsoever in the great python history, I wanna make things more clear to me. Such as, what is the so-called standard xml-parser module in python now? what can I do when I need parse some HTML by using the xpath syntax. If I have a mal-formed HTML source code, how can handle it by not using BeautifulSoup or something else like. If u can brief me with something, I’ll be much appreciated.
OK, all in all, I just got one question. How can I parse mal-formed html code by using standard python module with python2.7?
Read the python library documentation if you need to stick to the standard library.
If you don’t, definitely look at lxml, which does much more.