I’m not sure if this is possible and lxml documentation is not very good

Question

0

Editorial Team

Asked: May 26, 20262026-05-26T05:17:02+00:00 2026-05-26T05:17:02+00:00

I’m not sure if this is possible and lxml documentation is not very good

0

I’m not sure if this is possible and lxml documentation is not very good to me.

Can I for example use something like:

import lxml.html as lx
x = lx.parse('http://web.info/page.html')
y = x.xpath('\\something\interesting'[2])

or similar, so that I don’t download whole page?

If not with lxml is there some Python module that can do this?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-26T05:17:02+00:00

No: lxml has to parse the whole page before it can be guaranteed to find an individual bit of it, and to parse it the whole page, it obviously has to download the whole page. (But see also unutbu’s answer for a potential partial downloading/parsing approach.)

And although I believe one can make HTTP requests for part of a file (I think via the range header?), that’s not guaranteed to be supported on the server side.

It’s a shame that HTTP doesn’t include a method for sending an XPath query to the server along with the page request, and have the results of running that query on the page sent back.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m not sure if this is possible and lxml documentation is not very good

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply