So I parsed a html page with .findAll (BeautifulSoup) to variable named result .

Question

0

Asked: May 31, 20262026-05-31T19:29:59+00:00 2026-05-31T19:29:59+00:00

So I parsed a html page with .findAll (BeautifulSoup) to variable named result .

0

So I parsed a html page with .findAll (BeautifulSoup) to variable named result.
If I type result in Python shell then press Enter, I see normal text as expected, but as I wanted to postprocess this result as string object, I noticed that str(result) returns garbage, like this sample:

\xd1\x87\xd0\xb8\xd0\xbb\xd0\xbd\xd0\xb8\xd1\x86\xd0\xb0</a><br />\n<hr />\n</div>

Html page source is utf-8 encoded

How can I handle this?

Code is basically this, in case it matters:

from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup(urllib.open(url).read())
result = soup.findAll(something)

Python is 2.7

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-31T19:30:00+00:00

Editorial Team

2026-05-31T19:30:00+00:00Added an answer on May 31, 2026 at 7:30 pm

Python 2.6.7
BeautifulSoup.version 3.2.0

This worked for me:

unicode.join(u'\n',map(unicode,result))

I’m pretty sure a result is a BeautifulSoup.ResultSet object, which seems to be an extension of the standard python list

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

So I parsed a html page with .findAll (BeautifulSoup) to variable named result .

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply