I have a string which is in unicode. In the string, I am replacing

Question

0

Asked: June 12, 20262026-06-12T10:21:50+00:00 2026-06-12T10:21:50+00:00

I have a string which is in unicode. In the string, I am replacing

0

I have a string which is in unicode. In the string, I am replacing the ‘\r’ character with ‘<\p>’ in the string, and passing that to BeautifulSoup for parsing.

If I print after the string after the replacement, I see that the replacement went fine.
But when I pass the string to BeautifulSoup it treats it like < and >. Why is that ?

I seems to have to do with encoding, but not sure what.

replacing the string

fileString.encode('utf-8')  
fileString = re.sub('\r', "/<\p>", fileString)  
fileString.encode('utf-8')

htmlTag = BeautifulSoup(fileString, from_encoding='utf-8')

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-12T10:21:51+00:00

Editorial Team

2026-06-12T10:21:51+00:00Added an answer on June 12, 2026 at 10:21 am

<\p> isn’t a closing tag. The slash goes the other way:

fileString = fileString.replace('\r', '</p>')

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a string which is in unicode. In the string, I am replacing

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply