I wish to seek some clarifications on Unicode and str methods in Python. After

Question

Asked: May 27, 2026 (2026-05-27T08:53:21+00:00)

I wish to seek some clarifications on Unicode and str methods in Python. After reading some explanations of Unicode, there are still a couple of doubts I hope folks can help me with:

Am I right to say that when declaring a unicode string, e.g. word = u'foo', Python uses the encoding of the terminal to decode foo (e.g. from UTF-8) and assigns word the hex representation in Unicode?
So, in general, does printing the characters in a file always mean decoding the byte stream, according to its encoding, into the Unicode representation before the mapped characters are displayed?
In my terminal, why does 'é'.lower() or str('é') display as the hex escapes '\xc3\xa9', whereas 'a'.lower() does not?

1 Answer

Editorial Team · Answer 1 · 2026-05-27T08:53:22+00:00

First, we should be clear that we are talking about Python 2 only. Python 3 is different.

You're right. But if you write u"abcd" in a .py file, the encoding declared for the source file determines how the interpreter decodes your string.
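To make the role of the declared encoding concrete, here is a minimal sketch that mimics by hand what the interpreter does with the bytes of a unicode literal. It is not the interpreter's actual code path; the variable names are made up for illustration, and the bytes stand in for what a UTF-8 editor would save for the single character é. The code sticks to ASCII escapes so it also runs on Python 3.

```python
# The two bytes a UTF-8 editor writes to disk for the character e-acute.
# (Hypothetical stand-in for the bytes between the quotes of a u'...' literal.)
literal_bytes = b'\xc3\xa9'

# With "# -*- coding: utf-8 -*-" declared, the bytes decode to ONE code point.
as_utf8 = literal_bytes.decode('utf-8')

# With "# -*- coding: latin-1 -*-" declared, the same bytes decode to TWO
# code points, because latin-1 maps every byte to its own character.
as_latin1 = literal_bytes.decode('latin-1')

print(len(as_utf8))    # 1
print(len(as_latin1))  # 2
print(as_utf8 == u'\xe9')         # True: U+00E9
print(as_latin1 == u'\xc3\xa9')   # True: U+00C3 followed by U+00A9
```

The same bytes on disk therefore become different unicode strings depending on the declaration, which is why the declaration has to match the encoding the editor actually used.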
You need to decode it first, then encode it, and then print. In Python 2, DON'T print unicode directly! Otherwise, if the system encodes it in an incompatible way (like "ascii"), an exception (UnicodeEncodeError) will be raised.
You have to do all of this explicitly.
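A rough sketch of that explicit decode-then-encode round trip is shown here. The helper names `read_text` and `encode_for_output` are hypothetical, the in-memory `io.BytesIO` object stands in for a real file opened in binary mode, and UTF-8 is assumed for both the input and the output. It is written to run on Python 2 and Python 3 alike.

```python
import io

def read_text(stream, encoding='utf-8'):
    """Decode the raw bytes read from a binary stream into unicode text."""
    return stream.read().decode(encoding)

def encode_for_output(text, encoding='utf-8'):
    """Encode unicode text into bytes for a terminal or file of that encoding."""
    return text.encode(encoding)

# Stand-in for a file on disk that contains "cafe" with an accented e, in UTF-8.
source = io.BytesIO(b'caf\xc3\xa9\n')

text = read_text(source)             # bytes -> unicode (decode)
payload = encode_for_output(text)    # unicode -> bytes (encode)

# Encoding to an incompatible codec such as ASCII fails, which is what
# happens implicitly when unicode is printed to an ASCII-only output.
try:
    text.encode('ascii')
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

print(len(text))   # 5: four letters plus the newline
print(ascii_ok)    # False
```

In Python 2, writing `payload` (a str) to the output is the safe step; the failure case in the `try` block is the one to avoid.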
The short answer is that "a" doesn't have to be represented as "\x61"; "a" is simply more readable. A longer answer: typically in the interactive shell, if you type a value and press Enter, Python shows the repr() of your string. I think repr tries to print everything in an ASCII representation. For "a", it's already ASCII, so it's output directly. For the str "é", it's a UTF-8 encoded byte stream, so Python escapes each byte and prints '\xc3\xa9'.
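The repr() behaviour can be checked directly. The snippet below is only an illustration and assumes a UTF-8 terminal, so that typing 'é' in Python 2 produces the two bytes written here as escapes. The `b` prefix is optional in Python 2 and is used so the same code also runs on Python 3, where repr adds a leading `b`.

```python
raw = b'\xc3\xa9'   # what the str 'e-acute' holds in Python 2 on a UTF-8 terminal
plain = b'a'

# lower() leaves these bytes unchanged; the escapes come from repr(),
# which the interactive shell applies to the result, not from lower().
lowered = raw.lower()

shown_raw = repr(lowered)
shown_plain = repr(plain.lower())

print(shown_raw)    # Python 2: '\xc3\xa9'   Python 3: b'\xc3\xa9'
print(shown_plain)  # Python 2: 'a'          Python 3: b'a'
```

Both values go through the same repr() step; only the non-ASCII bytes need escaping to be shown in ASCII.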
