Having an UTF-8 string like this: mystring = işğüı is it possible to get

Question

0

Asked: May 17, 20262026-05-17T02:22:31+00:00 2026-05-17T02:22:31+00:00

Having an UTF-8 string like this: mystring = işğüı is it possible to get

0

Having an UTF-8 string like this:

mystring = "işğüı"

is it possible to get its (in memory) size in Bytes with Python (2.5)?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-17T02:22:31+00:00

Assuming you mean the number of UTF-8 bytes (and not the extra bytes that Python requires to store the object), it’s the same as for the length of any other string. A string literal in Python 2.x is a string of encoded bytes, not Unicode characters.

Byte strings:

>>> mystring = "işğüı"
>>> print "length of {0} is {1}".format(repr(mystring), len(mystring))
length of 'i\xc5\x9f\xc4\x9f\xc3\xbc\xc4\xb1' is 9

Unicode strings:

>>> myunicode = u"işğüı"
>>> print "length of {0} is {1}".format(repr(myunicode), len(myunicode))
length of u'i\u015f\u011f\xfc\u0131' is 5

It’s good practice to maintain all of your strings in Unicode, and only encode when communicating with the outside world. In this case, you could use len(myunicode.encode('utf-8')) to find the size it would be after encoding.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Having an UTF-8 string like this: mystring = işğüı is it possible to get

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply