What does it mean when iconv_strlen fails on bad character sequences? Specifically, I want to know what a "bad character sequence" is. Thanks
A character sequence is a series of bytes. When using UTF-8, not all combinations of bytes are valid.

The byte sequence \xc2\xbc forms the Unicode character U+00BC, which is the VULGAR FRACTION ONE QUARTER symbol (¼), when using UTF-8 encoding.

The byte sequence \xe2\x88\x9c forms the Unicode character U+221C, which is the FOURTH ROOT symbol (∜), when using UTF-8 encoding.

A bad character sequence for UTF-8 encoding would be any byte combination that doesn't fit into the required schema for UTF-8 byte streams. For example, the byte sequence \xbc\xbc would be illegal because two-byte characters must have 110xxxxx in the first byte, but \xbc is 10111100 written as bits.
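
To see the three byte sequences above in action, here is a minimal sketch. It uses Python's built-in strict UTF-8 decoder for illustration, not the function from the question, and the helper names `describe` and `is_two_byte_lead` are made up for this example. It decodes the two valid sequences, shows that \xbc\xbc is rejected, and prints the bit patterns that explain why.

```python
def describe(raw: bytes) -> str:
    """Return the code points in `raw`, or a note that it is not valid UTF-8."""
    try:
        text = raw.decode("utf-8")  # strict mode: raises on malformed input
    except UnicodeDecodeError as err:
        return f"invalid UTF-8 at byte offset {err.start}: {err.reason}"
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


def is_two_byte_lead(byte: int) -> bool:
    """True if `byte` has the 110xxxxx bit pattern of a two-byte lead byte.

    This only checks the bit pattern; a full validator has more rules.
    """
    return (byte & 0b11100000) == 0b11000000


print(describe(b"\xc2\xbc"))      # U+00BC
print(describe(b"\xe2\x88\x9c"))  # U+221C
print(describe(b"\xbc\xbc"))      # reports invalid UTF-8 at byte offset 0

# Bit patterns of the first byte in the valid and invalid two-byte examples.
print(format(0xC2, "08b"), is_two_byte_lead(0xC2))  # 11000010 True
print(format(0xBC, "08b"), is_two_byte_lead(0xBC))  # 10111100 False
```

A function that counts characters rather than bytes has to decode the input first, so a sequence like \xbc\xbc gives it nothing valid to count.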