Is it possible that some UTF-8 character includes the bytes 0x0D 0x0A as part of its encoding? If yes, which characters are these?
(The task I'm trying to solve is reading a UTF-8 text file from a certain point rather than from the very beginning.)
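No. Every byte of a multibyte-encoded code point always has its most significant bit set, so none of them can equal 0x0D or 0x0A.
Bytes with values 0-127 in a UTF-8 stream always represent the corresponding ASCII characters and never appear inside a multibyte sequence.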
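For the use case in the question (starting to read somewhere in the middle of the file), here is a minimal sketch of how you could resynchronize after jumping to an arbitrary byte offset. The helper names `is_continuation`, `next_char_start` and `next_line_start` are made up for this illustration, and the sketch assumes the data is valid UTF-8 that is already in memory as `bytes`.

```python
def is_continuation(byte: int) -> bool:
    """True for UTF-8 continuation bytes, which have the bit pattern 10xxxxxx."""
    return (byte & 0xC0) == 0x80


def next_char_start(data: bytes, offset: int) -> int:
    """Return the first index >= offset that is not a continuation byte."""
    while offset < len(data) and is_continuation(data[offset]):
        offset += 1
    return offset


def next_line_start(data: bytes, offset: int) -> int:
    """Return the index just past the first LF at or after offset.

    A raw 0x0A byte can only be a real line feed, because bytes inside a
    multibyte sequence are always >= 0x80. Returns len(data) if no LF is found.
    """
    newline = data.find(b"\n", offset)
    return len(data) if newline == -1 else newline + 1


# Hypothetical sample data: three CRLF-separated lines with non-ASCII text.
sample = "żółw\r\nκόσμος\r\nend".encode("utf-8")

# Offset 1 lands in the middle of the two-byte encoding of "ż".
char_start = next_char_start(sample, 1)
line_start = next_line_start(sample, 1)
rest = sample[line_start:].decode("utf-8")
```

If you only need whole lines, scanning forward for 0x0A as in `next_line_start` is enough; `next_char_start` is useful when you want to begin decoding at the nearest character boundary instead.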