On the Unicode site it’s written that UTF-8 can be represented by 1-4 bytes. As I understand from this question UTF-8 is an 8-bit encoding.
So, what’s the truth?
If it’s an 8-bit encoding, then what’s the difference between ASCII and UTF-8?
If it’s not, then why is it called UTF-8 and why do we need UTF-16 and others if they occupy the same memory?
On the Unicode site it’s written that UTF-8 can be represented by 1-4 bytes.
Share
The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) by Joel Spolsky – Wednesday, October 08, 2003
Excerpt from above: