For globalization of scripts, it is very common to use UTF-8 as the default charset; for example in HTML or default charset of mysql. This is also the case for latin website in which characters are in the class of ISO-8859-1. Isn’t it advantageous to use ISO-8859-1 when UTF-8 characters are not needed. From advantageous, I mean critically beneficial.
My point is that only 0 – 127 characters of UTF-8 are 1 byte, and from 128 – 255 are 2-byte; where ISO-8859-1 is 1 byte system. Doesn’t it play a critical role in database storage?
Most of these 127
UTF-81-byte characters are the most used when you work withISO-8859-1. Let’s have a look here. If you useUTF-8you will need 1 extra byte only when you use one of the 127-255 characters (not so commons I bet).My opinion? Use
UTF-8if you can and if you haven’t problem handling it. The time you save the day you will need some extra characters (or the day you have to translate your content) really worth a few extra bytes here and there in the DB…