I have heard that PHP6 will natively support unicode, which will hopefully make multi-language support much easier. However, PHP5 has pretty weak support for unicode and multi-language (i.e. just a bunch of specialized string functions).
I was wondering what are your strategies to enable unicode and multi-languaage support in your PHP5 applications?
Also, how do you store translations since PHP5 doesn’t have WebResource file like ASP.NET does?
It’s not all that hard really, but you may want to make your question a bit more specific.
If you’re talking to a database, make sure your database stores data in UTF-8 and the connection to your database is in UTF-8 (a common pitfall). Make sure to run this when establishing a connection:
For user input, set the
accept-charsetattribute on your forms.Serve your sites with an appropriate HTTP header:
or at least set appropriate meta tags for your site:
Keep your source code files encoded in UTF-8.
If you keep everything in UTF-8, you usually don’t need to worry about anything. It’s only getting problematic once you start mixing encodings throughout your app.
If you’re starting to talk about string manipulation of course, you’ll have to take a little more care. Mostly you’ll want to use the
mb_set of string functions, as you pointed out yourself.