I’ve got an input box that allows UTF8 characters — can I detect whether the characters are in Chinese, Japanese, or Korean programmatically (part of some Unicode range, perhaps)? I would change search methods depending on if MySQL’s fulltext searching would work (it won’t work for CJK characters).
Thanks!
CJK characters are restricted to certain Unicode Blocks. You need to check the characters if they are inside these blocks, and should consider surrogates (32bit characters) too.