Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 245071
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 11, 20262026-05-11T21:03:30+00:00 2026-05-11T21:03:30+00:00

Given that Unicode has been around for 18 years , why are there still

  • 0

Given that Unicode has been around for 18 years, why are there still apps that don’t have Unicode support? Even my experiences with some operating systems and Unicode have been painful to say the least. As Joel Spolsky pointed out in 2003, it’s not that hard. So what’s the deal? Why can’t we get it together?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-11T21:03:30+00:00Added an answer on May 11, 2026 at 9:03 pm

    Start with a few questions

    How often…

    • do you need to write an application that deals with something else than ascii?
    • do you need to write a multi-language application?
    • do you write an application that has to be multi-language from its first version?
    • have you heard that Unicode is used to represent non-ascii characters?
    • have you read that Unicode is a charset? That Unicode is an encoding?
    • do you see people confusing UTF-8 encoded bytestrings and Unicode data?

    Do you know the difference between a collation and an encoding?

    Where did you first heard of Unicode?

    • At school? (really?)
    • at work?
    • on a trendy blog?

    Have you ever, in your young days, experienced moving source files from a system in locale A to a system in locale B, edited a typo on system B, saved the files, b0rking all the non-ascii comments and… ending up wasting a lot of time trying to understand what happened? (did your editor mix things up? the compiler? the system? the… ?)

    Did you end up deciding that never again you will comment your code using non-ascii characters?

    Have a look at what’s being done elsewhere

    Python

    Did I mention on SO that I love Python? No? Well I love Python.

    But until Python3.0, its Unicode support sucked. And there were all those rookie programmers, who at that time knew barely how to write a loop, getting UnicodeDecodeError and UnicodeEncodeError from nowhere when trying to deal with non-ascii characters. Well they basically got life-traumatized by the Unicode monster, and I know a lot of very efficient/experienced Python coders that are still frightened today about the idea of having to deal with Unicode data.

    And with Python3, there is a clear separation between Unicode & bytestrings, but… look at how much trouble it is to port an application from Python 2.x to Python 3.x if you previously did not care much about the separation/if you don’t really understand what Unicode is.

    Databases, PHP

    Do you know a popular commercial website that stores its international text as Unicode?

    You will (perhaps) be surprised to learn that Wikipedia backend does not store its data using Unicode. All text is encoded in UTF-8 and is stored as binary data in the Database.

    One key issue here is how to sort text data if you store it as Unicode codepoints. Here comes the Unicode collations, which define a sorting order on Unicode codepoints. But proper support for collations in Databases is missing/is in active development. (There are probably a lot of performance issues, too. — IANADBA) Also, there is no widely-accepted standard for collations yet: for some languages, people don’t agree on how words/letters/wordgroups should be sorted.

    Have you heard of Unicode normalization? (Basically, you should convert your Unicode data to a canonical representation before storing it) Of course it’s critical for Database storage, or local comparisons. But PHP for example only provides support for normalization since 5.2.4 which came out in August 2007.

    And in fact, PHP does not completely supports Unicode yet. We’ll have to wait PHP6 to get Unicode-compatible functions everywhere.

    So, why isn’t everything we do in Unicode?

    1. Some people don’t need Unicode.
    2. Some people don’t care.
    3. Some people don’t understand that they will need Unicode support later.
    4. Some people don’t understand Unicode.
    5. For some others, Unicode is a bit like accessibility for webapps: you start without, and will add support for it later
    6. A lot of popular libraries/languages/applications lack proper, complete Unicode support, not to mention collation & normalization issues. And until all items in your development stack completely support Unicode, you can’t write a clean Unicode application.

    The Internet clearly helps spreading the Unicode trend. And it’s a good thing. Initiatives like Python3 breaking changes help educating people about the issue. But we will have to wait patiently a bit more to see Unicode everywhere and new programmers instinctively using Unicode instead of Strings where it matters.

    For the anecdote, because FedEx does not apparently support international addresses, the Google Summer of Code ’09 students all got asked by Google to provide an ascii-only name and address for shipping. If you think that most business actors understand stakes behind Unicode support, you are just wrong. FedEx does not understand, and their clients do not really care. Yet.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Ask A Question

Stats

  • Questions 192k
  • Answers 192k
  • Best Answers 0
  • User 1
  • Popular
  • Answers
  • Editorial Team

    How to approach applying for a job at a company ...

    • 7 Answers
  • Editorial Team

    What is a programmer’s life like?

    • 5 Answers
  • Editorial Team

    How to handle personal stress caused by utterly incompetent and ...

    • 5 Answers
  • Editorial Team
    Editorial Team added an answer You can use the "for" loop to do this. something… May 12, 2026 at 6:20 pm
  • Editorial Team
    Editorial Team added an answer Piping means sending the output of a program as input… May 12, 2026 at 6:20 pm
  • Editorial Team
    Editorial Team added an answer Lambda, which originated from Lambda Calculus and (AFAIK) was first… May 12, 2026 at 6:20 pm

Related Questions

I've never been sure that I understand the difference between str/unicode decode and encode.
I've been pulling my hear out over this problem for a few hours yesterday:
I've a method that I want to be able to accept either a single
I need to convert between python objects and c strings of various encodings. Going

Trending Tags

analytics british company computer developers django employee employer english facebook french google interview javascript language life php programmer programs salary

Top Members

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.