Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6350635
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 24, 20262026-05-24T21:50:57+00:00 2026-05-24T21:50:57+00:00

The Wikipedia entry for Subversion contains a paragraph about problems with different ways of

  • 0

The Wikipedia entry for Subversion contains a paragraph about problems with different ways of Unicode encoding:

While Subversion stores filenames as Unicode, it does not specify if
precomposition or decomposition is used for certain accented
characters (such as é). Thus, files added in SVN clients running on
some operating systems (such as OS X) use decomposition encoding,
while clients running on other operating systems (such as Linux) use
precomposition encoding, with the consequence that those accented
characters do not display correctly if the local SVN client is not
using the same encoding as the client used to add the files

While this describes a specific problem with Subversion client implementations, I am not sure if the underlying Unicode composition problem could also appear with regular Delphi applications. I guess that the problem can only arise if Delphi applications are able to use both Unicode encoding ways (maybe in Delphi XE2). If yes, what could Delphi developers do to avoid it?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-24T21:50:58+00:00Added an answer on May 24, 2026 at 9:50 pm

    There is a minor display issue in that many fonts used on Windows won’t render the decomposed form in the ideal way, by using the combined glyph for both the letter and the diacritical. Instead it falls back to rendering the letter and than overlaying the standalone diacritical mark on top, which typically results in a less visually pleasing, potentially-lopsided grapheme.

    However that is not the issue the Subversion bug referenced from wiki is talking about. It’s actually completely fine to check in filenames to SVN that contain composed or decomposed character sequences; SVN neither knows nor cares about composition, it just uses the Unicode code points as-is. As long as the backend filesystem leaves filenames in the same state as they were put in, all is fine.

    Windows and Linux both have filesystems that are equally blind to composition. Mac OS X, unfortunately, does not. Both HFS+ and UFS filesystems perform ‘normalisation’ to decomposed form before storing an incoming filename, so the filename you get back won’t necessarily be the same sequence of Unicode code points you put in.

    It is this [IMO: insane] behaviour that confuses SVN—and many other programs—when being run on OS X. It’s particularly likely to bite because Apple happened to choose decomposed (NFD) as their normalisation form, whereas most of the rest of the world uses composed (NFC) characters.

    (And it’s not even real NFD, but an incompatible Apple-only variant. Joy.)

    The best way to cope with this is, if you can, is never to rely on the exact filename something’s stored under. If you only ever read a file from a given name, that’s fine, as it’ll be normalised to match the filesystem at the time. But if you’re reading a directory listing and trying to match filenames you find in there to what you expected the filename to be—which is what Subversion is doing—you’re going to get mismatches.

    To do a filename match reliably you would have to detect that you’re running on OS X, and manually normalise both the filename and the string to some normal form (NFC or NFD) before doing the comparison. You shouldn’t do this on other OSes which treat the two forms as different.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

The Wikipedia entry doesn't give details and the RFC is way too dense. Does
From the wikipedia entry on NP-Complete: The easiest way to prove that some new
So the Wikipedia entry on Lambda Calculus was interesting but I've finished it. I
In the wikipedia entry Unobtrusive JavaScript there is an example of obtrusive JavaScript: <input
On this Wikipedia entry I found out that ITU Telecommunication Standardization Sector (ITU-T) is
After reading the Test-and-Set Wikipedia entry , I am still left with the question
More information from the Perl 6 Wikipedia entry Junctions Perl 6 introduces the concept
The image above is from Wikipedia's entry on AVL trees which Wikipedia indicates is
I was reading the C# entry on Wikipedia, and came across: Managed memory cannot
The Wikipedia article about special member functions doesn't contain any reference to move constructors

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.