Is it possible to use UTF-8 in a subdomain? If so, which characters are allowed and how does the can’t-mix-encodings thing work?
I’ve tried to RTFM, but Google wan’t of much help
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
There aren’t many things special about subdomains. A given domain name
foo.example.comis an ordered list of labels (foo,example,com). So you might want to know if you can use UTF-8 in a given label.The low level answer is that a label is defined as:
which means that you can only find
[-a-zA-Z0-9]in a label.However, IDNA can be used to encode Unicode characters. In short, a label containing other characters is encoded with:
"xn--" + punycode(nameprep(label)).As for limitations at least: