According to http://www.regular-expressions.info,
You can consider \X the Unicode version of the dot in regex engines that use plain ASCII.
Does this mean that it will match any possible Unicode code point?
The site’s description is pretty good:
So, the thing that makes it Unicode-aware is that it can match several code points when those combine to a single visible “thing” (grapheme).
See Wikipedia’s page on Combining Characters for more detail; it lists the U+0300 code point mentioned above, for instance.
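To make the "several code points, one visible thing" idea concrete, here is a rough Python sketch. Python's built-in re module does not support \X, so this does not use a regex at all. Instead it approximates the simple "base character plus any following combining marks" idea with unicodedata. The helper name simple_clusters is made up for this illustration, and real grapheme cluster rules cover more cases than this (Hangul syllables and emoji sequences, for example).

```python
import unicodedata


def simple_clusters(text):
    """Group each code point with the combining marks that follow it.

    Hypothetical helper for illustration only: a simplified approximation
    of what \\X matches, not the full Unicode grapheme cluster algorithm.
    """
    clusters = []
    for ch in text:
        # General categories Mn, Mc and Me are the "mark" categories.
        is_mark = unicodedata.category(ch).startswith("M")
        if clusters and is_mark:
            clusters[-1] += ch   # attach the mark to the preceding cluster
        else:
            clusters.append(ch)  # start a new cluster
    return clusters


# 'a' followed by U+0300 COMBINING GRAVE ACCENT, then a plain 'b'
word = "a\u0300b"

print(len(word))                   # 3 code points
print(len(simple_clusters(word)))  # 2 clusters: the accented a, and b
```

A plain dot works per code point and would see three items here, while a \X-style match sees two. If you need real \X support in Python, the third-party regex package provides it.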