According to http://www.regular-expressions.info , You can consider \X the Unicode version of the dot

Question

Editorial Team

Asked: June 1, 2026 (2026-06-01T00:18:09+00:00)

According to http://www.regular-expressions.info,

You can consider \X the Unicode version of the dot in regex engines that use plain ASCII.

Does this mean that it will match any possible Unicode code point?

1 Answer

Editorial Team · Answer 1 · 2026-06-01T00:18:10+00:00

The site’s description is pretty good:

\X Matches a single Unicode grapheme, whether encoded as a single code point or multiple code points using combining marks. A grapheme most closely resembles the everyday concept of a “character”. \X matches à encoded as U+0061 U+0300, à encoded as U+00E0, ©, etc.

So, the thing that makes it Unicode-aware is that it can match several code points when those combine to a single visible “thing” (grapheme).

See Wikipedia’s page on Combining Characters for more detail; it lists the U+0300 code point mentioned above, for instance.
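To make the difference concrete, here is a minimal Python sketch. It assumes only the standard library, where the `re` module's dot matches one code point at a time. The `graphemes` helper is a hypothetical name introduced for illustration, and it only approximates \X by attaching combining marks to the preceding base character; real grapheme segmentation follows more rules than this.

```python
import re
import unicodedata

decomposed = "a\u0300"   # à as U+0061 followed by combining U+0300
precomposed = "\u00e0"   # à as the single code point U+00E0

# The plain dot matches one code point at a time, so the decomposed
# form is split into the base letter and the combining mark.
dot_matches = re.findall(r".", decomposed)


def graphemes(text):
    # Hypothetical helper: a rough approximation of \X that groups a base
    # code point with any combining marks that follow it. This is NOT the
    # full Unicode grapheme cluster algorithm.
    clusters = []
    for ch in text:
        if clusters and unicodedata.combining(ch):
            clusters[-1] += ch   # attach the mark to the previous cluster
        else:
            clusters.append(ch)  # start a new cluster
    return clusters


print(len(dot_matches))              # number of code points the dot saw
print(len(graphemes(decomposed)))    # number of visible "characters"
print(len(graphemes(precomposed)))
```

With the dot, the decomposed à counts as two matches; grouped the way \X would group it, both encodings count as one visible character. So \X is less "any code point" and more "any user-perceived character, however many code points it takes".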
