Question

Asked: May 10, 2026 (2026-05-10T18:43:52+00:00)

I’m looking for a way to match only fully composed characters in a Unicode string.

Is [:print:] locale-dependent in any regular expression implementation that supports this character class? For example, will it match the Japanese character ‘あ’, since it is not a control character, or does [:print:] always cover only ASCII codes 0x20 to 0x7E?

Is there any character class, including in Perl REs, that matches anything other than a control character? If [:print:] includes only characters in the ASCII range, I would assume [:cntrl:] does too.


1 Answer

score 0 · Answer 1 · 2026-05-10T18:43:52+00:00

Added an answer on May 10, 2026 at 6:43 pm

echo あ | perl -nle 'BEGIN { binmode STDIN, ":utf8" } print "[$_]"; print /[[:print:]]/ ? "YES" : "NO"'

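This mostly works, though it emits a "Wide character in print" warning because STDOUT has no UTF-8 layer; adding binmode STDOUT, ':utf8' to the BEGIN block silences it. But it gives you the idea: you must be sure you’re dealing with a real Unicode string, that is, decoded characters rather than raw bytes (check utf8::is_utf8). Or just read perlunicode; the whole subject still makes my head spin.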
