What’s the correct way to write Unicode-aware one-liners in Perl? The obvious way: $

Question

0

Asked: May 30, 20262026-05-30T18:07:02+00:00 2026-05-30T18:07:02+00:00

What’s the correct way to write Unicode-aware one-liners in Perl? The obvious way: $

0

What’s the correct way to write Unicode-aware one-liners in Perl? The obvious way:

$ echo 'フーバー' | perl  -lne 'print if /フ/'  
フーバー

…kinda appears to work on first sight, but this is just an accident: the Unicode is interpreted as bytes as the next example shows:

$ echo 'フーバー != フウバー' | perl  -mString::Diff=diff -lne 'print join(" ", diff($1, $2)) if /(.*)!=(.*)/'                                                                                 => 29
フ?[??]バー[ ] { }フ?{??}バー

Just using the -C flag to set the STDIN/STDOUT etc. to UTF‑8 is not enough by itself:

$ echo 'フーバー' | perl -C -lne 'print if /フ/' 
[no output]

…because now the text in -e is not interpreted as Unicode.

So is this the way to go (assuming a sane LOCALE — that is, one in the form "*.UTF‑8") like this:

$ perl -C -Mutf8 [...]

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-30T18:07:03+00:00

Editorial Team

2026-05-30T18:07:03+00:00Added an answer on May 30, 2026 at 6:07 pm

Yes, loading the utf8 pragma is required to interpret the “フ” UTF‑8 sequence in the source code as a character instead as separate bytes.

The Perl -C command-line switch and the utf8 pragma are locale-independent, but the shell’s echo command is not.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

What’s the correct way to write Unicode-aware one-liners in Perl? The obvious way: $

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply