I’m trying mount a regex that get some words on a file where all

Question

0

Editorial Team

Asked: May 19, 20262026-05-19T09:19:28+00:00 2026-05-19T09:19:28+00:00

I’m trying mount a regex that get some words on a file where all

0

I’m trying mount a regex that get some words on a file where all letters of this word match with a word pattern.

My problem is, the regex can’t find accented words, but in my text file there are alot of accented words.

My command line is:

cat input/words.txt | grep '^[éra]\{1,4\}$' > output/words_era.txt
cat input/words.txt | grep '^[carroça]\{1,7\}$' > output/words_carroca.txt

And the content of file is:

carroça
éra
éssa
roça
roco
rato
onça
orça
roca

How can I fix it?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-19T09:19:28+00:00

If your file is encoded in ISO-8859-1 but your system locale is UTF-8, this will not work.

Either convert the file to UTF-8 or change your system locale to ISO-8859-1.

# convert from ISO-8859-1 to the environmental locale before grepping
# output will be in the current locale
$ iconv -f 8859_1 input/words.txt | grep ...

# run grep with an ISO-8859-1 locale
# output will be in ISO-8859-1 encoding
$ cat input/words.txt | env LC_ALL=en_US grep ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m trying mount a regex that get some words on a file where all

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply