I am generating CSV files. Occasionally the data source will pass along characters with

Question

0

Asked: June 9, 20262026-06-09T05:34:29+00:00 2026-06-09T05:34:29+00:00

I am generating CSV files. Occasionally the data source will pass along characters with

0

I am generating CSV files. Occasionally the data source will pass along characters with accents etc… that I would like to strip out. Is there a reasonably straightforward way to detect and strip out UTF-8 characters?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-09T05:34:30+00:00

If you’re sure you’re getting UTF-8 as input, use iconv to convert the values to the encoding you’re using in your output – detecting UTF-8 chars isn’t failsafe (as the values are valid iso-8859-1 characters as well (or all 8 bit encodings, really).

If you just want to use the regular ascii set of values (byte-values 0 – 127), you can let iconv convert to the ‘ascii’ encoding and transliterate:

iconv("utf-8", "ascii//TRANSLIT", "Hei og hå")

will result in

hei og ha

being returned.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am generating CSV files. Occasionally the data source will pass along characters with

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply