I’m writing an application that needs to transcode its input from UTF-8 to ISO-8859-1

Question

0

Asked: June 4, 20262026-06-04T04:02:09+00:00 2026-06-04T04:02:09+00:00

I’m writing an application that needs to transcode its input from UTF-8 to ISO-8859-1

0

I’m writing an application that needs to transcode its input from UTF-8 to ISO-8859-1 (Latin 1).

All works fine, except I sometimes get strange encodings for some umlaut characters. For example the Latin 1 E with 2 dots (0xEB) usually comes as UTF-8 0xC3 0xAB, but sometimes also as 0xC3 0x83 0xC2 0xAB.

This happened a number of times from different sources and noting that first and last characters match what I expect, could there be an encoding rule that my library doesn’t know about ?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-04T04:02:11+00:00

Editorial Team

2026-06-04T04:02:11+00:00Added an answer on June 4, 2026 at 4:02 am

$ "\xC3\x83\xC2\xAB"
Ã«
$ use Encode

$ decode 'UTF-8', "\xC3\x83\xC2\xAB"
ë

You have double-encoded UTF-8. Encode::Repair is one way to deal with that.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m writing an application that needs to transcode its input from UTF-8 to ISO-8859-1

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply