
Question


Asked: May 14, 2026 (2026-05-14T06:16:29+00:00)

PHP’s str_replace() was intended only for ANSI strings and as such can mangle UTF-8


PHP’s str_replace() was intended only for ANSI strings and as such can mangle UTF-8 strings. However, given that it’s binary-safe, would it work properly if it were given only valid UTF-8 strings as arguments?

Edit: I’m not looking for a replacement function, I would just like to know if this hypothesis is correct.


1 Answer

Editorial Team · Answer 1 · 2026-05-14T06:16:30+00:00

Yes. UTF-8 is deliberately designed to allow this and other similar non-Unicode-aware processing.

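In valid UTF-8, every non-ASCII character is encoded as a lead byte in the range \xC2-\xF4 followed by one to three continuation bytes in the range \x80-\xBF, while ASCII characters are single bytes in the range \x00-\x7F. These three ranges do not overlap, so a lead byte can never appear in the middle of a sequence and an ASCII byte can never appear inside a multibyte character. As a result, a needle that is itself valid UTF-8 can only match at character boundaries; it can never match part of a character.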
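The byte ranges above can be checked with a short sketch. The helper `utf8_byte_role()` and the sample strings are hypothetical, chosen only for illustration; the sketch assumes its input is already valid UTF-8 and uses only core functions (`str_replace()`, `unpack()`).

```php
<?php
// Hypothetical helper: classify one byte of a *valid* UTF-8 string by its role.
function utf8_byte_role(int $byte): string
{
    if ($byte <= 0x7F) {
        return 'ascii';        // 0x00-0x7F: a complete single-byte character
    }
    if ($byte <= 0xBF) {
        return 'continuation'; // 0x80-0xBF: second or later byte of a character
    }
    return 'lead';             // 0xC2-0xF4 in valid UTF-8: first byte of a character
}

// "é" is C3 A9: one lead byte followed by one continuation byte.
$roles = array_map('utf8_byte_role', array_values(unpack('C*', "\xC3\xA9")));

// "é" (C3 A9) and "©" (C2 A9) share the trailing byte A9.
$haystack = "caf\xC3\xA9 \xC2\xA9 2026";

// A valid UTF-8 needle matches only the whole "©"; the "é" is left alone.
$result = str_replace("\xC2\xA9", "(c)", $haystack);

// A lone continuation byte is NOT valid UTF-8, and using it as a needle
// cuts both characters in half, leaving an invalid string.
$mangled = str_replace("\xA9", "?", $haystack);
```

Here `$result` holds "café (c) 2026", while `$mangled` is left with two orphaned lead bytes. The guarantee only holds when the search and replacement arguments are themselves valid UTF-8.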

This is not the case for older multibyte encodings, where the later bytes of a sequence can be indistinguishable from lead bytes or from ASCII. This caused a lot of problems, for example when trying to replace an ASCII backslash in a Shift-JIS string, where the byte \x5C might be the second byte of a two-byte character rather than a backslash.
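A minimal sketch of the Shift-JIS problem, using the character "表" as an assumed example: it is encoded as the bytes 95 5C in Shift-JIS and as E8 A1 A8 in UTF-8. The strings are written as hex escapes so that the sketch does not depend on the source file's encoding or on any extension.

```php
<?php
// "表" in Shift-JIS: the second byte, 0x5C, is also the ASCII backslash.
$sjis = "\x95\x5C";

// The byte-wise replace cannot tell the trail byte from a real backslash,
// so it rewrites half of the character.
$sjisResult = str_replace("\\", "/", $sjis);

// "表" in UTF-8 followed by a real backslash and the ASCII text "path".
$utf8 = "\xE8\xA1\xA8\\path";

// Every byte of the multibyte character is >= 0x80, so only the real
// backslash matches.
$utf8Result = str_replace("\\", "/", $utf8);

echo bin2hex($sjisResult), "\n"; // 952f (the character is corrupted)
echo bin2hex($utf8Result), "\n"; // e8a1a82f70617468 (the character is intact)
```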
