I know that BOM is used for UTF-8 files, but what about the text

Question

0

Asked: June 12, 20262026-06-12T11:52:51+00:00 2026-06-12T11:52:51+00:00

I know that BOM is used for UTF-8 files, but what about the text

0

I know that BOM is used for UTF-8 files, but what about the text files where every character is 2-bytes, should I add the byte order mark to them, too?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-12T11:52:53+00:00

BOM’s were invented for UCS-2 and UTF-16, and then only later appropriated by Microsoft (and then XML) for UTF-8. Think about the name: ‘byte order mark’. UTF-8 has only one possible byte order, so it doesn’t need a BOM to reveal the order. The three-byte sequence for U+FEFF in UTF-8 has, instead, become a Unicode signature for file type sniffing.

However, early versions of the XML support in Java did not respond well to a UTF-8 BOM, in spite of the inclusion of the UTF-8 BOM in the XML standard. Further, a file with a BOM can’t be simply concatenated onto another file, because U+FEFF isn’t BOM in the middle of the file; it’s ZWNBSP.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I know that BOM is used for UTF-8 files, but what about the text

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply