My proprietary text encoding uses all 256 byte values with the lower 128 being

Question

Asked: June 13, 2026

My proprietary text encoding uses all 256 byte values, with the lower 128 being mostly the same as ASCII (the important parts, i.e. control characters, spaces, and newlines, are exactly the same). I want to read such a file as bytes in C# .NET while keeping the ability to read it line by line and run regex searches on it. What is the best way to do this in C# .NET?

I realize that this would be simple if my encoding only used the first 128 byte values. I just don’t want the higher characters to get accidentally converted to Unicode values.


1 Answer

Editorial Team · Answer 1 · 2026-06-13T14:27:54+00:00

It sounds like you just want to implement your own subclass of Encoding. It’s reasonably straightforward to do this, and then you can pass it to the StreamReader constructor (or to an overload that accepts an Encoding, such as File.ReadLines or File.ReadAllText).
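As a rough illustration of the idea, here is a minimal sketch of a table-driven single-byte Encoding subclass. The class name ProprietaryEncoding and the mapping of bytes 0x80-0xFF into the Private Use Area (U+E080-U+E0FF) are assumptions made for the example, not details from the question; the real table would come from the encoding's own definition.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text;
using System.Text.RegularExpressions;

// Hypothetical name: a sketch of a single-byte encoding driven by a 256-entry table.
public sealed class ProprietaryEncoding : Encoding
{
    private readonly char[] byteToChar = new char[256];
    private readonly Dictionary<char, byte> charToByte = new Dictionary<char, byte>();

    public ProprietaryEncoding()
    {
        for (int b = 0; b < 256; b++)
        {
            // ASSUMPTION: bytes 0x00-0x7F decode as ASCII; bytes 0x80-0xFF are parked
            // in the Private Use Area (U+E080-U+E0FF). Replace with the real table.
            char c = b < 0x80 ? (char)b : (char)(0xE000 + b);
            byteToChar[b] = c;
            charToByte[c] = (byte)b;
        }
    }

    // One byte per char in both directions, so every count equals its input length.
    public override int GetByteCount(char[] chars, int index, int count) => count;
    public override int GetCharCount(byte[] bytes, int index, int count) => count;
    public override int GetMaxByteCount(int charCount) => charCount;
    public override int GetMaxCharCount(int byteCount) => byteCount;

    public override int GetChars(byte[] bytes, int byteIndex, int byteCount,
                                 char[] chars, int charIndex)
    {
        for (int i = 0; i < byteCount; i++)
            chars[charIndex + i] = byteToChar[bytes[byteIndex + i]];
        return byteCount;
    }

    public override int GetBytes(char[] chars, int charIndex, int charCount,
                                 byte[] bytes, int byteIndex)
    {
        for (int i = 0; i < charCount; i++)
        {
            // Characters that are not in the table are written as '?' (0x3F).
            bytes[byteIndex + i] =
                charToByte.TryGetValue(chars[charIndex + i], out byte b) ? b : (byte)0x3F;
        }
        return charCount;
    }
}

public static class EncodingDemo
{
    public static void Main()
    {
        // 'A', byte 0x80, newline, byte 0xFF, 'B'
        byte[] data = { 0x41, 0x80, 0x0A, 0xFF, 0x42 };

        // Pass false so StreamReader does not sniff for a byte order mark.
        using (var reader = new StreamReader(new MemoryStream(data),
                                             new ProprietaryEncoding(), false))
        {
            string line;
            while ((line = reader.ReadLine()) != null)
            {
                // Regex patterns refer to high bytes through their mapped characters.
                bool hit = Regex.IsMatch(line, "\\uE0FF");
                Console.WriteLine($"{line.Length} chars, contains byte 0xFF: {hit}");
            }
        }
    }
}
```

Passing false for detectEncodingFromByteOrderMarks matters when the data can start with any byte: otherwise a file that happens to begin with 0xFF 0xFE or 0xEF 0xBB 0xBF would be treated as UTF-16 or UTF-8.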

If you look at the code I wrote (many years ago) to handle EBCDIC, you should be able to use that as a reasonable starting point.

The overlap with ASCII seems pretty much irrelevant to this, by the way.

“I just don’t want the higher characters to get accidentally converted to unicode values.”

Any time you convert any binary data into text, you’re converting to Unicode values. That’s how text in .NET is defined.
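To make that concrete: the conversion to Unicode is harmless as long as the mapping is one-to-one, because the original bytes can always be recovered. The sketch below uses the built-in ISO-8859-1 encoding purely as a stand-in for a proprietary table (an assumption for illustration, not the questioner's encoding); it decodes each byte 0x00-0xFF to the code point with the same value.

```csharp
using System;
using System.Linq;
using System.Text;
using System.Text.RegularExpressions;

public static class RoundTripDemo
{
    public static void Main()
    {
        // Stand-in for a custom one-to-one encoding: byte N <-> U+00NN.
        Encoding latin1 = Encoding.GetEncoding("ISO-8859-1");

        // Every possible byte value, once each.
        byte[] original = Enumerable.Range(0, 256).Select(i => (byte)i).ToArray();

        // Decoding always produces Unicode (UTF-16) chars...
        string text = latin1.GetString(original);

        // ...but encoding again yields exactly the original bytes.
        byte[] back = latin1.GetBytes(text);
        Console.WriteLine(original.SequenceEqual(back)); // True

        // A regex can target a specific byte through its mapped code point.
        Console.WriteLine(Regex.IsMatch(text, "\\u00E9")); // True (byte 0xE9)
    }
}
```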
