My source XML has the copyright character in it as ©. When writing the XML with this code:
var stringWriter = new StringWriter();
segmentDoc.Save(stringWriter);
Console.WriteLine(stringWriter.ToString());
it is rendering that copyright character as a little “c” with a circle around it. I’d like to preserve the original code so it gets spit back out as ©. How can I do this?
Update: I also noticed that the source declaration looks like <?xml version="1.0" encoding="utf-8"?> but my saved output looks like <?xml version="1.0" encoding="utf-16"?>. Can I indicate that I want the output to still be utf-8? Would that fix it?
Update2: Also,   is getting output as ÿ. I definitely don’t want that happening!
Update3: § is becoming a little box and that is wrong, too. It should be §
I strongly suspect you won’t be able to do this. Fundamentally, the copyright sign is
©– they’re different representations of the same thing, and I expect that the in-memory representation normalizes this.What are you doing with the XML afterwards? Any sane application processing the resulting XML should be fine with it.
You may be able to persuade it to use the entity reference if you explicitly encode it with ASCII… but I’m not sure.
EDIT: You can definitely make it use a different encoding. You just need a
StringWriterwhich reports that its “native” encoding is UTF-8. Here’s a simple class you can use for that:You could try changing it to use
Encoding.ASCIIas well and see what that does to the copyright sign…