Trying to read unicode characters from a word document but getting symbols (????). Here

Question

0

Editorial Team

Asked: June 16, 20262026-06-16T20:08:26+00:00 2026-06-16T20:08:26+00:00

Trying to read unicode characters from a word document but getting symbols (????). Here

0

Trying to read unicode characters from a word document but getting symbols (????).

Here my code :

   Microsoft.Office.Interop.Word.Application word = new Microsoft.Office.Interop.Word.Application();
            object miss = System.Reflection.Missing.Value;
             object enc = Microsoft.Office.Core.MsoEncoding.msoEncodingEUCJapanese; 
            object path = @"C:\Users\file.doc"
            object readOnly = true;
            Microsoft.Office.Interop.Word.Document docs = word.Documents.Open(ref path, ref miss, ref readOnly, ref miss, ref miss,
                ref miss, ref miss, ref miss, ref miss, ref miss, ref enc, ref miss, ref miss, ref miss, ref miss, ref miss);
            string totaltext = "";
            for (int i = 0; i < docs.Paragraphs.Count; i++)
            {
                totaltext += " \r\n " + docs.Paragraphs[i + 1].Range.Text.ToString();

                Console.WriteLine(totaltext);
            }
           // Console.WriteLine(totaltext);
            docs.Close();
            word.Quit();

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-16T20:08:27+00:00

Given the comments, it sounds like the problem may well just be with Console.WriteLine.

Try writing to a file instead:

// This will use Encoding.UTF8 by default.
using (var writer = File.CreateText("test.txt"))
{
    for (int i = 0; i < docs.Paragraphs.Count; i++)
    {
        writer.WriteLine(docs.Paragraphs[i + 1].Range.Text.ToString());
    }
}

Then open the file in Notepad, specifying UTF-8 as the encoding, and I suspect you’ll see everything correctly.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Trying to read unicode characters from a word document but getting symbols (????). Here

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply