when i try to get the text from a document, if it is followed

Question

0

Asked: June 3, 20262026-06-03T19:48:47+00:00 2026-06-03T19:48:47+00:00

when i try to get the text from a document, if it is followed

0

when i try to get the text from a document, if it is followed by some special characters such as TM or C (for copyright) and so on, after writing it into a text file it will makes some unexpected added to it. as an example, we can consider the following:

if we have Apache™ Hadoop™! and then if we try to write in into a text using FileOutputStream then result would be like Apacheâ Hadoopâ which the â is nonsense for me and generally i want a way to detect such characters in the text and just skipping them for writing them, is there solution to this?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-03T19:48:48+00:00

If you want just the printable ASCII range, then iterate over your string character by character building a new string. Include the character only if it’s within the range 0x20 to 0x7E.

final StringBuilder buff = new StringBuilder();
for (char c : string.toCharArray())
{
  if (c >= 0x20 && c <= 0x7E)
  {
    buff.append(c);
  }
}

final FileWriter w = new FileWriter(...);
w.write(buff.toString());
w.close();

If you want to keep carriage returns and newlines, you also need to consider 0x0A and 0x0D.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

when i try to get the text from a document, if it is followed

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply