Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 8082447
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 5, 20262026-06-05T17:06:13+00:00 2026-06-05T17:06:13+00:00

I get user input including non-ASCII characters and non-printable characters, such as \xc2d \xa0

  • 0

I get user input including non-ASCII characters and non-printable characters, such as

\xc2d
\xa0
\xe7
\xc3\ufffdd
\xc3\ufffdd
\xc2\xa0
\xc3\xa7
\xa0\xa0

for example:

email : abc@gmail.com\xa0\xa0
street : 123 Main St.\xc2\xa0

desired output:

  email : abc@gmail.com
  street : 123 Main St.

What is the best way to removing them using Java?
I tried the following, but doesn’t seem to work

public static void main(String args[]) throws UnsupportedEncodingException {
        String s = "abc@gmail\\xe9.com";
        String email = "abc@gmail.com\\xa0\\xa0";

        System.out.println(s.replaceAll("\\P{Print}", ""));
        System.out.println(email.replaceAll("\\P{Print}", ""));
    }

Output

abc@gmail\xe9.com
abc@gmail.com\xa0\xa0
  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-05T17:06:16+00:00Added an answer on June 5, 2026 at 5:06 pm

    Your requirements are not clear. All characters in a Java String are Unicode characters, so if you remove them, you’ll be left with an empty string. I assume what you mean is that you want to remove any non-ASCII, non-printable characters.

    String clean = str.replaceAll("\\P{Print}", "");
    

    Here, \p{Print} represents a POSIX character class for printable ASCII characters, while \P{Print} is the complement of that class. With this expression, all characters that are not printable ASCII are replaced with the empty string. (The extra backslash is because \ starts an escape sequence in string literals.)


    Apparently, all the input characters are actually ASCII characters that represent a printable encoding of non-printable or non-ASCII characters. Mongo shouldn’t have any trouble with these strings, because they contain only plain printable ASCII characters.

    This all sounds a little fishy to me. What I believe is happening is that the data really do contain non-printable and non-ASCII characters, and another component (like a logging framework) is replacing these with a printable representation. In your simple tests, you are failing to translate the printable representation back to the original string, so you mistakenly believe the first regular expression is not working.

    That’s my guess, but if I’ve misread the situation and you really do need to strip out literal \xHH escapes, you can do it with the following regular expression.

    String clean = str.replaceAll("\\\\x\\p{XDigit}{2}", "");
    

    The API documentation for the Pattern class does a good job of listing all of the syntax supported by Java’s regex library. For more elaboration on what all of the syntax means, I have found the Regular-Expressions.info site very helpful.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I posted a question about how to get user input such as YES or
I basically need to get user input: gets.chomp(input?) And then to convert the given
is there a way to get user input like an inputbox in c#? i
I'm trying to get edit text to behave in this way: User can input
How can I get the input for an editable JComboBox . When user gives
In C# Winforms, while user first load the form, I get the input as
I need to create a Dialog / Prompt including TextBox for user input. My
I've having trouble with parsing user input with Javascript, where I can't get the
Whats the best way to get User Input from the view to controller. I
I am using the FreeTextBox.dll to get user input, and storing that information in

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.