I am creating a Java application were i have a paragraph that i want to grab words from it. I want to write a Regular expression to grab only English words and ignore any words having accents inside them(Having 1 or more non English letters).
Example: berøres af eller er afgørende for TEAM
Should i use \p{M} or what ?
This will match all words containing the letters A-Z.
Some flavors of regex will match
\bon the accented characters. Because of that, you’ll need to add punctuation to the lookahead if that is a requirement.