Both languages claim to use Perl style regular expressions. If I have one language

Asked: May 11, 2026 (2026-05-11T05:54:10+00:00)

Both languages claim to use Perl style regular expressions. If I have one language test a regular expression for validity, will it work in the other? Where do the regular expression syntaxes differ?

The use case here is a C# (.NET) UI talking to an eventual Java back end implementation that will use the regex to match data.

Note that I only need to worry about matching, not about extracting portions of the matched data.

1 Answer

score 0 · Answer 1 · 2026-05-11T05:54:10+00:00

There are quite a lot of differences.

Character Class

Character class subtraction [abc-[cde]]
- .NET: YES (2.0)
- Java: Emulated via character class intersection and negation: [abc&&[^cde]]
Character class intersection [abc&&[cde]]
- .NET: Emulated via character class subtraction and negation: [abc-[^cde]]
- Java: YES
\p{Alpha} POSIX character class
- .NET: NO
- Java: YES (US-ASCII)
Under (?x) mode (COMMENTS in Java, IgnorePatternWhitespace in .NET), a space (U+0020) inside a character class is significant.
- .NET: YES
- Java: NO
Unicode Category (L, M, N, P, S, Z, C)
- .NET: YES, \p{L} form only
- Java: YES
  - From Java 5: \pL, \p{L}, \p{IsL}
  - From Java 7: \p{general_category=L}, \p{gc=L}
Unicode Category (Lu, Ll, Lt, …)
- .NET: YES, \p{Lu} form only
- Java: YES
  - From Java 5: \p{Lu}, \p{IsLu}
  - From Java 7: \p{general_category=Lu}, \p{gc=Lu}
Unicode Block
- .NET: YES, \p{IsBasicLatin} form only. (Supported Named Blocks)
- Java: YES (block names are case-insensitive)
  - From Java 5: \p{InBasicLatin}
  - From Java 7: \p{block=BasicLatin}, \p{blk=BasicLatin}
Spaces and underscores are allowed in all long block names (e.g. BasicLatin can be written as Basic_Latin or Basic Latin)
- .NET: NO
- Java: YES (Java 5)
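
To make a few of the character class points concrete on the Java side, here is a minimal sketch. The class and constant names are made up for illustration. It shows the intersection-plus-negation spelling of the .NET class [abc-[cde]], the ASCII-only behaviour of \p{Alpha} under default flags, and the \p{L} form that both engines accept.

```java
import java.util.regex.Pattern;

class CharClassDemo {
    // Java spelling of the .NET class [abc-[cde]]: a, b or c, minus c, d and e.
    static final Pattern SUBTRACTION = Pattern.compile("[abc&&[^cde]]");

    // POSIX class: US-ASCII letters only when compiled with default flags.
    static final Pattern POSIX_ALPHA = Pattern.compile("\\p{Alpha}");

    // Unicode category "Letter", written in the \p{L} form that .NET also accepts.
    static final Pattern UNICODE_LETTER = Pattern.compile("\\p{L}");

    static boolean matches(Pattern p, String input) {
        return p.matcher(input).matches();
    }

    public static void main(String[] args) {
        System.out.println(matches(SUBTRACTION, "a"));         // true
        System.out.println(matches(SUBTRACTION, "c"));         // false
        System.out.println(matches(POSIX_ALPHA, "\u00e9"));    // false: not US-ASCII
        System.out.println(matches(UNICODE_LETTER, "\u00e9")); // true
    }
}
```

If the same pattern has to run in both engines, the \p{L} and \p{Lu} forms look like the safest common ground for categories, since they are the only forms .NET accepts.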

Quantifier

?+, *+, ++ and {m,n}+ (possessive quantifiers)
- .NET: NO
- Java: YES
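
A small sketch of what a possessive quantifier changes, using a made-up class name. The atomic group (?>...) is not covered in the list above; I am adding it on the assumption that you need a spelling both engines accept, since a++ behaves like (?>a+) and atomic groups exist in both .NET and Java.

```java
import java.util.regex.Pattern;

class PossessiveDemo {
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    public static void main(String[] args) {
        // Greedy: a+ gives back one "a" so the trailing "a" can match.
        System.out.println(matches("a+a", "aaa"));     // true
        // Possessive: a++ keeps every "a", so the trailing "a" never matches.
        System.out.println(matches("a++a", "aaa"));    // false
        // Atomic group: same behaviour as the possessive form.
        System.out.println(matches("(?>a+)a", "aaa")); // false
    }
}
```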

Quotation

\Q...\E escapes a string of metacharacters
- .NET: NO
- Java: YES
\Q...\E escapes a string of character class metacharacters (in character sets)
- .NET: NO
- Java: YES
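
Since \Q...\E is Java-only, a pattern that must also run in .NET needs per-character escaping instead. Below is a rough sketch. escapeLiteral is a hypothetical helper, not a library method, and it assumes the text is used outside a character class and outside (?x) mode, where whitespace and # would need extra care.

```java
import java.util.regex.Pattern;

class QuoteDemo {
    // Hypothetical helper: put a backslash in front of each metacharacter,
    // giving a literal pattern that does not rely on \Q...\E.
    static String escapeLiteral(String text) {
        StringBuilder sb = new StringBuilder();
        for (char c : text.toCharArray()) {
            if ("\\.[]{}()*+?^$|".indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Unescaped, "+" is a quantifier, so the literal text does not match.
        System.out.println(Pattern.matches("1+1=2", "1+1=2"));                // false
        // Java-only quoting.
        System.out.println(Pattern.matches("\\Q1+1=2\\E", "1+1=2"));          // true
        // Backslash-escaped form.
        System.out.println(Pattern.matches(escapeLiteral("1+1=2"), "1+1=2")); // true
    }
}
```

On the C# side, Regex.Escape produces a backslash-escaped string in the same spirit, so the UI could escape literals there rather than emitting \Q...\E.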

Matching construct

Conditional matching (?(?=regex)then|else), (?(regex)then|else), (?(1)then|else) or (?(group)then|else)
- .NET: YES
- Java: NO
Named capturing group and named backreference
- .NET: YES
  - Capturing group: (?<name>regex) or (?'name'regex)
  - Backreference: \k<name> or \k'name'
- Java: YES (Java 7)
  - Capturing group: (?<name>regex)
  - Backreference: \k<name>
Multiple capturing groups can have the same name
- .NET: YES
- Java: NO (Java 7)
Balancing group definition (?<name1-name2>regex) or (?'name1-name2'regex)
- .NET: YES
- Java: NO
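
For the use case in the question, the .NET-only constructs above are the ones most likely to pass validation in the C# UI and then fail on the Java back end. One way to catch that early is to try compiling the pattern in Java as well. The sketch below uses a made-up helper name, compilesInJava, and only checks that the syntax is accepted. It says nothing about whether the pattern means the same thing in both engines.

```java
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

class CompileCheck {
    // Hypothetical helper: true if java.util.regex accepts the pattern syntax.
    static boolean compilesInJava(String regex) {
        try {
            Pattern.compile(regex);
            return true;
        } catch (PatternSyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Named group and backreference in the form both engines share.
        System.out.println(compilesInJava("(?<q>['\"]).*?\\k<q>"));   // true
        // .NET-only spellings and constructs.
        System.out.println(compilesInJava("(?'q'['\"]).*?\\k'q'"));   // false
        System.out.println(compilesInJava("(a)?(?(1)b|c)"));          // false
        System.out.println(compilesInJava("(?<n>a)(?<n>b)"));         // false
        System.out.println(compilesInJava("(?<a-b>x)"));              // false
    }
}
```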

Assertions

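(?<=text) (positive lookbehind)
- .NET: Variable-width
- Java: Obvious width (the lookbehind must have an obvious maximum length)
(?<!text) (negative lookbehind)
- .NET: Variable-width
- Java: Obvious width (the lookbehind must have an obvious maximum length)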
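
As an illustration of "obvious width", here is a lookbehind that uses a bounded quantifier, which Java accepts. The names are invented for the example. Unbounded quantifiers such as + or * inside a lookbehind are fine in .NET but may be rejected by Java depending on the version, so I would stick to bounded forms when the pattern has to work on both sides.

```java
import java.util.regex.Pattern;

class LookbehindDemo {
    // "c" preceded by an "a" and at most two "b"s: the maximum width (3) is obvious.
    static final Pattern BOUNDED = Pattern.compile("(?<=ab{0,2})c");

    static boolean found(String input) {
        return BOUNDED.matcher(input).find();
    }

    public static void main(String[] args) {
        System.out.println(found("abbc"));  // true
        System.out.println(found("ac"));    // true
        System.out.println(found("abbbc")); // false: three "b"s exceed the bound
        System.out.println(found("xc"));    // false
    }
}
```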
