How can the negation meta-character, ~, be used in ANTLR’s lexer- and parser rules?
How can the negation meta-character, ~ , be used in ANTLR’s lexer- and parser
Share
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
Negating can occur inside lexer and parser rules.
Inside lexer rules you can negate characters, and inside parser rules you can negate tokens (lexer rules). But both lexer- and parser rules can only negate either single characters, or single tokens, respectively.
A couple of examples:
lexer rules
To match one or more characters except lowercase ascii letters, you can do:
(the negation-meta-char,
~, has a higher precedence than the+, so the rule above equals(~('a'..'z'))+)Note that
'a'..'z'matches a single character (and can therefor be negated), but the following rule is invalid:Because
'ab'(obviously) matches 2 characters, it cannot be negated. To match a token that consists of 2 character, but not'ab', you’d have to do the following:parser rules
Inside parser rules,
~negates a certain token, or more than one token. For example, you have the following tokens defined:If you now want to match any token except the
A, you do:And if you want to match any token except
BandD, you can do:However, if you want to match any two tokens other than
Afollowed byB, you cannot do:Just as with lexer rules, you cannot negate more than a single token. To accomplish the above, you need to do:
Note that the
.(DOT) char in a parser rules does not match any character as it does inside lexer rules. Inside parser rules, it matches any token (A,B,C,DorE, in this case).Note that you cannot negate parser rules. The following is illegal: