ANTLR gramar to detect ambiguous tokens

290 Views Asked by Alfre2 At 27 June 2025 at 09:01

I'm creating a simple grammar in ANTLR to match somekind of commands. I'm stuck with tokens which use special characters.

Those commands would match sentences like...

connect "HAL" computer 4
connect "HAL256" computer 8
connect "HAL2⁸" computer 16
connect "HAL 9000" computer 32
connect "HAL \x0A25 | 32" computer 64

... to produce something like:

interpretation

It's clear that my problem is in the ID token, but I don't know how to solve it. Here is my current grammar:

grammar foo;
ID      :   '"' ('\u0000'..'\uFFFF')+ '"' ;
NUMBER  :    ('0'..'9')* ;
SENTENCE    :    'connect ' ID ' computer' NUMBER ;

How could I do it?

Original Q&A

There are 1 best solutions below

Bart Kiers On 13 January 2014 at 08:42 BEST ANSWER

There are a couple of issues with your grammar:

NUMBER matches an empty string: lexer rules must always match at least 1 character
SENTENCE should be a parser rule (see: Practical difference between parser rules and lexer rules in ANTLR?)
('\u0000'..'\uFFFF')+ also matches a '"', which you most probably son't want

Try something like this instead:

sentence   : K_CONNECT ID K_COMPUTER NUMBER;

K_CONNECT  : 'connect';
K_COMPUTER : 'computer';
ID         : '"' (~'"')+ '"';
NUMBER     : ('0'..'9')+;
SPACE      : (' ' | '\t' | '\r' | '\n')+ {skip();};

ANTLR gramar to detect ambiguous tokens

There are 1 best solutions below

Related Questions in ANTLR

Related Questions in GRAMMAR

Related Questions in ANTLRWORKS

Trending Questions

Popular # Hahtags

Popular Questions