In Parse::RecDescent, how do I effectively ignore C++/Java style comments? This includes single-line ('//' until the end of the line) and multi-line (/everything between here/).
how to skip all single- and multi-line comments in a Parse::RecDescent parser
1.4k Views Asked by Nate Glenn At
2
There are 2 best solutions below
3

You have to set the the value of $Parse::RecDescent::skip
. By default, Parse::RecDescent skips all white space. If you set this variable to a regex matching whitespace and comments, you can skip them. Use this:
$Parse::RecDescent::skip =
qr{
(
\s+ #whitespace
| #or
/[*] .*? [*]/ \s* #a multiline comment
| #or
//.*?$ #a single line comment
)* #zero or more
}mxs;
# m allows '$' to match a newline, x allows regex comments/whitespace,
# s allows '.' to match newlines.
<skip>
defines what the parser considers whitespace.Unlike Nate Glenn's solution, mine
Note:
(?:(?!STRING).)*
is to(?:STRING)
as[^CHAR]
is toCHAR
.