While I was parsing the Snort regex set I found a very odd character class syntax, like [\x80-t]
or [\x01-t\x0B\x0C\x0E-t\x80-t]
, and I can't figure out (really no clue) what -t means. I don't even know if it's standard PCRE or a sort of Snort extension.
Here are some regular expression that contains these character classes:
/\x3d\x00\x12\x00..........(.[\x80-t]|...[\x80-t])/smiR
/^To\x3A[^\r\n]+[\x01-t\x0B\x0C\x0E-t\x80-t]/smi
PS: please note that \x80-t
is not even a valid range in the standard way because character t is \x74.
This could reference a different character encoding where
t
is larger thanx80
andx80
can't be addressed normally.Take EBCDIC Scan codes for example (see here for a reference).
(But I too have no clue why somebody would want to write it that way)
For ASCII I have a wild guess: If
-t
means "until the next token -1" or if placed last in line "until the end of allowed characters" the second query would state this:So basically the expression
[\x01-t\x0B\x0C\x0E-t\x80-t]
would mean[^\r\n]
.If one applies that to
(.Ç-t]|...[Ç-t])
that would address any character larger than 7bit ASCII which also could address all of unicode (besides the first 127 characters).(That being said, I still have no clue why somebody should write it like this, but at least thats a coherent explanation besides "Its a bug")
Maybe helpful: What does the rexexes you posted mean if one writes out the \xYY? ASCII:
Looking after the
\0x12
akaDevice control 2
could help, because that won't show up in text, but maybe in net traffic.