I had filter="ip or vlan" and I pass it to libpcap like below and it is working in my application for a long time without any problem.
pcap_set_filter(pdesc->pd, filter, 0, netp);
Now I have been asked to parse arp trafic also . So I set my filter
"ip or vlan or arp or rarp"
. But my callback function is not being called for arp packets even for ip packets my function still being called.
In short my question is how to set libpcap filter correctly to get arp packets from system ?
I examined the BPF instructions in WireShark to figure out what happens. Here are the BPF filter programs for various filters:
1. Simple cases:
Filter:
vlan
Filter:
ip
Filter:
arp
Filter:
rarp
Filter:
ip or arp or rarp
2. Combining
ip
withvlan
reveals that the order of search tags is important:Your first filter was
ip or vlan
. Its pseudocode is as follows:For filter
vlan or ip
, we would like to see this:This would mean the same, which is OK, because
A or B
should mean the same asB or A
, shouldn't it. But we get this:This means something like the following pseudocode:
This doesn't make sense. Line (002) is unnecessary. The instructions should look like this:
Maybe I will be killed for saying this, but I think this is bug in libpcap. Where does the above line
(002) ldh [16]
come from? If the filter wasvlan and ip
, then checking the bytes at offset 16 would make sense: now we want to find VLAN-tagged frames which contain an IP packet. In such frames, there are two EtherType fields: the first (at offset 12) contains the VLAN EtherType value (0x8100), an the second (at offset 16) contains the EtherType of the IP protocol (#0x800):3. Why didn't your filter find ARP and RARP packets?
Your filter was
ip or vlan or arp or rarp
. This compiles to:This code has the above bug: libpcap tries to find the ARP and RARP EtherTypes at offset 16.
4. The solution to your problem
You can avoid the bug by adding them at the beginning of the filter:
arp or rarp or ip or vlan
. This compiles to:Which means: