I am new to Hbase world and trying the below query. I triggered the below hbase command form shell. I expect to retrieve only the rows which match the filtering criteria: rows matching the column name 'serviceId' with value '61490992624'.
However, it retrieves other rows (rod ids: 100545, 100546,100547,Australia, NSW,NT) as well. I expect only row id TX005 to be retrieved.
any reason to this behaviour?.
hbase(main):024:0> scan 'api_cus_usage', {FILTER=>"SingleColumnValueFilter('usage','serviceId',=,'binary:61490992624')"}
ROW COLUMN+CELL
100545 column=usage:colC, timestamp=1569912663258, value=100545
100546 column=usage:colC, timestamp=1569912663269, value=100546
100547 column=usage:colC, timestamp=1569912663279, value=100547
Australia column=usage:colB, timestamp=1569912663276, value=Australia
NSW column=usage:colA, timestamp=1569912663262, value=NSW
NT column=usage:colA, timestamp=1569912663272, value=NT
TX005 column=usage:balanceInstanceId, timestamp=1567147803234, value=3
TX005 column=usage:eventDateAndTime, timestamp=1567147802721, value=20190815T000000+1000
TX005 column=usage:serviceId, timestamp=1567147802752, value=61490992624
I figured out the reason for the behaviour. The filterIfMissing flag has to be set to true. The below command gave the appropriate results. the last Boolean arg 'true' is for latestVersionOnly. Without that flag, command didnt work.
Thanks to this link: HBase: Records returned where field is not present