I have been tasked to decode an alarm event from a Hikvision ANPR camera, and one of the xml fields is <plateCharBelieve>x,x,x,x,x,x,x</plateCharBelieve>
where x is a byte representing 0-100 on what confidence the ANPR has on each character of the licence plate. When the xml unmarshal happens it seems to try and unmarshal it as an ASCII character, and every now and then one of those ASCII characters is an illegal character and throws an error. XML syntax error on line 19: illegal character code U+000C
.
Is there a way I can prevent the default behaviour? or maybe implement a custom decoder? If I can decode it as int of 0-100. How would I just drop/ignore that line as I don't really require it. Just wont to stop it throwing an error.
Appreciate any feedback.
golang don't support decode the non-standard xml
follow is the code:
src/encoding/xml
line1142
and line1154
standard xml Section 2.2 Characters don't allow some byte
follow is some solutoin:
let xml decode any byte, just delete the
isInCharacterRange
:src/encoding/xml
line1142
then defined PlateCharBelieve as string or others:
code:
output:
<root><otherInfo>otherInfo</otherInfo><plateCharBelieve>>!,abc</plateCharBelieve></root
if you want decode the []byte, impl the Unmarshaler interface, like follow
if you dont want to change the golang code,you can relpace the bytes by regx:
1.you can use
<![CDATA[]]>
enclose it2.you can use regexp.replace delete it
3.you can use regexp.match decode it yourself
follow is a demo