This code serializes an array of 32 bytes exactly as I want:
#[derive(Debug, Clone, Serialize, Deserialize, PartialEq, Eq, PartialOrd, Ord)]
struct Hash([u8; 32]);
let hash = Hash([1u8; 32]);
let hash_bin = bincode::serialize(&hash).unwrap();
assert_eq!(hash_bin, [1u8; 32]);
How does it work?
According to https://serde.rs/impl-serializer.html there is a serialize_bytes()
function, but the bincode version it prepends a length to the data.
fn serialize_bytes(self, v: &[u8]) -> Result<()> {
O::IntEncoding::serialize_len(self, v.len())?;
self.writer.write_all(v).map_err(Into::into)
}
What function in Serialize does the code (further above) call, to serialize the 32 bytes as themselves with no length prefix?
Context: I'm implementing a customer serialize for a type and I want it (under some circumstances) to serialize arrays of bytes so that bincode encodes them as bytes with no length prefix. This is a problem because calling serialize_bytes()
adds a length prefix.
I want to understand how arrays of bytes are serialized by default, as I do not know which method to call in place of serialize_bytes()
to get bytes without a length prefix.
How are
[u8; N]
and[u8]
serialized?To cut strait to the point, here is how
serde
1.0.151 implements each method.serialize_bytes
is not actually part ofserde
so it gets treated as a sequence.The methods
serialize_tuple
andcollect_seq
are implemented by the specific serializer you are using.The easy way
One common problem is that
serde
only implementsSerialize
/Deserialize
for arrays up to length 32. The easiest approach is to use a crate likeserde_with
which adds extra serialize/deserialize implementations you can attach to your structs. Here is an example taken from their documentation:How can we implement it ourselves? Rust Playground
Serialize
Performing serialization is actually quite easy. Serde does not have a concept of arrays, so we need to choose between
serialize_tuple
orserialize_seq
. Under the hood, the only difference isserialize_seq
may not have a known length so we can chooseserialize_tuple
.Deserialize
On the other hand, deserialize gets a bit more complicated. We need to define a visitor that then specifies how each element should be visited. I wrote out a single example of how it could be done in the general case of an array, but this is not the most optimal solution since it first deserializes onto the stack. I also had to make use of
unsafe
code to only initialize the array one element at a time, but thatunsafe
code can easily be removed ifT: Default
or if an expanding data structure like aVec<T>
is used instead. Generally, this is more intended to be a guide for implementing deserialize on a sequence.If anyone is curious how
derive(Deserialize)
works on structs, I would recommend looking at this Rust Playground where I expanded macros and then cleaned up the output to be more human readable. Seeing how serialize/deserialize works can really help to demystify the process.