Big Endian bytes vs. Strings as keys in string - string databases

347 Views Asked by At

I haven't seen the common sense notion of converting an integer to network order and to write the resulting bytes into an indexable entity in a string - string database vs. writing the string representation of the number anywhere in the documentation of such databases.

Surely the size overhead of writing a 64-bit int as a string into a database must outweigh the trivial complexity of having to do a ntohl call before writing the bytes back into an integer type.

I am therefore missing something here, what are the downsides to using big-endian bytes vs. strings as indexable entities in string-string databases ?

(C++/C tags as I am talking about writing bytes into the memory location of a programatic type, BDB as that is the database I am using, could be kyotodb as well).

2

There are 2 best solutions below

4
On BEST ANSWER

The advantage of big-endian in this case is that the strings would sort correctly in ascending order.

If the database architecture cannot natively store 64-bit integers, but you need to store them anyway, stringifying them this way is a way to do it.

Of course if you later upgrade the database to one that can store 64-bit integers natively, you will either be "stuck" with the implementation or have to go through a migration process.

0
On

If the database validates that the string data you send is valid in the expected encoding then you can't just give it any data you want. You'll only be able to send such integers as happen to look like a valid encoding. I don't know if BDB or kyotodb do such validation.

Also it seems to me like a hack to try to trick one data type to hold something else, and then rely on clients to all know the trick. Of course that applies whether you're using the string to hold a ascii-decimal representation of the integer or if you're using the string as a raw memory buffer to hold the integer. It seems to me that it'd be better to use a database that actually holds the types you want to hold, instead of just strings.