For example, in case of 32-bit processors, a word is 4-byte long. Is it also possible to use 5-byte word or others?
In computers 32-bit or 64-bit processors are used, why not 40-bit or other numbers?
793 Views Asked by rneha725 At
2
There are 2 best solutions below
3

Historically, there have been a few computers with word sizes not a power of two, as in this Table of word sizes. However, eventually people discovered that address arithmetic is much easier to implement when the address size is a power of two.
Consider an operation such as "jump forward by 14 words". If word size is a power of two, say 64, then the circuit needs to shift the number 14 by log(64)/log(2)=6
and add to ip
, and that can easily be done in 1 cycle. If, however, the word size is 36, as in IBM 701, then the number 14 would have to be multiplied by 36, and that would take more cycles. Given that multiplying an integer by word size is a very common operation the slowdown would be significant.
If you're talking about whether it's possible for a hardware architecture to have 5-byte word then obviously yes, the word size can be anything that the designer wants and it's entirely possible to have a platform with 55-bit word and 11-bit byte. It's just that powers of 2 are common nowadays, but in the pass other word sizes exist. There were 40-bit platforms with 5-bit byte or 60-bit architectures with 6-bit char for example. See also
If you mean using a 40-bit type in a platform with 32-bit word then the answer is still yes. You can even use just a few bits instead of the whole byte/word through bit fields. Technically compilers can support any integer sizes on any architectures, like a 12-bit, 30-bit or 96-bit int on a 16-bit computer. In fact Clang has just got a new extension for integers with arbitrary bit width called
_ExtInt
and that's going to be standardized as_BitInt
in C23. See alsoPerformance would suffer a lot. We need more instructions to deal with non-native integer sizes. If the size is smaller than a word then compilers need to emit bitwise instructions to mask out the remaining bits. In the reverse case we need multiple instructions to work on multiple words
Another important thing is misalignment. Modern CPUs work more efficiently when a variable's address is a multiple of its size, or at least a multiple of the data bus width/word size. Working with an odd-sized variable is just awkward
That said, there exist many 32-bit architectures with a 40-bit type, like the TI C6000 or the TI C5500 DSPs
It's because those DSPs have a special 40-bit accumulator to allow adding 32-bit numbers 256 times without overflow. But why don't just use a 64-bit accumulator? That'll need a much bigger ALU, require more power and run slower (because bigger area means longer distance between hardware components, and operating on big numbers are slower than on smaller ones) which is unacceptable in a DSP which is designed for performance (and possibly also power)
In fact, 40-bit int is very common among DSPs (other examples being Blackfin and SHARC). SHARC even has an 80-bit accumulator so you can add a lot of 64-bit values without worrying about overflow
But you don't need a special architecture or special compiler support for that. You can still use a 40-bit variable if you really need to, for example when you work with a huge array where 64-bit integers would make it too large to fit in main memory and fewer items would fit in the cache. The simplest way to do that is to disable alignment with
#pragma pack
or__attribute__((packed))
or access the values from a simple char array (based on cmm's solution)
But those will result in bad performance on architectures without unaligned access. You can pack four 40-bit ints to a single 20-byte struct to have better alignment