I can't seem to grasp the concept on these stuff, even with the help of Google and a textbook in my hand.
Following the format (opcode, rs, rt, offset)...
- Do you sign extend the offset before adding it to the value of the address? Or add before extending?
- In the case of lb and lbu, what's the difference? Does it also follow the MIPS arithmetic definition that 'unsigned' just means it won't report an overflow?
- Why doesn't lw have an unsigned version? Even the store instructions don't have one...
The "load byte" instructions
lb
andlbu
load a single byte into the right-most byte of a 32-bit register. How do you set the upper 24 bits? The unsigned operation will set them to zero; the signed operation will sign-extend the loaded byte.For example, suppose you read the byte
0xFF
from memory.lbu
will 0-extend this value to0x000000FF
and interpret it as 255, whilelb
will sign-extend it to0xFFFFFFFF
, which is interpreted as -1.The "load word" instruction (
lw
), on the other hand, loads a 32-bit quantity into a 32-bit register, so there is no ambiguity, and no need to have a special signed version.If you are storing less than a full 32-bit word, there is nothing you can do with the extra bits in the register except throw them away (ignore them).
I think this convention is only for the add and subtract instructions. For the other instructions, signed/unsigned indicates whether sign-extension will be performed.
If an offset is sign-extended, it only makes sense to do it before adding it to the base address. I think a review of two's complement arithmetic will make this clear.