Why does the standard allow my compiler to apply copy elision even when it involves visible side effects, thus breaks the as-if rule?
It is somehow plausible'ish for me when one has guaranteed copy elision, because the actual functionality for copy/move (which would invoke visible changes in the program behaviour) does not necessarily have to exist, but why/how was this before C++17?
Is it because a compiler can not generally detect side effects (I don't know whether this is possible)?
The cases that allow for this optimization involve copies of temporaries. Conceptually, these should have no visible effect but the language allows class writers to put whatever they want in a copy constructor.
As such, copy constructors might sometimes actually have visible side effects. Strictly speaking, the as-if rule may not be applicable.
It was deemed that this optimization was useful enough and the harm minimal enough to be worth including in the language. Consider that this optimization predates move semantics where return values would always be copied.