scanf
is supposed to consume and discard any character in the format string that is not part of a conversion specifier. But its behavior seems different when a non-whitespace, non-conversion character comes first in the format string. Why?
int main() {
int num, num2;
printf("> ");
while (scanf("> %d %d", &num, &num2)) {
printf("You entered the number %d %d.\n", num, num2);
printf("> ");
}
return EXIT_SUCCESS;
}
If you build and run this and enter
> 3 4
at the prompt, it prints the message and the repeated prompt and then quits immediately.
So that means that scanf
returns 2
the first time and then returns 0
before the user can enter another set of tokens. If you remove the >
from the format string, the loop will run until the user enters something not a numeral, which then causes scanf
to return 0
- the behavior I would expect.
Also, if I put that same symbol after the first conversion specifier, the loop continues to run as expected. That is, if the format string has, say, "%d > %d"
, and the user enters
3 > 4
the loop will run again and accept another round of input.
I have not seen any documentation on this behavior.
From some documentation on
fscanf
:While the
fscanf
specifier%d
consumes any and all1 leading whitespace, it does not consume the line feed that follows it, and'>'
does not exactly match that newline character ('\n'
) on subsequent iterations.From the same documentation:
So a leading whitespace in your format specifier will consume the trailing newline character from the input:
Aside: this will loop forever on the truthy return value of
EOF
(a negativeint
, almost universally-1
):You should explicitly check the return value of
scanf
is the expected number of conversion specifiers (i.e.,2
).1. As do all format specifiers except
%c
,%[
, and%n
(assuming no errors occur).