So, I've been having a bit of confusion regarding linking of various things. For this question I'm going to focus on opaque pointers.
I'll illustrate my confusion with an example. Let's say I have these three files:
main.c
#include <stdio.h>
#include "obj.h" //this directive is replaced with the code in obj.h
int main()
{
myobj = make_obj();
setid(myobj, 6);
int i = getid(myobj);
printf("ID: %i\n",i);
getchar();
return 0;
}
obj.c
#include <stdlib.h>
struct obj{
int id;
};
struct obj *make_obj(void){
return calloc(1, sizeof(struct obj));
};
void setid(struct obj *o, int i){
o->id = i;
};
int getid(struct obj *o){
return o->id;
};
obj.h
struct obj;
struct obj *make_obj(void);
void setid(struct obj *o, int i);
int getid(struct obj *o);
struct obj *myobj;
Because of the preprocessor directives, these would essentially become two files:
(I know technically stdio.h and stdlib.h would have their code replace the preprocessor directives, but I didn't bother to replace them for the sake of readability)
main.c
#include <stdio.h>
//obj.h
struct obj;
struct obj *make_obj(void);
void setid(struct obj *o, int i);
int getid(struct obj *o);
struct obj *myobj;
int main()
{
myobj = make_obj();
setid(myobj, 6);
int i = getid(myobj);
printf("ID: %i\n",i);
getchar();
return 0;
}
obj.c
#include <stdlib.h>
struct obj{
int id;
};
struct obj *make_obj(void){
return calloc(1, sizeof(struct obj));
};
void setid(struct obj *o, int i){
o->id = i;
};
int getid(struct obj *o){
return o->id;
};
Now here's where I get a bit confused. If I try to make a struct obj in main.c, I get an incomplete type error, even though main.c has the declaration struct obj;
.
Even if I change the code up to use extern
, It sill won't compile:
main.c
#include <stdio.h>
extern struct obj;
int main()
{
struct obj myobj;
myobj.id = 5;
int i = myobj.id;
printf("ID: %i\n",i);
getchar();
return 0;
}
obj.c
#include <stdlib.h>
struct obj{
int id;
};
So far as I can tell, main.c and obj.c do not communicate structs (unlike functions or variables for some which just need a declaration in the other file).
So, main.c has no link with struct obj types, but for some reason, in the previous example, it was able to create a pointer to one just fine struct obj *myobj;
. How, why? I feel like I'm missing some vital piece of information. What are the rules regarding what can or can't go from one .c file to another?
ADDENDUM
To address the possible duplicate, I must emphasize, I'm not asking what an opaque pointer is but how it functions with regards to files linking.
Converting comments into a semi-coherent answer.
The problems with the second
main.c
arise because it does not have the details ofstruct obj
; it knows that the type exists, but it knows nothing about what it contains. You can create and use pointers tostruct obj
; you cannot dereference those pointers, not even to copy the structure, let alone access data within the structure, because it is not known how big it is. That's why you have the functions inobj.c
. They provide the services you need — object allocation, release, access to and modification of the contents (except that the object release is missing; maybefree(obj);
is OK, but it's best to provide a 'destructor').Note that
obj.c
should includeobj.h
to ensure consistency betweenobj.c
andmain.c
— even if you use opaque pointers.At the moment, you could have
struct obj *make_obj(int initializer) { … }
inobj.c
, but because you don't includeobj.h
inobj.c
, the compiler can't tell you that your code inmain.c
will call it without the initializer — leading to quasi-random (indeterminate) values being used to 'initialize' the structure. If you includeobj.h
inobj.c
, the discrepancy between the declaration in the header and the definition in the source file will be reported by the compiler and the code won't compile. The code inmain.c
wouldn't compile either — once the header is fixed. The header files are the 'glue' that hold the system together, ensuring consistency between the function definition and the places that use the function (references). The declaration in the header ensures that they're all consistent.As to why you can have pointers to types without knowing all the details, it is an important feature of C that provides for the interworking of separately compiled modules. All pointers to structures (of any type) must have the same size and alignment requirements. You can specify that the structure type exists by simply saying
struct WhatEver;
where appropriate. That's usually at file scope, not inside a function; there are complex rules for defining (or possibly redefining) structure types inside functions. And you can then use pointers to that type without more information for the compiler.Without the detailed body of the structure (
struct WhatEver { … };
, where the braces and the content in between them are crucial), you cannot access what's in the structure, or create variables of typestruct WhatEver
— but you can create pointers (struct WhatEver *ptr = NULL;
). This is important for 'type safety'. Avoidvoid *
as a universal pointer type when you can, and you usually can avoid it — not always, but usually.Yes.
The structures are all different, but the pointers to them are all the same size.
If the compiler knows the details of the structure (there's a definition of the structure type with the
{ … }
part present), then the pointer can be dereferenced (and variables of the structure type can be defined, as well as pointers to it, of course). If the compiler doesn't know the details, you can only define (and use) pointers to the type.You avoid
void *
because you lose all type safety. If you have the declaration:then the compiler can't complain if you write the calls:
If you have the declaration:
then the compiler will tell you when you call it with a wrong pointer type, such as
stdin
(aFILE *
) orargv[1]
(achar *
if you're inmain()
), etc. or if you assign to the wrong type of pointer variable.