I am struggling to wrap my head around malloc in c – specifically when it needs to be free()’d. I am getting weird errors in gcc such as:
... free(): invalid next size (fast): ...
when I try to free a char pointer. For example, when reading from an input file, it will crash on certain lines when doing the following:
FILE *f = fopen(file,"r");
char x[256];
while(1) {
if(fgets(x,sizeof x,f)==NULL) break;
char *tmp = some_function_return_char_pointer(x); //OR malloc(nbytes);
// do some stuff
free(tmp); // this is where I get the error, but only sometimes
}
I checked for obvious things, such as x being NULL, but it’s not; it just crashes on random lines.
But my REAL question is – when do I need to use free()? Or, probably more correctly, when should I NOT use free? What if malloc is in a function, and I return the var that used malloc()? What about in a for or while loop? Does malloc-ing for an array of struct have the same rules as for a string/char pointer?
I gather from the errors I’m getting in gcc on program crash that I’m just not understanding malloc and free. I’ve spent my quality time with Google and I’m still hitting brick walls. Are there any good resources you’ve found? Everything I see says that whenever I use malloc I need to use free. But then I try that and my program crashes. So maybe it’s different based on a variable’s scope? Does C free the memory at the end of a loop when a variable is declared inside of it? At the end of a function?
So:
for(i=0;i<100;i++) char *x=malloc(n); // no need to use free(x)?
but:
char *x;
for(i=0;i<100;i++) {
x=malloc(n);
free(x); //must do this, since scope of x greater than loop?
}
Is that right?
Hopefully I’m making sense…
malloc()is C’s dynamic allocator. You have to understand the difference between automatic (scoped) and dynamic (manual) variables.Automatic variables live for the duration of their scope. They’re the ones you declare without any decoration:
int x;Most variables in a C program should be automatic, since they are local to some piece of code (e.g. a function, or a loop), and they communicate via function calls and return values.
The only time you need dynamic allocation is when you have some data that needs to outlive any given scope. Such data must be allocated dynamically, and eventually freed when it is no longer necessary.
The prime usage example for this is your typical linked list. The list nodes cannot possibly be local to any scope if you are going to have generic “insert/erase/find” list manipulation functions. Thus, each node must be allocated dynamically, and the list manipulation functions must ensure that they free those nodes that are no longer part of the list.
In summary, variable allocation is fundamentally and primarily a question of scope. If possible keep everything automatic and you don’t have to do anything. If necessary, use dynamic allocation and take care to deallocate manually whenever appropriate.
(Edit: As @Oli says, you may also want to use dynamic allocation in a strictly local context at times, because most platforms limit the size of automatic variables to a much smaller limit than the size of dynamic memory. Think “huge array”. Exceeding the available space for automatic variables usually has a colourful name such as “pile overrun” or something similar.)