I created a program in C that writes a program as its output.
The objective is to test aspects of performance in monolithic programs.
The first test was configured with 10 000 iterations and the result program compiled and ran.
The second test with 100 000 iterations is compiling (3030 minutes until now) in Ubuntu 12.04 x86_64 in an i7 3770 with 16 GB RAM (16 GB SWAP).
I know that a parse complexity is from O(n**2) to O(n**3), but this is taking too long.
In the worst scenario, 1000 times more compile timing.
Is consuming 35,2% of memory and still increases.
My question is:
-
Does GCC have limitations in the number of variables per module or module size?
-
Is this a bug?
The original program generator is:
#include <stdio.h>
#define MAX_INTERACTION 100000
int main(int argc, char **argv)
{
FILE * fp;
fp = fopen("source.c","w");
fprintf(fp,"#include <stdio.h> \n \n \n");
fprintf(fp,"int main(int argc, char **argv) \n");
fprintf(fp,"{ \n");
// local variables and exchange variables
for (int i=0; i< MAX_INTERACTION ; ++i)
{
// passed variable, return label , local variable
fprintf(fp," int pv%d , rl%d, loc%d ; \n",i,i,i);
}
fprintf(fp," int pvd =0 ;\n \n \n");
//code blocks
for (int i=0; i< MAX_INTERACTION ; ++i)
{
fprintf(fp," block%d : \n",i);
fprintf(fp," loc%d = pv%d +1 ; \n",i,i);
fprintf(fp," goto rl%d; \n",i);
}
//call blocks
for (int i=1; i< MAX_INTERACTION +1; ++i)
{
fprintf(fp," pvd = pv%d ;\n",(i-1));
fprintf(fp," goto block%d; \n",(i-1));
fprintf(fp," rl%d: \n",(i-1));
}
fprintf (fp,"printf( \"Concluido \\n \"); \n");
fprintf(fp,"}\n");
fclose(fp);
}
I did some timing on a MacBook Pro with 8 GB main memory (2.3 GHz Intel Core i7).
Modified the generator to take a parameter indicating the program size, and then ran it repeatedly:
Clearly, at the small sizes, there is overhead in simply getting the compiler run (especially the first run!), reading and writing files, etc. Redoing in multiples of 10,000 up to 90,000, I got the results in the table below. The slowdown is appreciable, especially at between 20,000 and 30,000. I also got fairly significant variability in the times for any given size. And making the code configurable and then running incrementally when you run into a problem is only sensible.
Your mileage will vary, of course.
For reference, the GCC I’m using is:
The compilation command line is:
A home-built GCC 4.7.1 took 34.6 s for 10K, and 150.9 s for 20K.