I want to implement the parallel prefix sum algorithm using C++. My program should take the input array x[1....N],and it should display the output in the array y[N]. (Note the maximum value of N is 1000.)
So far, I went through many research papers and even the algorithm in Wikipedia.
But my program should also display the output, the steps and also the operations/instructions of each step.
I want the fastest implementation like I want to minimise the number of operations as well as the steps.
For example::
x = {1, 2, 3, 4, 5, 6, 7, 8 } - Input
y = ( 1, 3, 6, 10, 15, 21, 28, 36) - Output
But along with displaying the y array as output, my program should also display the operations of each step. I also refer this thread calculate prefix sum ,but could get much help from it.
Following piece of code will do the job
Following is the output on execution
You can avoid the user input of 1000 elements of array x[] by feeding it from file or so.