scalars can only be used with projection i am getting this error while using foreach.How can i resolved this error ? how can i use LIMIT within foreach ? please suggest some
thanks in advance..
Edit (Tichdroma): Copied code from comment
A = LOAD 'part-r-00000';
G = Group A by ($0,$2 );
Y = foreach G generate FLATTEN(group), FLATTEN($1);
sorted = order Y by $0 ASC, $1 DESC;
X = foreach Y {
lim = LIMIT sorted 3;
generate lim;
};
Dump x;
LIMIT is available in Pig 0.9 in the FOREACH nested_op.
If you want the top N element of each group, you might want to try to iterate on each one and individually sort and limit them:
Notice that TOP can be efficient when you just have a column of comparable values (not in this case).