I am wondering how PigStorage in Pig stores data to S3? Does it save

Question


Asked: June 15, 2026

How does PigStorage in Pig store data to S3? Does it save the output to HDFS and then copy it over? Or does each reducer save its output to its own local directory and then copy it to S3? I guess this can't be streaming, since S3 only supports putting whole files or directories.

1 Answer

Editorial Team · Answer 1 · 2026-06-15T05:43:34+00:00

My understanding is that each reducer first writes its output to local disk and then copies that output to S3.

As you correctly point out, since S3 doesn't support streaming writes, a reducer can only copy its output once it has finished processing.
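To make the "write locally, then copy" pattern concrete, here is a minimal Python sketch. It is not Pig's or Hadoop's actual code: `BufferedObjectWriter`, `put_object`, and the in-memory `store` dict are hypothetical names used purely for illustration, with the dict standing in for an S3 bucket. It only shows the idea that nothing becomes visible in the object store until the writer is closed.

```python
import os
import tempfile


class BufferedObjectWriter:
    """Buffer all writes in a local temp file and upload once on close."""

    def __init__(self, key, put_object):
        self.key = key
        # Hypothetical upload callable with signature (key, data_bytes) -> None.
        self._put_object = put_object
        self._tmp = tempfile.NamedTemporaryFile(delete=False)
        self.closed = False

    def write(self, data: bytes) -> None:
        # Records only reach local disk here; nothing is sent remotely yet.
        self._tmp.write(data)

    def close(self) -> None:
        if self.closed:
            return
        self._tmp.close()
        try:
            # The whole object is uploaded in one step, after writing is done.
            with open(self._tmp.name, "rb") as f:
                self._put_object(self.key, f.read())
        finally:
            os.remove(self._tmp.name)
        self.closed = True


def demo():
    store = {}  # in-memory stand-in for a bucket (assumption for this sketch)
    writer = BufferedObjectWriter("output/part-r-00000", store.__setitem__)
    writer.write(b"alice\t1\n")
    writer.write(b"bob\t2\n")
    visible_before_close = "output/part-r-00000" in store
    writer.close()
    return visible_before_close, store
```

Calling `demo()` shows that the key is absent from `store` while the writer is still open and appears only after `close()`, which mirrors a reducer uploading its output once it has finished.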
