When I connected my Hadoop cluster to Amazon storage and downloaded files to HDFS, I found s3:// did not work. When looking for some help on the Internet I found I can use S3n. When I used S3n it worked. I do not understand the differences between using S3 and S3n with my Hadoop cluster, can someone explain?
When I connected my Hadoop cluster to Amazon storage and downloaded files to HDFS,
Share
I think your main problem was related with having
S3andS3nas two separate connection points for Hadoop.s3n://means “A regular file, readable from the outside world, at this S3 url”.s3://refers to an HDFS file system mapped into an S3 bucket which is sitting on AWS storage cluster. So when you were using a file from Amazon storage bucket you must be using S3N and that’s why your problem is resolved. The information added by @Steffen is also great!!