dfs.data.dir allows more than one directory for datanode to store data blocks. When data is copied to the HDFS, how is the data distributed across the directories?
dfs.data.dir allows more than one directory for datanode to store data blocks. When data
Share
When
dfs.data.dirhas multiple values, data is copied to the HDFS in a round-robin fashion. If one of the directory’s disk is full, round-robin data copy will continue on the rest of the directories.