I have sets of these files-
objectA-record-data.log file – Contains multiple lines of data with timestamps.
objectA-record-metadata.log file – Contains just one line. Think of it as a metadata for all the records of objectA.
Together they will form one record for processing.
I have been able to process the data file. The mapper sets objectA as the key for all the lines in the data file and reducer processes and prints the text file. Now i want to add metadata also to each of these records. Any ideas how can i do that?
Use DistributedCache:
In the driver/configuration class:
In the mapper: