My application deals with a large amounts of real time sensor data. The data can be used for big data analysis, which hadoop is of course the best choice to work on. But now I am just thinking whether I should firstly put the data into mongodb, and through the mongodb to hadoop interface to do the big data thing. Is mongodb needed since both of them can store the data and map reduce (hadoop is more advanced)?
May anyone help me to analyze the differences between hadoop and mongodb?
Should I combine them, or just keep hadoop only in this sensor based application architecture?
Hadoop is not a database – it’s a data analysis framework.
But it needs to get its data from somewhere. MongoDB is one possible solution for storing your data for analysis by Hadoop. But whether it’s a good or a bad solution depends on a lot of factors which you didn’t mention in your question. Mostly how your data is structured and if you expect the structure to change in the future.