I have started learning Data Mining and wish to create a small project in

Question

0

Asked: May 25, 20262026-05-25T14:13:14+00:00 2026-05-25T14:13:14+00:00

I have started learning Data Mining and wish to create a small project in

0

I have started learning Data Mining and wish to create a small project in C++/Java that allows me to utilize a database, say from twitter and then publish a particular set of results (for eg. all the news items on a feed). I want to know how to go about it? Where should I start?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-25T14:13:15+00:00

This is a really broad question, so it’s hard to answer. Here are some things to consider:

Where are you going to get the data? You mention twitter, but you’ll still need to collect the data in some way. There are probably libraries out there for listening to twitter streams, or you could probably buy the data if someone is selling it.
Where are you going to store the data? Depending on how much you’ll have and what you plan to do with it, a traditional relational database may or may not be the best fit. You may be better off with something that supports running mapreduce jobs out-of-the box.

Based on the answers to those questions, the choice of programming languages and libraries will be easier to make.

If you’re really set on Java, then I think a Hadoop cluster is probably what you want to start out with. It supports writing mapreduce jobs in Java, and works as an effective platform for other systems such as HBase, a column-oriented datastore.

If your data are going to be fairly regular (that is, not much variation in structure from one record to the next), maybe Hive would be a better fit. With Hive, you can write SQL-like queries, given only data files as input. I’ve never used Mahout, but I understand that its machine learning capabilities are suited for data mining tasks.

These are just some ideas that come to mind. There are lots of options out there and choosing between them has as much to do with the particular problem you’re trying to solve and your own personal tastes as anything else.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have started learning Data Mining and wish to create a small project in

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply