The Archive Base

Editorial Team
Asked: May 14, 2026

I have a set of Book objects. The Book class is defined as follows:

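import java.util.ArrayList;

class Book {
    String title;               // e.g. "Javascript for dummies"
    ArrayList<String> taglist;  // tags assumed to be plain strings
}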

Here title is the title of the book, for example: Javascript for dummies.

taglist is the list of tags for the book; for our example: Javascript, jquery, “web dev”, ...

As I said, I have a set of books about different subjects: IT, BIOLOGY, HISTORY, …
Each book has a title and a set of tags describing it.

I have to automatically classify those books into separate sets by topic, for example:

IT BOOKS :

  • Java for dummies
  • Javascript for dummies
  • Learn flash in 30 days
  • C++ programming

HISTORY BOOKS :

  • World wars
  • America in 1960
  • Martin Luther King’s life

BIOLOGY BOOKS :

  • ….

Do you know a classification algorithm or method that applies to this kind of problem?

One solution is to use an external API to determine the category of the text, but the problem is that the books are in different languages: French, Spanish, English, ...

1 Answer

  1. Editorial Team
    Added an answer on May 14, 2026 at 8:15 pm

    This looks like a reasonably straightforward keyword-based classification task. Since you’re using Java, good packages to consider would be Classifier4J, Weka, or Apache Mahout (which started as a Lucene subproject).
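To make "keyword-based" concrete before looking at the libraries, here is a minimal sketch in plain Java. It assumes you maintain a hand-built table from tags to topics; the class name `TagVoteClassifier`, its methods, and the sample keywords are all hypothetical and not part of any package mentioned here. Each tag of a book votes for its topic, and the topic with the most votes wins.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration; not part of Classifier4J, Weka, or Mahout.
class TagVoteClassifier {
    // Maps a normalized tag to a topic. The entries are supplied by the caller.
    private final Map<String, String> tagToTopic = new HashMap<>();

    void addKeyword(String tag, String topic) {
        tagToTopic.put(normalize(tag), topic);
    }

    // Returns the topic matched by the most tags, or "UNKNOWN" if no tag matches.
    // Ties between topics are broken arbitrarily (HashMap iteration order).
    String classify(List<String> tags) {
        Map<String, Integer> votes = new HashMap<>();
        for (String tag : tags) {
            String topic = tagToTopic.get(normalize(tag));
            if (topic != null) {
                votes.merge(topic, 1, Integer::sum);
            }
        }
        String best = "UNKNOWN";
        int bestVotes = 0;
        for (Map.Entry<String, Integer> e : votes.entrySet()) {
            if (e.getValue() > bestVotes) {
                best = e.getKey();
                bestVotes = e.getValue();
            }
        }
        return best;
    }

    private static String normalize(String tag) {
        return tag.trim().toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        TagVoteClassifier c = new TagVoteClassifier();
        c.addKeyword("javascript", "IT");
        c.addKeyword("jquery", "IT");
        c.addKeyword("web dev", "IT");
        c.addKeyword("war", "HISTORY");
        c.addKeyword("guerre", "HISTORY"); // a French tag mapped to the same topic
        System.out.println(c.classify(Arrays.asList("Javascript", "jQuery", "web dev"))); // prints IT
        System.out.println(c.classify(Arrays.asList("Guerre", "1945")));                  // prints HISTORY
    }
}
```

Because the table is keyed on tags rather than on free text, tags in French or Spanish can simply be mapped to the same topic as their English counterparts. The obvious cost is that someone has to build and maintain the table, which is what the learned classifiers below avoid.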

    Classifier4J

    Classifier4J supports classification using naive Bayes and a vector space model.

    Its naive Bayes classifier is reasonably easy to train and to score with, and the package is distributed under the liberal Apache Software License.
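For readers who want to see what naive Bayes does with tags before picking a library, the following is a hand-rolled sketch in plain Java. It is not Classifier4J's API: the class `TagNaiveBayes` and its methods are hypothetical, and it assumes you have a few books already labeled with a topic to train on. It treats each tag as a token, uses add-one (Laplace) smoothing, and works in log space to avoid underflow.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hand-rolled illustration of multinomial naive Bayes over tags.
// This is NOT Classifier4J's API; all names here are hypothetical.
class TagNaiveBayes {
    private final Map<String, Integer> docsPerTopic = new HashMap<>();
    private final Map<String, Map<String, Integer>> tagCounts = new HashMap<>();
    private final Map<String, Integer> tagsPerTopic = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    // Records one labeled book: its topic and its tags.
    void train(String topic, List<String> tags) {
        totalDocs++;
        docsPerTopic.merge(topic, 1, Integer::sum);
        Map<String, Integer> counts = tagCounts.computeIfAbsent(topic, t -> new HashMap<>());
        for (String raw : tags) {
            String tag = raw.trim().toLowerCase(Locale.ROOT);
            counts.merge(tag, 1, Integer::sum);
            tagsPerTopic.merge(topic, 1, Integer::sum);
            vocabulary.add(tag);
        }
    }

    // log P(topic) + sum of log P(tag | topic), with add-one smoothing.
    // The topic must have been seen during training.
    double logScore(String topic, List<String> tags) {
        double score = Math.log((double) docsPerTopic.get(topic) / totalDocs);
        Map<String, Integer> counts = tagCounts.get(topic);
        int total = tagsPerTopic.getOrDefault(topic, 0);
        for (String raw : tags) {
            String tag = raw.trim().toLowerCase(Locale.ROOT);
            int count = counts.getOrDefault(tag, 0);
            score += Math.log((count + 1.0) / (total + vocabulary.size()));
        }
        return score;
    }

    // Returns the highest-scoring topic, or null if nothing was trained.
    String classify(List<String> tags) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String topic : docsPerTopic.keySet()) {
            double s = logScore(topic, tags);
            if (s > bestScore) {
                bestScore = s;
                best = topic;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        TagNaiveBayes nb = new TagNaiveBayes();
        nb.train("IT", Arrays.asList("java", "programming"));
        nb.train("IT", Arrays.asList("javascript", "jquery", "web dev"));
        nb.train("HISTORY", Arrays.asList("war", "europe"));
        nb.train("HISTORY", Arrays.asList("history", "america", "1960"));
        System.out.println(nb.classify(Arrays.asList("javascript", "programming"))); // prints IT
        System.out.println(nb.classify(Arrays.asList("war", "history")));            // prints HISTORY
    }
}
```

Since the model only counts tags, it does not care which language a tag is written in, as long as the training set contains labeled examples with tags in each language you expect to see.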

    Weka

    Weka is a very popular tool for data mining. An advantage of using it is that you can readily experiment with many different machine learning models for categorizing the books into topics, including naive Bayes, decision trees, support vector machines, k-nearest neighbor, logistic regression, and even a rule-set-based learner.
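As an illustration of one of the models listed above, here is a small k-nearest-neighbor sketch in plain Java. It does not use Weka: the class `TagKnn` is hypothetical, and the choice of Jaccard similarity between tag sets is an assumption made for this example, not something Weka prescribes. A new book gets the majority topic among the k labeled books whose tag sets overlap with its own the most.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical k-nearest-neighbor sketch over tag sets; this is not Weka's API.
class TagKnn {
    private static final class Example {
        final Set<String> tags;
        final String topic;

        Example(Set<String> tags, String topic) {
            this.tags = tags;
            this.topic = topic;
        }
    }

    private final List<Example> examples = new ArrayList<>();

    // Adds one labeled book.
    void add(String topic, String... tags) {
        examples.add(new Example(new HashSet<>(Arrays.asList(tags)), topic));
    }

    // Jaccard similarity: size of the intersection divided by size of the union.
    // Defined here as 0 when both sets are empty.
    static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0;
        }
        Set<String> common = new HashSet<>(a);
        common.retainAll(b);
        int union = a.size() + b.size() - common.size();
        return (double) common.size() / union;
    }

    // Majority vote among the k most similar labeled books.
    // Vote ties go to the topic that reached the count first, that is, the
    // topic of the more similar neighbor. Returns null if there are no examples.
    String classify(int k, String... tags) {
        Set<String> query = new HashSet<>(Arrays.asList(tags));
        List<Example> sorted = new ArrayList<>(examples);
        sorted.sort((x, y) -> Double.compare(jaccard(y.tags, query), jaccard(x.tags, query)));
        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        for (Example e : sorted.subList(0, Math.min(k, sorted.size()))) {
            int v = votes.merge(e.topic, 1, Integer::sum);
            if (best == null || v > votes.get(best)) {
                best = e.topic;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        TagKnn knn = new TagKnn();
        knn.add("IT", "java", "programming");
        knn.add("IT", "javascript", "jquery", "web dev");
        knn.add("HISTORY", "war", "europe", "history");
        knn.add("HISTORY", "america", "history", "1960");
        System.out.println(knn.classify(3, "javascript", "web dev", "programming")); // prints IT
        System.out.println(knn.classify(1, "history", "war"));                       // prints HISTORY
    }
}
```

With a toolkit such as Weka you would normally not write this by hand; the sketch is only meant to show what kind of computation a nearest-neighbor model performs on tag data.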

    You’ll find a tutorial on using Weka for text categorization here.

    Weka is, however, distributed under the GPL, so you won’t be able to use it in closed-source software that you want to distribute. You could still use it to back a web service.

    Apache Mahout

    Mahout is designed for doing machine learning on very large datasets. It’s built on top of Apache Hadoop and supports supervised classification using naive Bayes.

    You’ll find a tutorial covering how to use Mahout for text classification here.

    Like Classifier4J, Mahout is distributed under the liberal Apache Software License.
