The Archive Base

Editorial Team
Asked: May 27, 2026

I have a few algorithms that extract and rank keywords [both terms and bigrams

I have a few algorithms that extract and rank keywords [both terms and bigrams] from a paragraph [most are based on the tf-idf model].

I am looking for an experiment to evaluate these algorithms. The experiment should assign each algorithm a grade indicating how good it was [on the evaluation set, of course].

I am looking for an automatic / semi-automatic method to evaluate each algorithm’s results, and an automatic / semi-automatic method to create the evaluation set.

Note: These experiments will be run offline, so efficiency is not an issue.

1 Answer

  1. Editorial Team
    Added an answer on May 27, 2026 at 5:47 am

    The classic way to do this is to define, for each paragraph, the set of keywords you want the algorithms to find, and then measure how well each algorithm does against that set, e.g. (generated_correct - generated_not_correct) / total_generated (see update, this is nonsense). Once this ground truth is defined, the evaluation is automatic. I guess constructing the ground truth is what you also want to automate when you talk about creating the evaluation set? That's a bit more tricky.
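To see why the formula above falls short, here is a minimal Python sketch of it. The function name `naive_score` and the sample keyword lists are made up for illustration. It shows that an algorithm returning a single correct keyword gets the same score as one that finds every keyword in the ground truth.

```python
def naive_score(generated, gold):
    """(generated_correct - generated_not_correct) / total_generated."""
    generated = set(generated)
    if not generated:
        return 0.0
    correct = len(generated & set(gold))
    incorrect = len(generated) - correct
    return (correct - incorrect) / len(generated)


# Hypothetical ground truth and two hypothetical algorithm outputs.
gold = {"keyword", "ranking", "tf-idf", "bigram", "evaluation"}
cautious = ["tf-idf"]                                              # finds 1 of 5
thorough = ["keyword", "ranking", "tf-idf", "bigram", "evaluation"]  # finds 5 of 5

print(naive_score(cautious, gold))  # 1.0
print(naive_score(thorough, gold))  # 1.0
```

Both outputs score 1.0, so the measure says nothing about how much of the ground truth was covered.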

    Generally, if there were a way to generate keywords automatically that was good enough to serve as ground truth, you should use that as your algorithm ;). It sounds cheeky, but it's a common problem. When you evaluate one algorithm using the output of another algorithm, something is probably going wrong (unless you specifically want to benchmark against that algorithm).

    So you might start harvesting key words from common sources. For example:

    • Download scientific papers that have a keyword section. Check whether those keywords actually appear in the text; if they do, take the section of text that contains them and use the keyword section as ground truth.

    • Get blog posts and check whether the terms in the title appear in the text; if they do, use the words in the title (always minus stop words, of course) as ground truth.

    • …

    You get the idea. Unless you want to employ people to manually generate keywords, I guess you’ll have to make do with something like the above.
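As a rough sketch of the blog-post idea, the Python snippet below derives a ground-truth keyword list from a title. The tiny stop-word list, the tokenizer, the function names and the sample title and body are all assumptions made for illustration. A real setup would use a fuller stop-word list and probably stemming.

```python
import re

# Tiny illustrative stop-word list (an assumption; use a fuller list in practice).
STOP_WORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def tokenize(text):
    """Lowercase the text and split it into word tokens, keeping hyphenated terms."""
    return re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", text.lower())


def ground_truth_from_title(title, body):
    """Return title terms, minus stop words, that also occur in the body."""
    body_terms = set(tokenize(body))
    seen = set()
    keywords = []
    for term in tokenize(title):
        if term in STOP_WORDS or term not in body_terms or term in seen:
            continue
        seen.add(term)
        keywords.append(term)
    return keywords


# Hypothetical blog post.
title = "Evaluating Keyword Extraction with Precision and Recall"
body = ("We compare keyword extraction algorithms and report "
        "precision and recall for each.")

print(ground_truth_from_title(title, body))
# ['keyword', 'extraction', 'precision', 'recall']
```

"Evaluating" is dropped because it never appears in the body, which is the "check if the terms appear in the text" step from the list above.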

    Update
    The evaluation function mentioned above is stupid: it does not take into account how many of the available keywords have been found. Instead, the way to judge a list of relevant and irrelevant results is to use precision and recall. Precision rewards the absence of irrelevant results; recall rewards the presence of relevant results. This gives you two measures. To combine them into a single measure, use the F-measure, which is their harmonic mean with an optional weighting. Alternatively, use Precision@X, where X is the number of top-ranked results you want to consider. Note that Precision@X equals Recall@X only when the ground truth contains exactly X keywords, since both count the same hits but divide by X and by the number of ground-truth keywords respectively. You also need a sensible X: if an algorithm returns fewer than X keywords in some cases, those results will be punished for never providing an Xth keyword. In the literature on tag recommendation, for example, which is very similar to your case, the F-measure and P@5 are often used.

    http://en.wikipedia.org/wiki/F1_score

    http://en.wikipedia.org/wiki/Precision_and_recall
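The measures from the update can be sketched in a few lines of Python. The function names and the sample data are illustrative assumptions, and `beta` is the optional weighting of the F-measure (`beta=1` gives the usual F1).

```python
def precision_recall(generated, gold):
    """Set-based precision and recall of generated keywords against the gold set."""
    generated, gold = set(generated), set(gold)
    hits = len(generated & gold)
    precision = hits / len(generated) if generated else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall."""
    b2 = beta * beta
    denom = b2 * precision + recall
    # Guard against division by zero when there are no hits at all.
    return (1 + b2) * precision * recall / denom if denom else 0.0


def precision_at(ranked, gold, x):
    """Fraction of the top-x ranked keywords that are in the gold set.

    Dividing by x (not by the number returned) means a list shorter
    than x is penalised for the keywords it never provided.
    """
    gold = set(gold)
    hits = sum(1 for kw in ranked[:x] if kw in gold)
    return hits / x


# Hypothetical ground truth and ranked output of one algorithm.
gold = {"keyword", "ranking", "tf-idf", "evaluation"}
ranked = ["tf-idf", "paragraph", "keyword", "model", "ranking"]

p, r = precision_recall(ranked, gold)
print(p, r)                          # 0.6 0.75
print(round(f_measure(p, r), 3))     # 0.667
print(precision_at(ranked, gold, 5)) # 0.6
```

To grade an algorithm on a whole evaluation set, one option is to average these per-paragraph scores over all paragraphs.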
