The Archive Base

Editorial Team
Asked: May 31, 2026

I am working on a project which is basically a knowledge based question answering


I am working on a project that is essentially a knowledge-based question answering system. The system takes a query from the user, downloads the relevant documents from Wikipedia, strips all the HTML tags, and extracts the plain text. It then tokenizes each document into sentences and forms the term-document (TD) matrix, treating each sentence as a document and adding the query as one more sentence. Stemming is applied when the TD matrix is built. The TD matrix is then passed to the pLSA (Probabilistic Latent Semantic Analysis) algorithm. Finally, the system calculates the cosine similarity between each sentence vector and the query vector, and displays the most similar sentence as the answer.
The problem is that it does display a result, but not the most relevant one. Where am I going wrong? Is the strategy I am following correct, or is there another algorithm that might help?
Below are some of the questions and the answers returned by my system:

What is photosynthesis?
ANSWER  1 :   The stroma contains stacks (grana) of thylakoids, which are the site of photosynthesis 

ANSWER  2 :   Factors leaf is the primary site of photosynthesis in plants 

ANSWER  3 :   Samuel Ruben and Martin Kamen used radioactive isotopes to determine that the oxygen liberated in photosynthesis came from the water 

ANSWER  4 :   In plants, algae and cyanobacteria, photosynthesis releases oxygen 

Another question

What is Artificial Intelligence?
ANSWER  1 :   the problem of creating 'artificial intelligence' will substantially be solved" 

ANSWER  2 :   37 The leading-edge definition of artificial intelligence research is changing over time 

ANSWER  3 :   Stories of these creatures and their fates discuss many of the same hopes, fears and ethical concerns that are presented by artificial intelligence 

ANSWER  4 :   History of artificial intelligence and Timeline of artificial intelligence Thinking machines and artificial beings appear in Greek myths , such as Talos of Crete , the bronze robot of Hephaestus , and Pygmalion's Galatea 13 Human likenesses believed to have intelligence were built in every major civilization 

Another question

Who is a hacker?

ANSWER  1 :   19 Hackers (short stories) Helba from the  

ANSWER  2 :   16 Rafael Núñez aka RaFa was a notorious most wanted hacker by the FBI since 2001 

ANSWER  3 :   Often, this type of 'white hat' hacker is called an ethical hacker 
ANSWER  4 :   Hackers also commonly use port scanners  

yet another run

What is biology?
ANSWER  1 :   Molecular biology is the study of biology at a molecular level 

ANSWER  2 :   molecular biology studies the complex interactions of systems of biological molecules 

ANSWER  3 :   The similarities and differences between cell types are particularly relevant to molecular biology 

ANSWER  4 :   Contents History Foundations of modern biology 2 
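
For reference, the ranking step described in the question (one term-count vector per sentence, then cosine similarity against the query vector) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the asker's implementation: it omits stemming and the pLSA step entirely and ranks on raw term counts, and the helper names `tokenize`, `cosine`, and `rank_sentences` are made up for this example. The three candidate sentences are taken from the sample output above.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only (no stemming here)."""
    return re.findall(r"[a-z]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank_sentences(query, sentences):
    """Return (score, sentence) pairs, most similar to the query first."""
    query_vec = Counter(tokenize(query))
    scored = [(cosine(query_vec, Counter(tokenize(s))), s) for s in sentences]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

# Candidate sentences copied from the sample output in the question.
sentences = [
    "In plants, algae and cyanobacteria, photosynthesis releases oxygen",
    "Hackers also commonly use port scanners",
    "Molecular biology is the study of biology at a molecular level",
]
ranking = rank_sentences("What is photosynthesis?", sentences)
for score, sentence in ranking:
    print(f"{score:.3f}  {sentence}")
```

In this sketch the biology sentence gets a non-zero score for a photosynthesis query only because it shares the word "is" with the query. Common words like this may be one source of weakly relevant matches, so filtering stop words or weighting terms before building the matrix could be worth trying.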
1 Answer

  1. Editorial Team
    Added an answer on May 31, 2026 at 6:56 pm

    I think it will be difficult to improve your system much if you stick to a purely statistical approach. From a statistical NLP standpoint, you are doing the right things. What you can still do is fine-tune some parameters. To do that, build a training corpus that tells the system which answer is the right one for each question, then see which value each parameter has to take to return that answer.

    That said, I don’t think fine-tuning parameters will improve your accuracy by more than 20% to 30%.
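
The tuning loop described above can be sketched as a small grid search over labeled examples. Everything specific here is an assumption made for illustration: the scoring rule, the single parameter `alpha` (a hypothetical exponent that penalizes long sentences), and the two labeled examples, which are toy data rather than real system output. A real system would tune whatever parameters it actually exposes (for pLSA, the number of latent topics is the obvious one) and would evaluate on labeled questions held out from tuning.

```python
import re

def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())

def score(query, sentence, alpha):
    """Hypothetical rule: shared-term count divided by sentence length ** alpha."""
    query_terms = set(tokenize(query))
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    overlap = sum(1 for t in tokens if t in query_terms)
    return overlap / (len(tokens) ** alpha)

def best_index(query, sentences, alpha):
    """Index of the highest-scoring sentence (first one wins on ties)."""
    scores = [score(query, s, alpha) for s in sentences]
    return scores.index(max(scores))

def accuracy(labeled, alpha):
    """Fraction of labeled questions whose top-ranked sentence is the gold one."""
    hits = sum(
        1 for query, sentences, gold in labeled
        if best_index(query, sentences, alpha) == gold
    )
    return hits / len(labeled)

def tune(labeled, candidates):
    """Return the candidate value of alpha with the best accuracy."""
    return max(candidates, key=lambda alpha: accuracy(labeled, alpha))

# Toy training corpus (invented): (question, candidate sentences, index of right answer).
labeled = [
    ("What is biology?",
     ["Biology is the study of life",
      "Molecular biology is the study of biology at a molecular level "
      "and biology of cells"],
     0),
    ("Who is a hacker?",
     ["A hacker is a person who breaks into computer systems",
      "Rafael was a notorious hacker"],
     0),
]

best_alpha = tune(labeled, [0.0, 0.5, 1.0])
print(best_alpha, accuracy(labeled, best_alpha))
```

On this toy corpus only the strongest length penalty ranks both gold sentences first, which is the kind of signal the labeled data is there to provide.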

    If you want to go further, you’ll need a more semantic approach that represents knowledge symbolically. See, for instance, http://www.jfsowa.com/
