Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 5954935
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 22, 20262026-05-22T18:02:32+00:00 2026-05-22T18:02:32+00:00

I am planning to develop a web-based application which could crawl wikipedia for finding

  • 0

I am planning to develop a web-based application which could crawl wikipedia for finding relations and store it in a database. By relations, I mean searching for a name say,’Bill Gates’ and find his page, download it and pull out the various information from the page and store it in a database. Information may include his date of birth, his company and a few other things. But I need to know if there is any way to find these unique data from the page, so that I could store them in a database. Any specific books or algorithms would be greatly appreciated. Also mentioning of good opensource libraries would be helpful.

Thank You

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-22T18:02:33+00:00Added an answer on May 22, 2026 at 6:02 pm

    If you haven’t already, you should have a look at DBpedia. Many categories of wiki articles have “Infoboxes” for the kinds of information you describe, and they’ve made a database out of it:

    http://en.wikipedia.org/wiki/DBpedia

    You might also leverage some of the information in Metaweb’s Freebase (which overlaps and I believe may even integrate the info from DBpedia.) They have an API for querying their graph database, and there’s a Python wrapper for it called freebase-python.

    UPDATE: Freebase is no more; they were acquired by Google and eventually folded into the Google Knowledge Graph. There is an API but I don’t think they have anything like the formal sync’ing Freebase had with public sources like Wikipedia. I’m personally disappointed in how this looks to have turned out. :-/

    As for the natural language processing bit, if you do make headway on that problem you might consider these databases as repositories for any information you do mine.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm planning to develop a web application which will have many static pages (about,
I'm planning to develop a web-services (SOAP to C++ client) in Java with Metro/Hibernate
I was planning to use url routing for a Web Forms application. But, after
I'm planning to develop a web app where users will list their site/blog. When
I am planning to develop a very simple java application (not mobile, but desktop
I am planning to develop an gyroscope based project like rotating an opengl texture
I am planning to develop a function in C# which will return a two
I'm planning to develop an ASP.NET server control to provide asynchronous username availability validation
I'm planning to write a program in Ruby to analyse some data which has
I'm planning to develop a small business app that I'd like to be deployable

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.