The Archive Base

Editorial Team
Asked: June 18, 2026

Is there a discipline, framework, or tool set for programming with information from HTML pages?

Is there a discipline, framework, or tool set for programming with information from HTML pages as part of the input data? Something like a meta search engine. How do you parse the web page?

I would prefer Java or Flex/Flash, or some pointers to further reading.

Thank you!

UPDATE February 7 2013

Thank you for your answers! Web scraping was the term I was looking for!

I found this awesome Java library, http://jsoup.org/, via the post "Web scraping with Java".

I'm still looking for a Flex equivalent; I'll update as soon as I find it.

1 Answer

  1. Editorial Team
    Added an answer on June 18, 2026 at 2:10 am

    I think your question is a bit too vague to get good answers, and I don't have Java/Flex experience myself. That said, most languages have library support for making an HTTP request to the resource in question, and most likely some support for parsing the HTML/XML into a data structure that you can pull data from.
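As a rough illustration of the request-then-parse approach in Java, here is a minimal sketch that uses only the JDK: `java.net.http.HttpClient` (Java 11+) to fetch a page and the Swing HTML parser to walk its tags. The class name `PageLinks`, its method names, and the User-Agent string are hypothetical, chosen just for this example. A dedicated library such as jsoup (mentioned in the question) would likely be more tolerant of real-world markup.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper class; the name and methods are illustrative only.
final class PageLinks {

    // Fetch the raw HTML of a page with the JDK's built-in HTTP client.
    static String fetch(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url))
                .header("User-Agent", "example-scraper/0.1") // placeholder value
                .GET()
                .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    // Parse HTML text and collect the href attribute of every <a> tag,
    // in document order.
    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };
        try {
            // The third argument tells the parser to ignore charset declarations.
            new ParserDelegator().parse(new StringReader(html), callback, true);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return links;
    }
}
```

A caller would typically chain the two steps, for example `PageLinks.extractLinks(PageLinks.fetch(someUrl))`, and then resolve relative links against the page URL before following them.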

    Depending on what you’re trying to get out of it, you may just be able to do simple string searches on the HTTP response for what you need. This is essentially what @pablochan is recommending when he suggests the wiki page on web scraping.
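The simple string-search approach could look something like the sketch below, which pulls out whatever sits between two known markers in the response body. The class `TextBetween` and its method are hypothetical names for this example. It assumes the markers appear exactly as written (same case, no extra attributes), which is why this technique tends to break when the page layout changes.

```java
import java.util.Optional;

// Hypothetical helper; illustrative only.
final class TextBetween {

    // Return the trimmed text between the first occurrence of startMarker
    // and the next occurrence of endMarker, or empty if either is missing.
    static Optional<String> between(String text, String startMarker, String endMarker) {
        int start = text.indexOf(startMarker);
        if (start < 0) {
            return Optional.empty();
        }
        start += startMarker.length();
        int end = text.indexOf(endMarker, start);
        if (end < 0) {
            return Optional.empty();
        }
        return Optional.of(text.substring(start, end).trim());
    }
}
```

For instance, `TextBetween.between(body, "<title>", "</title>")` would return the page title when the tags are written in exactly that form.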

    Be aware that some services and sites are designed to confound attempts to scrape their pages, and may list scraping as a violation of their terms of service. Even if scraping works, sending requests too frequently may get your IP blocked or trigger other countermeasures.

    Most static sites won’t have protections like these, but large services may well.
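One way to avoid hitting a site too frequently is to enforce a minimum gap between requests. The sketch below is one possible approach, not a guarantee against blocking; the class `RequestThrottle` and the interval you pass in are assumptions for illustration, and a site's terms of service or robots.txt may call for a different limit or forbid scraping entirely.

```java
// Hypothetical helper; illustrative only.
final class RequestThrottle {
    private final long minIntervalMillis;
    private long nextAllowedAt = 0L; // earliest time the next request may start

    RequestThrottle(long minIntervalMillis) {
        this.minIntervalMillis = minIntervalMillis;
    }

    // Given the current time, return how long the caller should wait before
    // sending a request, and reserve the following slot.
    synchronized long reserve(long nowMillis) {
        long wait = Math.max(0L, nextAllowedAt - nowMillis);
        nextAllowedAt = nowMillis + wait + minIntervalMillis;
        return wait;
    }

    // Block the calling thread until the next request is allowed.
    void acquire() throws InterruptedException {
        long wait = reserve(System.currentTimeMillis());
        if (wait > 0) {
            Thread.sleep(wait);
        }
    }
}
```

A scraper would call `acquire()` on a shared `RequestThrottle` immediately before each HTTP request, so that requests are spaced at least `minIntervalMillis` apart.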
