The Archive Base

Editorial Team
Asked: May 18, 2026

Having the HTML of a webpage, what would be the easiest strategy to get

Given the HTML of a webpage, what would be the easiest strategy to get the text that is visible on the corresponding page? I thought of extracting everything between <a>...</a> and <p>...</p>, but that is not working very well.

Keep in mind that this is for a school project, so I am not allowed to use any kind of external library (the idea is that I have to do the parsing myself). Also, this will run while the HTML of the page is being downloaded; that is, I can't assume I already have the whole HTML page. It has to show the extracted visible words as the HTML arrives.

Also, it doesn't have to work for ALL cases, just be satisfactory most of the time.


1 Answer

  1. Editorial Team
    Added an answer on May 18, 2026 at 8:48 pm

    Extracting literally all the text that is visible sounds like a big ask for a school project, since visibility depends not only on the HTML itself but also on any in-page or external styling. One solution would be to simply strip the HTML tags from the input, though that wouldn't strictly meet your requirements as you have stated them.

    Assuming that near enough is good enough, you could make a first pass to strip out the entire content of elements that you know won't be visible (such as script and style), and a second pass to remove the remaining tags themselves.
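
Since the page has to be processed while it downloads, the two passes can be folded into one small state machine that is fed chunks as they arrive. The following is only a rough sketch in Python using nothing beyond the language itself; the class name `VisibleTextExtractor`, the `feed` method, and the sample chunks are all made up for illustration. It assumes closing tags are written exactly as `</script>` or `</style>`, it does not decode entities such as `&amp;`, and a `>` inside an HTML comment or attribute value will confuse it.

```python
class VisibleTextExtractor:
    """Incremental tag stripper; call feed() with chunks in arrival order."""

    SKIPPED = ("script", "style")  # elements whose content is never displayed

    def __init__(self):
        self.in_tag = False   # True while between '<' and '>'
        self.tag_chars = []   # characters of the tag currently being read
        self.skipping = ""    # closing tag we are waiting for, e.g. "</style>"
        self.tail = ""        # last few characters seen while skipping

    def feed(self, chunk):
        """Return the visible text contained in this chunk."""
        out = []
        for ch in chunk:
            if self.skipping:
                # Inside <script>/<style>: drop everything up to the closing tag.
                self.tail = (self.tail + ch.lower())[-len(self.skipping):]
                if self.tail == self.skipping:
                    self.skipping = ""
                    self.tail = ""
            elif self.in_tag:
                if ch == ">":
                    self.in_tag = False
                    self._handle_tag("".join(self.tag_chars))
                    out.append(" ")  # treat every tag as a word separator
                else:
                    self.tag_chars.append(ch)
            elif ch == "<":
                self.in_tag = True
                self.tag_chars = []
            else:
                out.append(ch)
        return "".join(out)

    def _handle_tag(self, tag):
        """Start skipping when an opening <script> or <style> tag is seen."""
        parts = tag.strip().lower().split()
        name = parts[0].rstrip("/") if parts else ""
        if name in self.SKIPPED:
            self.skipping = "</" + name + ">"


# Hypothetical chunks, split mid-tag and mid-word to mimic a download.
chunks = [
    "<html><head><sty",
    "le>p { color: red }</style><script>if (a < b) x();</scr",
    "ipt></head><body><p>Hello <a href='#'>wor",
    "ld</a></p></body></html>",
]
extractor = VisibleTextExtractor()
text = "".join(extractor.feed(c) for c in chunks)
print(" ".join(text.split()))  # prints: Hello world
```

Because the state lives on the object rather than in local variables, a tag or a word that is split across two chunks is still handled correctly. Emitting a space for every tag is a simplification: it keeps words in adjacent block elements apart, at the cost of also splitting text such as `H<sub>2</sub>O`.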
