Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7541333
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 30, 20262026-05-30T07:51:54+00:00 2026-05-30T07:51:54+00:00

I have a webpage that I read using Python and BeautifulSoup, say soup=BeautifulSoup(urllib2.urlopen(site)) .

  • 0

I have a webpage that I read using Python and BeautifulSoup, say soup=BeautifulSoup(urllib2.urlopen(site)).

I’m trying to grab a snippet of the site and parse it, so I use a pTag = soup.find("p", {"class":"secondary"}), which results in the following content.

<p class="secondary">
              Some address and street
              <br />
              City, State, ZIP
              (some) phone-number
             </p>

I would like to basically have variables address1, address2, and phone such that:

address1= "Some address and street"
address2= "City, State, ZIP"
phone= "(some) phone-number"

I’m not sure how to read the rows of a soup to selectively pick rows 1, 3, 4 (assuming starting row 0), but then again I’m also open to other ways of getting the data I want.

Thanks in advance! 🙂

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-30T07:51:55+00:00Added an answer on May 30, 2026 at 7:51 am

    Assuming address contains your raw address.

    <p class="secondary">
                  Some address and street
                  <br />
                  City, State, ZIP
                  (some) phone-number
                 </p>
    

    Then you can replace the break line with a comma, before finally splitting by comma. This is not ideal but for these scenarios when there is no clear separation between elements (spans, id’s etc…) then it all comes down to positional checking.

    address.find("br").replaceWith(",")
    addressComponents = address.text.split(",")
    

    That gives you the following four components in the addressComponents list.

    Some address and street
    City
     State
     ZIP
                  (some) phone-number
    

    As there is no break line for the ZIP and phone number there appears to be a newline character inserted. So to split the final component:

    addressSplit = addressComponents[3].split("\n")
    print addressSplit[0] # Zip code
    print addressSplit[1].strip() # Phone number
    
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Hello I am trying to read my xml document using xpath. I have been
I have a Web Handler that I'm using to read a file and then
I've read on the webpage of Mono that they are using the Boehm GC
I have a webpage that pulls information from a database, converts it to .csv
I have a webpage that is taking way too long and need to optimize
I have a webpage that implements a set of tabs each showing different content.
I have a webpage that I use h1 tags multiple times within various DIVs
I have a webpage that redirects to another webpage like this: http://www.myOtherServer.com/Sponsor.php?RedirectPage=http://mylink.com/whereIwasgoingtogo.html Then the
I have a webpage that displays a very large list of data. Since this
I have a webpage that I don't have the ability to change the underlying

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.