I know this is not very pretty code and that I’m sure there is

Question

0

Asked: May 30, 20262026-05-30T02:29:38+00:00 2026-05-30T02:29:38+00:00

I know this is not very pretty code and that I’m sure there is

0

I know this is not very pretty code and that I’m sure there is an easier way but I’m more concerned on why python is not stripping the characters I requested it to.

import urllib, sgmllib


zip_code = raw_input('Give me a zip code> ')
url = 'http://www.uszip.com/zip/' + zip_code
print url

conn = urllib.urlopen('http://www.uszip.com/zip/' + zip_code)

i = 0
while i < 1000:
    for line in conn.fp:
            if i == 1:
                print line[7:-10]
                i += 1
            elif i == 344:
                line1 = line.strip()
                line2 = line1.strip('<td>') #its not stripping the characters 
                print line2[17:-60]
                i += 1
            else:
            i += 1

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-30T02:29:41+00:00

The way you call it, it should remove any occurrence of the <, >, t, and d characters, and only at the beginning or end of the string:

>>> '<p>some test</p>'.strip('<td>')
'p>some test</p'

If you want to remove every occurrence of the substring <td>, use replace:

>>> '<td>some test</td>'.replace('<td>', '')
'some test</td>'

Note that if you want to use that for some kind of input sanitization, it can be easily circumvented:

>>> '<td<td>>some test</td>'.replace('<td>', '')
'<td>some test</td>'

This is only one of many ways how people typically get screwed if they try to write their own HTML parsing code, so maybe you rather want to use a HTML parsing library like BeautifulSoup or an XML parser like lxml.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I know this is not very pretty code and that I’m sure there is

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply