I have to write a program to remove all expressions of the form <word>

Question

0

Asked: May 31, 20262026-05-31T02:04:40+00:00 2026-05-31T02:04:40+00:00

I have to write a program to remove all expressions of the form <word>

0

I have to write a program to remove all expressions of the form <word> and </word> where word is any sequence of letters (lower and upper case) and
Remove all expressions of the form <word ..... > and </word> where word is the same as before. For example, remove <a href=”wwang3.htm” class=”c l”>

Until now my code looks like this:

def remove_1( file_location ):
    """"""

    import re
    file_variable = open( file_location )
    lines = file_variable.read()

    p = re.findall('<.*?>', lines)
    print p

    substitution = re.compile('<.*?>')
    print substitution.subn( ' ', p )

I get an error that points to the print.substitution.subn( ' ', p) in which it says that I expected a string or buffer while running the program. Any help is greatly appreciated.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-31T02:04:41+00:00

Editorial Team

2026-05-31T02:04:41+00:00Added an answer on May 31, 2026 at 2:04 am

You are trying to substitute into the string “p”. However, p is the result of findall which is a list.

I would suggest doing it like this:

lines = file_variable.read()
print re.subn('<.*?>', ' ', line)

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have to write a program to remove all expressions of the form <word>

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply