Question

Asked: June 4, 2026 (2026-06-04T11:42:23+00:00)

import re

str = "<test>0</test>"
print(re.search("<.*?>", str).group())
print(re.search(">.*?<", str).group())
>> <test>
>> >0<

How can I get just “test” and “0” as the results, without the two characters I used as markers in the regex?

1 Answer

Editorial Team · Answer 1 · 2026-06-04T11:42:24+00:00

You shouldn’t be using regex to parse XML/HTML; see murgatroid99’s comment.
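
For comparison, here is a minimal sketch of the parser route. It assumes Python 3 and well-formed XML input, and the choice of the standard-library xml.etree.ElementTree module is my assumption rather than something named in the question or the comment:

```python
import xml.etree.ElementTree as ET

s = "<test>0</test>"

# Parse the string into an Element; this raises ET.ParseError on malformed input.
root = ET.fromstring(s)

print(root.tag)   # test
print(root.text)  # 0
```

A parser handles attributes, nesting, and entities, which the regex patterns below do not attempt to deal with.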

That being said, here is how you can get the results you want for this example using regex. Use a capturing group:

>>> import re
>>> s = "<test>0</test>"
>>> print(re.search(r"<(.*?)>", s).group(1))
test
>>> print(re.search(r">(.*?)<", s).group(1))
0
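
If both pieces are wanted at once, the two searches can be folded into one pattern with two capturing groups. This is only a sketch for this exact input shape (Python 3 syntax); the names tag_name and content are illustrative:

```python
import re

s = "<test>0</test>"

# One pattern, two capturing groups: the tag name, then the text up to the next "<".
match = re.search(r"<(.*?)>(.*?)<", s)

if match is not None:  # re.search returns None when nothing matches
    tag_name, content = match.groups()
    print(tag_name)  # test
    print(content)   # 0
```

Checking for None before calling .group() or .groups() avoids an AttributeError on input that does not match.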

Note that you shouldn’t use str as a variable name, as it will mask the built-in type.
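
A small sketch of what that masking looks like in practice (Python 3 syntax; shadowing_demo is a hypothetical helper used only to keep the shadowed name local to one function):

```python
def shadowing_demo():
    # Inside this function, the local name "str" hides the built-in type.
    str = "<test>0</test>"
    try:
        str(42)  # tries to call a string object, not the built-in str type
    except TypeError as exc:
        return type(exc).__name__


print(shadowing_demo())  # TypeError
```

Using a name such as s, as in the examples above, sidesteps the problem.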

An alternative to a capturing group would be a lookbehind and lookahead:

>>> print(re.search(r"(?<=<).*?(?=>)", s).group())
test
>>> print(re.search(r"(?<=>).*?(?=<)", s).group())
0

Using raw string literals (r"...") isn’t necessary for these in particular, but it is good to get into the habit of using them when writing regular expressions to make sure that backslashes are handled properly.
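
A short illustration of where the raw prefix starts to matter (Python 3 syntax; the \b pattern and the sample text are my own example, not from the question):

```python
import re

# For the patterns in this answer, raw and regular literals are identical.
print(r"<(.*?)>" == "<(.*?)>")  # True

# With a backslash the two differ: "\b" is a single backspace character,
# while r"\b" is the two characters the regex engine reads as a word boundary.
print(len("\b"), len(r"\b"))  # 1 2

text = "a cat sat"
print(re.search("\bcat\b", text))           # None
print(re.search(r"\bcat\b", text).group())  # cat
```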
