
Question


Asked: May 10, 2026


I’m using re.findall() to extract some version numbers from an HTML file:

>>> import re
>>> text = '<table><td><a href=\'url\'>Test0.2.1.zip</a></td><td>Test0.2.1</td></table> Test0.2.1'
>>> re.findall('Test([\.0-9]*)', text)
['0.2.1.', '0.2.1', '0.2.1']

but I would like to only get the ones that do not end in a dot. The filename might not always be .zip so I can’t just stick .zip in the regex.

I want to end up with:

['0.2.1', '0.2.1']

Can anyone suggest a better regex to use? 🙂


1 Answer

score 0 · Answer 1 · 2026-05-10T22:37:37+00:00

Added an answer on May 10, 2026 at 10:37 pm

re.findall(r'Test([0-9.]*[0-9]+)', text)

or, a bit shorter:

re.findall(r'Test([\d.]*\d+)', text)

By the way, you do not need to escape the dot in a character class. Inside [] the . has no special meaning; it just matches a literal dot, so escaping it has no effect.
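To see how these patterns behave on the sample text from the question, here is a small sketch. The `strict` pattern is an assumption of mine and not part of the answer: it adds a negative lookahead so that a version immediately followed by another dot or digit (as in `Test0.2.1.zip`) is skipped entirely. That seems to be what the question's desired output asks for, but I have only tried it on this one input.

```python
import re

# Same sample text as in the question, written with double quotes.
text = ("<table><td><a href='url'>Test0.2.1.zip</a></td>"
        "<td>Test0.2.1</td></table> Test0.2.1")

# The two patterns from the answer: the captured group must end in a digit,
# so the trailing dot before "zip" is left out of the capture.
explicit = re.findall(r'Test([0-9.]*[0-9]+)', text)
shorthand = re.findall(r'Test([\d.]*\d+)', text)

# Assumed variant (not from the answer): the lookahead rejects a version
# that is directly followed by another digit or dot, e.g. inside "0.2.1.zip".
strict = re.findall(r'Test([\d.]*\d)(?![\d.])', text)

# Inside a character class, an escaped dot and a plain dot match the same thing.
escaped_dot = re.findall(r'[\.]', 'a.b-c')
plain_dot = re.findall(r'[.]', 'a.b-c')

print(explicit)                  # ['0.2.1', '0.2.1', '0.2.1']
print(shorthand)                 # ['0.2.1', '0.2.1', '0.2.1']
print(strict)                    # ['0.2.1', '0.2.1']
print(escaped_dot == plain_dot)  # True
```

Note that the answer's patterns still return three matches here, because the `.zip` occurrence yields `0.2.1` with the dot trimmed rather than being dropped. Also, in Python 3 `\d` on a `str` pattern matches any Unicode decimal digit unless `re.ASCII` is set, so `[0-9]` is the stricter choice if only ASCII digits should count.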

