I’m using re.split() to separate a string into tokens. Currently the pattern I’m using

Question

0

Asked: May 23, 20262026-05-23T12:20:09+00:00 2026-05-23T12:20:09+00:00

I’m using re.split() to separate a string into tokens. Currently the pattern I’m using

0

I’m using re.split() to separate a string into tokens. Currently the pattern I’m using as the argument is [^\dA-Za-z], which retrieves alphanumeric tokens from the string.

However, what I need is to also split tokens that have both numbers and letters into tokens with only one or the other, eg.

re.split(pattern, "my t0kens")

would return ["my", "t", "0", "kens"].

I’m guessing I might need to use lookahead/lookbehind, but I’m not sure if that’s actually necessary or if there’s a better way to do it.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-23T12:20:09+00:00

Editorial Team

2026-05-23T12:20:09+00:00Added an answer on May 23, 2026 at 12:20 pm

Try the findall method instead.

>>> print re.findall ('[^\d ]+', "my t0kens");
['my', 't', 'kens']
>>> print re.findall ('[\d]+', "my t0kens");
['0']
>>>

Edit: Better way from Bart’s comment below.

>>> print re.findall('[a-zA-Z]+|\\d+', "my t0kens")
['my', 't', '0', 'kens']
>>>

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m using re.split() to separate a string into tokens. Currently the pattern I’m using

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply