I want to extract the first number found in an e-mail body. With the

Question

0

Asked: May 27, 20262026-05-27T02:47:01+00:00 2026-05-27T02:47:01+00:00

I want to extract the first number found in an e-mail body. With the

0

I want to extract the first number found in an e-mail body. With the help with email library I extracted just the body from the mail to a string. But the problem is that before the real plain-text body begins there is some info about encoding and such (those contains digits). How can I skip those on a reliable way that is undependent on which client that created the e-mail and just ge the first number.

If I do a

match = re.search('\d+', string, re.MULTILINE)

It will get the first match in the info about encoding or something and not in the actual mail content.

Ok. I add a sample. This is how it could look (i will extract 123). But I suppose it could look different went sent from another client.

--14dae93404410f62f404b2e65e10
Content-Type: text/plain; charset=ISO-8859-1

Junk 123 Junk

--14dae93404410f62f404b2e65e10
Content-Type: text/html; charset=ISO-8859-1

<p>Junk 123 Junk</p>

--14dae93404410f62f404b2e65e10--

Update:
Now I’m stuck with the iterator :-/ I really tried. But I don’t get it. This code:

msg = email.message_from_string(raw_message)
for part in email.iterators.typed_subpart_iterator(msg, 'text', 'plain'):
    print part

outputs:

--14dae93404410f62f404b2e65e10
Content-Type: text/plain; charset=ISO-8859-1

Junk 123 Junk

--14dae93404410f62f404b2e65e10
Content-Type: text/html; charset=ISO-8859-1

<p>Junk 123 Junk</p>

--14dae93404410f62f404b2e65e10--

Why won’t it just output:

Junk 123 Junk

?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-27T02:47:02+00:00

Editorial Team

2026-05-27T02:47:02+00:00Added an answer on May 27, 2026 at 2:47 am

You may want to use the iterators to skip over the subpart headers.

http://docs.python.org/library/email.iterators.html#module-email.iterators

This example will print the body of each message subpart that is text/plain:

for part in email.iterators.typed_subpart_iterator(msg, 'text', 'plain'):
   for body_line in email.iterators.body_line_iterator(part):
       print body_line

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I want to extract the first number found in an e-mail body. With the

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply