How to search for email where the subject contains numbers

Question

I’m looking for emails where the title has information on how many Bitcoin I received, but as there’s a number in the email title, I want a way to find emails where the number is equal to or greater than that number. Example… I have an email title like “You received 0.000666703 BTC…

Accepted Answer

If you only care about the subject, only fetch the subject.

    import imaplib
    from email.parser import HeaderParser
    from email.policy import default  # use Python >= 3.6 EmailMessage API

    ...  # connect and log in; `server` is an imaplib.IMAP4 or IMAP4_SSL object

    parser = HeaderParser(policy=default)
    server.select('INBOX')
    typ, data = server.search(None, '(FROM "no-reply@coinbase.com" SUBJECT "You received" SINCE "24-Sep-2021")')
    if typ == 'OK':  # imaplib reports the status in upper case
        for num in data[0].split():
            ok, fetched = server.fetch(num, '(BODY.PEEK[HEADER.FIELDS (SUBJECT)])')
            if ok != 'OK':
                continue
            # Parse the raw header; with policy=default, msg['Subject'] is already MIME-decoded
            msg = parser.parsestr(fetched[0][1].decode('us-ascii'))
            subj = msg['Subject'] or ''
            if not subj.startswith('You received'):
                continue
            try:
                amount = float(subj.split()[2])  # "You received <amount> BTC"
            except (IndexError, ValueError):
                continue
            if amount >= 0.000666703:
                print('Message %s: %s' % (num.decode(), subj))

The IMAP SUBJECT search key only does substring matching, so the server can narrow the candidates down to "You received" messages, but the numeric comparison has to happen on the client.

The Subject: header arrives as a bytes string, which at a minimum you have to decode. However, there may also be a MIME wrapping (like maybe Subject: =?UTF-8?B?WW91IHJlY2VpdmVkIDAuMTIzIEJUQw==?=) which you need to decode using the email.parser.HeaderParser methods or something similar; with policy=default, msg['Subject'] returns the decoded text. The interface is a bit messy; if you would rather not decode separately, email.parser.BytesHeaderParser accepts bytes directly.

BODY.PEEK does not modify the message’s flags (whereas plain BODY would set the \Seen flag, i.e. mark the message as read).

Some IMAP servers support more complex search syntax (perhaps even regex) but this should be reasonably portable and robust, I hope.
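The decoding and amount-extraction step can be tried on its own, without an IMAP connection. The following is only a sketch: the helper name subject_amount and the regular expression are assumptions made for illustration, and the pattern assumes subjects shaped like "You received <number> BTC" as in the question. It feeds raw header bytes (the shape that a HEADER.FIELDS (SUBJECT) fetch returns) to email.parser.BytesHeaderParser, so no separate decode step is needed.

```python
import re
from email.parser import BytesHeaderParser
from email.policy import default
from typing import Optional

# Assumed subject shape: "You received <number> BTC" (taken from the question).
_AMOUNT = re.compile(r"^You received\s+(\d+(?:\.\d+)?)\s+BTC\b")
_parser = BytesHeaderParser(policy=default)


def subject_amount(raw_header: bytes) -> Optional[float]:
    """Return the BTC amount from a raw Subject header, or None if it does not match."""
    msg = _parser.parsebytes(raw_header)
    subject = msg["Subject"]  # policy=default decodes RFC 2047 encoded-words
    if subject is None:
        return None
    match = _AMOUNT.match(str(subject))
    return float(match.group(1)) if match else None


# Sample headers: one plain, one base64 encoded-word, one unrelated subject.
plain = b"Subject: You received 0.000666703 BTC\r\n\r\n"
encoded = b"Subject: =?UTF-8?B?WW91IHJlY2VpdmVkIDAuMTIzIEJUQw==?=\r\n\r\n"
other = b"Subject: Your weekly summary\r\n\r\n"

for raw in (plain, encoded, other):
    amount = subject_amount(raw)
    if amount is not None and amount >= 0.000666703:
        print("match:", amount)
```

Using a regular expression instead of split()[2] is a design choice, not a requirement: it rejects subjects that merely start with the same words but carry no number, at the cost of being tied to one exact subject format.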

Answer