Collecting IOCs from IMAP Folder
I've plenty of subscriptions to "cyber security" mailing lists that generate a lot of traffic. Even if we try to get rid of emails, that's a fact: email remains a key communication channel. Some mailing lists posts contain interesting indicators of compromize. So, I searched for a nice way to extract them in an automated way (and to correlate them with other data). I did not find a solution ready to use that matched my requirements:
- Connect to any mailbox (preferably via IMAP)
- Produce data easy to process (JSON)
- Be easy to deploy (Docker)
So, I built my own Docker image... It is based on the following components:
- procmail
- getmail
- some Python libraries
- The project es_mail_intel[1]
The last tool is an old project that achieves exactly why I expect: It extracts IOCs from emails and stores them in ElasticSearch. But, if you don't want ElasticSearch, it can also produce a JSON file! Parsing emails is a pain! So, I did not want to write my own parser.
Data are processed in this way: Emails are fetched via IMAP at regular intervals by getmail and pushed to procmail. It pushes them to the Python script that extracts interesting data.
IMAP data >> getmail >> procmail >> mail_parser2json_extract.py >> JSON data
Here is my Dockerfile:
FROM ubuntu:18.04
MAINTAINER Xavier Mertens <xavier@rootshell.be>
RUN apt-get update && \
DEBIAN_FRONTEND=noninteractive apt-get install --no-install-recommends -y \
ca-certificates \
getmail \
procmail \
git \
python \
python-ipaddress \
python-pdfminer \
python-elasticsearch \
python-xlrd \
&& \
apt-get clean && \
rm -rf /var/lib/apt/lists/*
RUN mkdir -p /root/.getmail
RUN git clone https://github.com/clverhack/es_email_intel.git /opt/es_email_intel
COPY getmail.conf /
RUN echo ":0" >>/procmailrc
RUN echo "|/opt/es_email_intel/mail_parser2json_extract.py 2 >>/log/ioc.json" >>/procmailrc
COPY run.sh /
RUN chmod u+x /run.sh
RUN touch /tmp/firstboot
CMD ["/run.sh"]
It needs a getmail.conf with the parameters of the mailbox you'd like to monitor:
[retriever]
type = SimpleIMAPSSLRetriever
server = CONF_SERVER
username = CONF_LOGIN
password = CONF_PASSWORD
[destination]
type = MDA_external
path = /usr/bin/procmail
user = getmail
group = getmail
arguments = ('/procmailrc', )
[options]
verbose=1
read_all=false
delete=false
delete_after=0
delete_bigger_than=0
max_bytes_per_session=0
max_message_size=0
max_messages_per_session=0
delivered_to=false
received=false
message_log=/log/getmail.log
message_log_syslog=false
message_log_verbose=true
getmail is a very powerful tool with plenty of options. Just have a look at the documentation[2] to find your best way to interact with your mailboxes. The script 'run.sh' will be executed by the container and, at first boot, configure your credentials:
#!/bin/bash
if [ -r /tmp/firstboot ]; then
sed -i "s|CONF_SERVER|$IMAP_SERVER|g" /getmail.conf
sed -i "s|CONF_LOGIN|$IMAP_USER|g" /getmail.conf
sed -i "s|CONF_PASSWORD|$IMAP_PASS|g" /getmail.conf
groupadd getmail
useradd -u $UID -g getmail -d /home/getmail getmail
mkdir /home/getmail && chown getmail:getmail /home/getmail
test -d /log || mkdir /log
touch /log/getmail.log /log/ioc.json
chown -R root:getmail /log
chmod -R g+w /log
rm /tmp/firstboot
fi
while true
do
/usr/bin/getmail -r /getmail.conf
sleep $IMAP_WAIT
done
And, finally, my docker-compose.yml file:
version: '3'
services:
iocollector:
build: .
image: "xme/iocollector"
restart: always
hostname: iocollector
container_name: iocollector
volumes:
- /etc/localtime:/etc/localtime:ro
- /data/iocollector/log:/log
environment:
- UID=1000
- IMAP_SERVER=<server_ip_or_fqdn>
- IMAP_USER=<username>
- IMAP_PASS=<password>
- IMAP_WAIT=30
network_mode: bridge
Start your docker and it will populate the mapped /log directory with an 'ioc.json' file:
{
"bitcoin_wallet": [
"1ncgvmeqno77x5pqbp7yi4779si",
"3k04tl9upoapvqrcbntcsnglvcr5yn",
"366vraic2k6ocpol7q9ck8vs2o5by0dp",
"1fgtytuwedagp1krbukpd1dkkpfs",
"3pmusoxmgn1qmrzedtxsqbua6dua6t",
"34pg4dq162op34bd1aospshvsyxwhsxyo",
"38wvzcnz49gvuwo29iuf8vxyssvmfi66w",
"1nvgzipjvq5hdd16iszjdx9jzvf3",
"1igrmo3w477lnb4e3tsz3gfdrciwo",
"1yenq1jfsx54ptvvdtbplcqumu2dz3we",
"3wusjglqxkkro3nfzd1mkwx1qqf",
"1tjkoizmzrof4kgkok3jciwysmrxprj7"
],
"ctime": "Thu Mar 5 17:54:23 2020",
"domain": [
"xpressvpngoodforpsdgo.duckdns.org",
"windowsfirewallsecurityauthorise.duckdns.org",
"cartonal.net",
"asco.az",
"duckdns.org",
"dnverificationfailedbymicrosoftthanks.duckdns.org",
"saranadinamika.co.id",
...
"windowsdefenderserversecuresofficeiq.duckdns.org",
"tecnol2.com",
"synergiespakistan.com",
"chxsndy3manufacturingandinsurancebusines.duckdns.org",
"ecomglobalpurchasecentercom.duckdns.org",
"globalsharesecurefilesgood.duckdns.org",
"ewrt55.com",
"matriximportsusa.com",
"pstv-energy.com",
"msofficecloudtransferfileprotocolsys.duckdns.org",
"workshop002.duckdns.org",
"globalfbdnsaddressgoogle.duckdns.org",
"www.virustotal.com",
"marroiq.com",
"challengerevertprocessupdate.duckdns.org",
"windowsdefenderserversecureserver.duckdns.org",
"windowsdefenderserversecuresofficew.duckdns.org"
],
"email": [
"1042@home7.solarlog-web.de",
...
],
"epoch": "1583427263",
"filename": [
"64443.exe",
"soa.xlsx",
"copy.xlsx",
"p.o.xlsx",
"agenttesla.xlsx",
"vbc.exe",
"products.xlsx",
"quote.pdf"
],
"ipv4": [
...
],
"md5": [],
"message_text": "...",
"mutex": [],
"sha1": [],
"sha256": [
"3a0965bd363e46e8c30590d7ddfd22bfc353728c6311c3ca61479da6f874bd5f",
"f3a98da7e0a2c7cfe0907f5b0b769a4ff3bad45d7dce7ed905d6c3c2bf835201",
"cb01d834237a37d3825f4075961cf6eda5f856af13526d699e4f5c93edf00c15",
"2ef88915bb08f6d0d4c5fb2a90777930ad4e0d0b757e4b2b51988813338470f9",
"df852235358fe361ba0577ad4c064b642148b2ac52d5857316d20bca33ea3d04",
"172542fe3f7aae11deaf6960652d0074a1a7d83ae5596951ad69ba151a9f9630"
],
"ssdeep": [],
"url": [
"windowsdefenderserversecureserver.duckdns.org/mich/vbc.exe",
"globalsharesecurefilesgood.duckdns.org/love/vbc.exe",
"windowsfirewallsecurityauthorise.duckdns.org/big/vbc.exe",
"workshop002.duckdns.org/vbc.exe",
"windowsdefenderserversecuresofficew.duckdns.org/windows/vbc.exe",
"dnverificationfailedbymicrosoftthanks.duckdns.org/lee/vbc.exe",
"dnverificationfailedbymicrosoftthanks.duckdns.org/lavin/vbc.exe",
"ecomglobalpurchasecentercom.duckdns.org/love/vbc.exe",
"windowsdefenderserversecuresofficew.duckdns.org/reg/vbc.exe",
...
"globalfbdnsaddressgoogle.duckdns.org/lvc/vbc.exe"
]
}
Note: The complete is email is parsed. You will find in the JSON file all SMTP headers, the email body, etc. Less relevant for IOC's but still interesting in some cases (by example, to analyze spam).
Here is a recap of the data flow:

[1] https://github.com/clverhack/es_email_intel.git
[2] http://pyropus.ca/software/getmail/
Xavier Mertens (@xme)
Senior ISC Handler - Freelance Cyber Security Consultant
PGP Key
| Reverse-Engineering Malware: Malware Analysis Tools and Techniques | London | Nov 3rd - Nov 8th 2025 |

Comments