Using Traffic Analysis to Detect Email Spam

and 31 more valid destinations

HELO = lrhnow.usa.net 2007-05-19 23:11:22 [email protected] Size= 8339 -> [email protected]
HELO = lkrw.hotmail.com 2007-05-19 23:11:24 [email protected] Size=11340 -> [email protected]
HELO = pshw.netscape.net 2007-05-19 23:14:52 [email protected] Size= 6122 -> [email protected]
HELO = zmgp.cs.com 2007-05-19 23:18:06 [email protected] Size= 6925 -> [email protected]
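
Log entries like the ones above can be turned into structured records before any analysis. The following is a minimal sketch, not the author's tooling. It assumes the field layout seen in the sample (`HELO = name`, timestamp, sender, `Size=`, `->`, recipient). The name `parse_entries` and the addresses in `SAMPLE` are hypothetical, since the real addresses are redacted in the source.

```python
import re
from datetime import datetime

# Assumed layout, inferred from the sample entries only:
#   HELO = <name> <date> <time> <sender> Size=<bytes> -> <recipient>
ENTRY = re.compile(
    r"HELO = (?P<helo>\S+)\s+"
    r"(?P<when>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<sender>\S+)\s+"
    r"Size=\s*(?P<size>\d+)\s+->\s+"
    r"(?P<rcpt>\S+)"
)

def parse_entries(text):
    """Return one dict per log entry found in text (hypothetical helper)."""
    entries = []
    for m in ENTRY.finditer(text):
        entries.append({
            "helo": m["helo"],
            "when": datetime.strptime(m["when"], "%Y-%m-%d %H:%M:%S"),
            "sender": m["sender"],
            "size": int(m["size"]),
            "rcpt": m["rcpt"],
        })
    return entries

# Hypothetical addresses stand in for the redacted ones; redacted
# placeholders containing a space would not match this pattern.
SAMPLE = (
    "HELO = lrhnow.usa.net 2007-05-19 23:11:22 a@example.com Size= 8339 -> x@example.net "
    "HELO = lkrw.hotmail.com 2007-05-19 23:11:24 b@example.com Size=11340 -> y@example.net"
)

entries = parse_entries(SAMPLE)
# Collect the distinct HELO names announced across the parsed entries.
helo_names = {e["helo"] for e in entries}
```

Parsing into dictionaries keeps later steps, such as grouping by sender or counting distinct HELO names, independent of the exact log syntax.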



Richard Clayton

Summary

• Log processing for customers

• Log processing for non-customers

• Looking at sampled sFlow data

What problems do ISPs have?

• Insecure customers

– very few real spammers sending directly!

• Botnets

– compromised end-user machines

• SOCKS proxies &c

– misconfiguration

• SMTP AUTH

[Diagram: customers and spammers; ISP email server (smarthost); destination domains yahoo.com, hotmail.com, example.com, example.co.uk, beispiel.de, etc.; complaints arriving at the ISP abuse@ team]

ISP’s Real Problem

• Blacklisting of IP ranges & smarthosts

– [email protected]

• Rapid action necessary to ensure continued service to all other customers

• But reports may go to the blacklist and not to the ISP (or will lack essential details)

[Diagram: customers and spammers; ISP email server (smarthost); destination domains yahoo.com, hotmail.com, example.com, example.co.uk, beispiel.de, etc.; complaints arriving at the BLACKLIST]

Spotting outgoing spam

• Expensive to examine outgoing content

• Legal/contractual issues with blocking

– “false positives” could cost you customers

• Volume is not a good indicator of spam

– many customers with occasional mailshots

So virus infections are also detected