Words with high probablity of spam.

This is based on mail that comes into my mailbox. I use bogofilter to detect spam. I am always curious what are the spammiest words. So this tells me on a regular basis. The scrip that actually extracts the data from bogofilter is bogo-counter.sh.

Rank Probability of Spam Number of SPAMs Number of Non-Spams Word
1 1.000000 23137 0 mime:Content-ID
2 1.000000 18474 0 mime:gif
3 0.999999 9600 0 to:billing
4 0.999999 9133 0 rcvd:billing
5 0.999999 6325 0 head:Not
6 0.999999 6314 0 head:detected
7 0.999999 6305 0 head:X-Spam
8 0.999999 6134 0 to:annegret.net
9 0.999999 6077 0 head:annegret.net
10 0.999999 14684 0 baseline
11 0.999999 14424 0 head:V6.00.2900.2180
12 0.999999 14329 0 head:star
13 0.999999 13787 0 head:The
14 0.999999 13419 0 to:info
15 0.999999 13329 0 to:star
16 0.999999 13324 0 head:Bat!
17 0.999999 13199 0 rcvd:info
18 0.999999 12074 0 rcvd:star
19 0.999999 11570 0 head:V6.00.2800.1106
20 0.999998 5550 0 rcvd:annegret.net
21 0.999998 5190 0 $49.95
22 0.999998 5045 0 Cialis
23 0.999998 4956 0 CS2
24 0.999998 4924 0 $69.95
25 0.999998 4915 0 sex
26 0.999998 4739 0 Remove
27 0.999998 4675 0 pills
28 0.999998 4490 0 Symbol
29 0.999998 4305 0 g-images.amazon.com
30 0.999998 4184 0 Corel
31 0.999998 4039 0 millions
32 0.999998 3997 0 head:Personal
33 0.999998 3918 0 Illustrator
34 0.999998 3908 0 head:billing
35 0.999998 3905 0 head:X-Message-Info
36 0.999998 3887 0 rcvd:webmaster
37 0.999998 3825 0 to:webmast

Return

Last updated on: Aug 30 2008 03:27:05 UTC