EmailDiscussions.com  

Go Back   EmailDiscussions.com > Email Service Provider-specific Forums > Runbox Forum
Register FAQ Members List Calendar Today's Posts
Stay in touch wirelessly

Runbox Forum Everything related to Runbox should go here: suggestions, comments, complaints, questions, technical issues, etc.

Reply
 
Thread Tools
Old 21 Jul 2004, 07:48 AM   #1
trond
Essential Contributor
 
Join Date: Oct 2003
Location: Oslo, Norway
Posts: 344
Trainable spam filter ready for beta testing

We're now ready to start beta testing of the trainable spamfilter. We're using a system called DSpam (http://www.nuclearelephant.com/projects/dspam/)

If anybody wants to test this new filter, send me a personal message here at the forum, and tell me your username.
After I've set up the system to filter your mail through dspam, you'll get an extra checkbox in the filter page that says "use trainable spamfilter". You'll have to check this, and then "save settings" to enable the trainable filter.

Beware that the normal behaviour of a trainable filter is to first be too lenient, and let pretty much everything slip through. After a while it'll start catching spam, and then it's likely to be too strict, and start catching legitimate mail - particularly newsletters and similar. So you need to check the spam folder regularly in the beginning, to make sure that there are no legitimate mails there. After a few hundred mails it'll start to get really good at recognizing the mails you don't want, and you can start ignoring your spamfolder.

Another thing to remember is that the filter trains itself, so if it misclassifies a mail, and you don't correct it, it's likely to repeat the mistake. In particular, if it starts to miss spams, it'll get increasingly worse if you just delete the spam instead of using the "report spam" button.
trond is offline   Reply With Quote

Old 21 Jul 2004, 08:45 PM   #2
BLuRReD
Essential Contributor
 
Join Date: Nov 2003
Location: somewhere
Posts: 297
are u willing to add a buttom called this is not spam as well ?
BLuRReD is offline   Reply With Quote
Old 21 Jul 2004, 09:45 PM   #3
trond
Essential Contributor
 
Join Date: Oct 2003
Location: Oslo, Norway
Posts: 344
There is one already

In the spam folder, the button says "not spam", everywhere else, it says "report spam"
trond is offline   Reply With Quote
Old 21 Jul 2004, 10:10 PM   #4
BLuRReD
Essential Contributor
 
Join Date: Nov 2003
Location: somewhere
Posts: 297
okay , another question , does this mean u creat a new folder called spam for users , or at least for beta users now or what ?
BLuRReD is offline   Reply With Quote
Old 21 Jul 2004, 10:13 PM   #5
trond
Essential Contributor
 
Join Date: Oct 2003
Location: Oslo, Norway
Posts: 344
No.

You decide for yourself what folder you want spam to be directed into. Just like before, really.
trond is offline   Reply With Quote
Old 21 Jul 2004, 10:15 PM   #6
BLuRReD
Essential Contributor
 
Join Date: Nov 2003
Location: somewhere
Posts: 297
but will i be able to report any email as a spam located in any other folder , right ?
BLuRReD is offline   Reply With Quote
Old 21 Jul 2004, 10:27 PM   #7
trond
Essential Contributor
 
Join Date: Oct 2003
Location: Oslo, Norway
Posts: 344
right.

The folder you direct spam to, is considered to be your spam folder. All other folders are not.

When training, you can train every mail already in the spam folder as "not spam". All mails that are not in the spam folder can be trained as spam.
trond is offline   Reply With Quote
Old 21 Jul 2004, 11:02 PM   #8
carverrn
Intergalactic Postmaster
 
Join Date: Jan 2002
Location: Chicago, IL
Posts: 5,606

Representative of:
Runbox.com
Quote:
Originally posted by trond
There is one already

In the spam folder, the button says "not spam", everywhere else, it says "report spam"
Minor issue ... when viewing a message my "Report Spam" link splits like this:

Inbox Compose Reply Reply to all Forward Delete Read/Unread Report
Spam Retrieve POP

Could you either make "Report Spam" non-breaking or just bite the bullet and make this a double line all the time:

Message View
Inbox Compose Reply Reply to all Forward Delete Read/Unread
Report Spam Retrieve POP

and

spam Compose Reply Reply to all Forward Delete Read/Unread
Not Spam Retrieve POP


Folder Listing
Refresh Compose Reply Reply to all Forward Delete Read/Unread
Report Spam Retrieve POP

and

Refresh Compose Reply Reply to all Forward Delete Read/Unread
Not Spam Retrieve POP

Thanks,
Rich
carverrn is offline   Reply With Quote
Old 22 Jul 2004, 02:30 AM   #9
carverrn
Intergalactic Postmaster
 
Join Date: Jan 2002
Location: Chicago, IL
Posts: 5,606

Representative of:
Runbox.com
Message bypassed DSPAM?

Trond,

I turned on the DSPAM option this morning. Saw that I was getting messages with the X-DSPAM headers in them as expected.

Just a little while ago I received a new message that does not have the X-DSPAM headers. It does hava the SpamAssassin X-Spam headers.

Any idea why it didn't get the X-DSPAM headers?

Regards,
Rich
carverrn is offline   Reply With Quote
Old 22 Jul 2004, 02:44 AM   #10
LrdVader
The "e" in e-mail
 
Join Date: Oct 2003
Location: USA
Posts: 2,550
So how does this fit in with SpamAssassin? What happens when one filter thinks a message is spam and the other doesn't?
LrdVader is offline   Reply With Quote
Old 22 Jul 2004, 02:58 AM   #11
trond
Essential Contributor
 
Join Date: Oct 2003
Location: Oslo, Norway
Posts: 344
Spamassassin has been set up to assist in training the trainable filter.

If the trainable thinks it's innocent, and Spamassassin thinks it's spam, the message is automatically trained as spam.
If the trainable thinks it's spam, but the sender is in the whitelist, it's automatically trained as not spam.

That means that, at least to begin with, it's best to not use the "reject if possible" option. When you think the filter has matured, you can turn on rejection again, so that you'll have less spam to wade through, looking for false positives.
trond is offline   Reply With Quote
Old 22 Jul 2004, 03:15 AM   #12
LrdVader
The "e" in e-mail
 
Join Date: Oct 2003
Location: USA
Posts: 2,550
Quote:
Originally posted by trond
If the trainable thinks it's innocent, and Spamassassin thinks it's spam, the message is automatically trained as spam.
I'm not so sure that's a good idea. The whole point of a trainable filter is that it can ultimately be smarter than SA, and be correct when it is wrong, right? It seems that having SA train it kind of defeats this purpose. And it means that using the "Not Spam" button to train the filter to avoid false positives won't accomplish much, since the filter will be re-trained to see them as spam the next time SA sees them.

Am I missing something?
LrdVader is offline   Reply With Quote
Old 22 Jul 2004, 04:26 AM   #13
carverrn
Intergalactic Postmaster
 
Join Date: Jan 2002
Location: Chicago, IL
Posts: 5,606

Representative of:
Runbox.com
Quote:
Originally posted by LrdVader
I'm not so sure that's a good idea. The whole point of a trainable filter is that it can ultimately be smarter than SA, and be correct when it is wrong, right?
I think they are just using Spam Assassin to "jump start" the training process.

Rich
carverrn is offline   Reply With Quote
Old 22 Jul 2004, 04:46 AM   #14
carverrn
Intergalactic Postmaster
 
Join Date: Jan 2002
Location: Chicago, IL
Posts: 5,606

Representative of:
Runbox.com
More messages bypassed DSPAM

14 more messages have arrived that do not have the X-DSPAM headers in them.

Rich
carverrn is offline   Reply With Quote
Old 22 Jul 2004, 06:33 AM   #15
Geir
The "e" in e-mail
 
Join Date: Sep 2001
Location: Oslo, Norway
Posts: 2,938

Representative of:
Runbox.com
SA has proven too lenient (e.g. excessive false negatives) and unreliable at lower thresholds than 4-5 (5 is currently default), while false positives have been very rare. Thus, Dspam is set up to catch spam that bypasses SA, in such a way that SA assists the user in training Dspam (to accelerate learning).

In the event of a false SA-positive, a "Not spam" action will SA-whitelist the sender to prevent subsequent misclassification.

If you'd like to see how great it works, PM Trond or me.

- Geir
Geir is offline   Reply With Quote
Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump


All times are GMT +9. The time now is 05:36 AM.

 

Copyright EmailDiscussions.com 1998-2022. All Rights Reserved. Privacy Policy