|
|||||||||||
|
Re: A different approach to scoring spamassassin hits
From: Tom Allison <tom(at)tacocat.net>
Date: Sun Jul 01 2007 - 11:07:37 EDT On Jun 30, 2007, at 11:55 PM, Loren Wilton wrote: > Actually I have this but I don't have it writting the headers into the email. It' s sending the SA data as attached information so I can keep track of where it came from (header/body/metadata). I'm not sure that the scoring is going to cost me anything or cause any performance issues compared to getting the hits/misses. I think we're debating the cpu involved to determine a number for the score, not the scoring process itself. I have a question about the sub rules -- are they themselves adding up to an overall rule by means of hit/miss? Is there any conceptual advantage to pulling in rules and sub_rules to this process.
And the more I think about it, the more I don't need to "bloat every
mail with the names of all the rules".
--- By not putting in all the SA rules it might make it easier to establish the contribution of the scoring, but you have to know the intended target (RULE => spam or RULE => ham) which isn't an issue with todays rules (but you never know). Once you know this, the effectiveness of a rule would be measured by it's distance in probability from 0.500 toward 1.00. I can track this eventually, but I think I need to reset my database to be certain of it's value. Not a problem, I am my own admin. But the real challenge for me, as has always been the case with SA, is the proper care and feeding of the application when not using the standard spamc/spamd and spamassassin scripts. I suspect this starts with a lot of RTFM and then I can get to some real questions. The difficulty for me is trimming out all the steps in the application that I won't be benefitting from. I would like to start with something that is approximately: local "static" rules only, no user specific preferences, no learning or bayes or white/black listing. By local "static" I mean to use the rules based on email content analysis without network consultation (DNS, RBL, DCC...)Received on Sun Jul 1 11:08:13 2007 This archive was generated by hypermail 2.1.8 : Sun Jul 01 2007 - 11:10:04 EDT |
||||||||||
|
|||||||||||