Pantek Library
Hosting Provided By
CybrHost
High Speed Hosting

Re: PDF rule not matching -- split line content type?

From: Jo Rhett <jrhett(at)netconsonance.com>
Date: Thu Aug 16 2007 - 02:41:33 EDT


Since nobody is paying attention, let me clarify. The current rule is wrong:

mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~ /^application\/octet-stream.*\.pdf/i

meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT_TP && __TVD_MIME_ATT && !__TVD_BODY This evaluates to exactly the same as this:

meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT_TP && !__TVD_BODY

I believe that the original rule's intent was this:

meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT && !__TVD_BODY

Can someone with commit rights please test and commit this change? Thank you.

Jo Rhett wrote:

> Well actually I think the rule has a bug.  Why OR the two mime types as 
> a new meta, and then require one of the two in the final meta?   The net 
> effect is that if ATT_TP is true it matches, but if ATT_AOPDF is true it 
> will never match.
> 
> I believe that the following will work better - work in every situation 
> that it worked before, and not fail when the mime type is octet-stream:
>    meta TVD_PDF_FINGER01           __TVD_MIME_CT_MM && __TVD_MIME_ATT && 
> !__TVD_BODY
> 
> Would someone kindly evaluate this change and possibly fix the rule?  
> Thanks.
> 
> On Aug 14, 2007, at 10:41 PM, Loren Wilton wrote:

>>>> rawbody __TVD_BODY /\S{4}/
>>
>> true
>>
>>>> header __TVD_MIME_CT_MM Content-Type =~ /^multipart\/mixed/i
>>
>> true
>>
>>>> mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i
>>
>> false
>>
>>>> mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~
>>>> /^application\/octet-stream.*\.pdf/i
>>
>> maybe true, maybe not. I would hope newlines were translated to
>> spaces by the mimehdr plugin, but maybe they weren't. Try /is instead
>> of /i and see if it helps.
>>
>>>> meta __TVD_MIME_ATT __TVD_MIME_ATT_AP ||
>>>> __TVD_MIME_ATT_AOPDF
>>
>> maybe true
>>
>>>> meta TVD_PDF_FINGER01
>> __TVD_MIME_CT_MM
>> true
>> && __TVD_MIME_ATT_TP
>> undefined here, can't say
>> && __TVD_MIME_ATT
>> maybe true
>> && !__TVD_BODY
>> true
>>
>> So, not knowing what is in __TVD_MIME_ATT_TP, I haven't a clue if it
>> will fire, since that is part of an 'and'. If I assume it to be true
>> then I'm still not sure because of the multiline possibility in
>> __TVD_MIME_ATT.
>>
>> Loren
>>
>>>> describe TVD_PDF_FINGER01 Mail matches standard pdf spam
>>>> fingerprint
>>
>>
>> ----- Original Message ----- From: "Jo Rhett" <jrhett@netconsonance.com>
>> To: "SpamAssassin Users" <users@spamassassin.apache.org>
>> Sent: Tuesday, August 14, 2007 10:16 PM
>> Subject: Re: PDF rule not matching -- split line content type?
>>
>>
>>> Can someone clue me in on why this rule isn't matching?
>>>
>>> Jo Rhett wrote:
>>>> So I've been getting a metric ton of PDF spam. Investigating the
>>>> rule that is supposed to match this, I see
>>>>
>>>> rawbody __TVD_BODY /\S{4}/
>>>> header __TVD_MIME_CT_MM Content-Type =~ /^multipart\/mixed/i
>>>> meta __TVD_MIME_ATT __TVD_MIME_ATT_AP ||
>>>> __TVD_MIME_ATT_AOPDF
>>>> meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM &&
>>>> __TVD_MIME_ATT_TP && __TVD_MIME_ATT && !__TVD_BODY
>>>> describe TVD_PDF_FINGER01 Mail matches standard pdf spam
>>>> fingerprint
>>>>
>>>> mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i
>>>> mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~
>>>> /^application\/octet-stream.*\.pdf/i
>>>>
>>>> The following message appears to match perfectly with this, except
>>>> for perhaps that the content type is spread across two lines? I
>>>> haven't checked the code, but would this matter?
>>>>
>>>> Return-Path: <Yohann@nic.za.net>
>>>> Received: from mail.netconsonance.com ([unix socket])
>>>> by triceratops.netconsonance.com (Cyrus v2.3.8) with LMTPA;
>>>> Tue, 14 Aug 2007 06:27:16 -0700
>>>> Received: from [84.21.29.58] ([84.21.29.58])
>>>> by mail.netconsonance.com (8.14.1/8.14.1) with ESMTP id
>>>> l7EDR4UU095951
>>>> for <jrhett@lizardarts.com>; Tue, 14 Aug 2007 06:27:08 -0700 (PDT)
>>>> (envelope-from Yohann@nic.za.net)
>>>> X-Virus-Scanned: amavisd-new at netconsonance.com
>>>> X-Spam-Score: 2.033
>>>> X-Spam-Level: **
>>>> X-Spam-Status: No, score=2.033 tagged_above=-999 required=4
>>>> tests=[DK_POLICY_SIGNSOME=0.001, HTML_MESSAGE=0.001,
>>>> MIME_HTML_MOSTLY=0.699, RCVD_IN_BL_SPAMCOP_NET=1.332]
>>>> Received: from x-6of7ca27m39al ([158.187.61.7]) by [84.21.29.58]
>>>> with Microsoft SMTPSVC(6.0.3790.1830);
>>>> Tue, 14 Aug 2007 15:27:01 +0200
>>>> Message-ID: <001201c7de76$b89554d0$3a1d1554@x6of7ca27m39al>
>>>> From: "Yohann michels" <Yohann@nic.za.net>
>>>> To: jrhett@lizardarts.com
>>>> Subject: bill-jrhett
>>>> Date: Tue, 14 Aug 2007 15:26:28 +0200
>>>> MIME-Version: 1.0
>>>> Content-Type: multipart/mixed;
>>>> boundary="----=_NextPart_000_000E_01C7DE87.7C1E24D0"
>>>> X-Priority: 3
>>>> X-MSMail-Priority: Normal
>>>> X-Mailer: Microsoft Outlook Express 6.00.2900.3138
>>>> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3138
>>>>
>>>>
>>>> ------=_NextPart_000_000E_01C7DE87.7C1E24D0
>>>> Content-Type: multipart/alternative;
>>>> boundary="----=_NextPart_001_000F_01C7DE87.7C1E24D0"
>>>>
>>>>
>>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0
>>>> Content-Transfer-Encoding: quoted-printable
>>>> Content-Type: text/plain;
>>>> charset=windows-1250
>>>>
>>>>
>>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0
>>>> Content-Transfer-Encoding: quoted-printable
>>>> Content-Type: text/html;
>>>> charset=windows-1250
>>>>
>>>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
>>>> <HTML><HEAD>
>>>> <META http-equiv=3DContent-Type content=3D"text/html; =
>>>> charset=3Dwindows-1250">
>>>> <META content=3D"MSHTML 6.00.2900.3132" name=3DGENERATOR>
>>>> <STYLE></STYLE>
>>>> </HEAD>
>>>> <BODY bgColor=3D#ffffff>
>>>> <DIV><FONT face=3DArial size=3D2></FONT>&nbsp;</DIV></BODY></HTML>
>>>>
>>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0--
>>>>
>>>> ------=_NextPart_000_000E_01C7DE87.7C1E24D0
>>>> Content-Transfer-Encoding: base64
>>>> Content-Type: application/octet-stream;
>>>> name=marketing-jrhett.pdf
>>>> Content-Disposition: attachment;
>>>> filename=marketing-jrhett.pdf
>>>>
>>>> JVBERi0xLjUNJeLjz9MNCjIyIDAgb2JqPDwvSFs0MzYgMTQ4XS9MaW5lYXJpemVkIDEvRSAxNjU5
>>>> L0wgMTM1NzYvTiAxMC9PIDI2L1QgMTMwNzQ+Pg1lbmRvYmoNICAgICAgICAgICAgICAgICAgICAg
>>>> *snip*
>>>>
>>>>
>>>
>>>
>>> --
>>> Jo Rhett
>>> Net Consonance ... net philanthropy, open source and other randomness
>>
>>

>
-- 
Jo Rhett
Net Consonance ... net philanthropy, open source and other randomness
Received on Thu Aug 16 02:43:19 2007
Do you need help?X

This archive was generated by hypermail 2.1.8 : Wed Oct 24 2007 - 07:16:38 EDT


Contact Us  Legal Notices  Order Services Online 
Pantek Home  Privacy Policy  IT news  Site Map  Pantek Library