Hi Barry,
>>Actually, longer documents are *less* likely to trigger spam filters,
because spams are generally pretty short, and the algorithms account for
that implicitly and explicitly as needed.
Is this your opinion or can you point me to some documentation to back this
up?
Spam filters look for unique words or combinations of words so the longer
the message the more likely it is to contain those triggers.
In an article on spam filters Paul Graham (http://www.paulgraham.com/spam.html)
wrote:
"It discovered, of course, that terms like "virtumundo"
and "teens" were good indicators of spam. But it also discovered that "per"
and "FL" and "ff0000" are good indicators of spam. In fact, "ff0000" (html
for bright red) turns out to be as good an indicator of spam as any
pornographic term."
In his paper "Introduction to Latent Semantic Analysis" (http://lsa.colorado.edu/papers/dp1.LSAintro.pdf)
he shows how text can be evaluated on the basis of words or word pairs which
confirms the supposition that the longer the message the more likely to get
caught by the filter.
Programs like Spam Blocker analyze the messages for spam content.
Aldo's Spam Cleaner states it uses an innovative filter of allowed and
banned keywords, phrases, users or domains.
AOL's spam filter instructions say :"You also can banish all messages
containing clickable hyperlinks and any messages containing certain words
that you list."
In each of these cases the longer the email the more likely it will contain
spam triggers.
>>See much 24 MB spam? I didn't think so :)
Actually the newsletter is only 24mb in it's PDF format, which can only be
sent as an attachment and not imbedded in the body of the email.
If this file were converted to HTML or straight text for the emailed
newsletter then it would be much smaller.
Reg
First page SEO
results.
Content Management Systems, SEO,
Design, Marketing, and webmaster's resource .
MetaCryL - Business Promotion - We JV.
Interested?
-
Zero Grief Budget Hosting - World Class Support
See my
Ryze Network, "Telling it Straight".