Elliott C. Back: Internet & Technology

Here’s one way to beat Bayesian and other content filtering

Posted in Spam by Elliott Back on October 16th, 2005.

Bayesian Filtering Doesn't Work

Just add enough realistic text at the bottom to balance out the img tag, and now you can include arbitrary spam content without a keyword penalty!
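To see why the padding shifts the score, here is a minimal sketch of how a naive Bayes filter might combine per-token evidence. The token table and its probabilities are made up for illustration, and `spam_probability` is a hypothetical helper, not code from Thunderbird or any other real filter.

```python
import math

# Hypothetical per-token spam probabilities, as a trained filter might
# hold them. These numbers are invented for illustration only.
TOKEN_SPAM_PROB = {
    "img": 0.90,      # messages containing an image tag were mostly spam
    "meeting": 0.10,  # ordinary words seen mostly in legitimate mail
    "thanks": 0.15,
    "report": 0.20,
}

def spam_probability(tokens):
    """Combine per-token probabilities naive-Bayes style (equal priors)."""
    log_odds = 0.0
    for tok in tokens:
        p = TOKEN_SPAM_PROB.get(tok)
        if p is None:
            continue  # token never seen in training: no evidence either way
        log_odds += math.log(p) - math.log(1.0 - p)
    return 1.0 / (1.0 + math.exp(-log_odds))

image_only = ["img"]
padded = ["img", "meeting", "thanks", "report"]

print(round(spam_probability(image_only), 3))
print(round(spam_probability(padded), 3))
```

With these invented numbers the image-only message scores about 0.9, while the padded one falls well below 0.5, because each realistic word contributes evidence in the other direction.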

This entry was posted on Sunday, October 16th, 2005 at 7:47 am and is tagged with img tag.

4 Responses to “Here’s one way to beat Bayesian and other content filtering”

  1. Elliot Lee says:

    In your screenshot, “Thunderbird thinks this message is junk.” Looks like it worked after all!

    And if not, I don’t see why Bayesian filters don’t consider image tags as well. Sure, you might not be able to decipher the text on the fly; but you know there’s an image there, and the size of the image. If it’s just a little clipart or icon, it could pass. You could then program your filter to look down on messages that start off with a large image, for example.

  2. Steven says:

    This is not really true. Get a better Bayesian filter. Words that are not in your corpus as either good or bad will not be used by the Bayesian filter to judge the message one way or another.

  3. Elliott Back says:

    What’s sad is that this is not even the best way to beat Bayesian filters… just a rather clever dumb one.

  4. Marco says:

    I got many, MANY prescription-med spams last year using this technique. They just add a load of garbage text and it beats any spam filter.
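
Steven's point and Marco's experience can both hold, depending on what the padding is made of. A rough sketch, assuming a filter that skips tokens it has never seen and scores only the most telling ones (the table, the `score` function, and the `max_tokens` cutoff are all hypothetical):

```python
import math

# Hypothetical trained probabilities, invented for illustration.
TOKEN_SPAM_PROB = {"img": 0.90, "pills": 0.99, "meeting": 0.10}

def score(tokens, max_tokens=15):
    """Score a message using only known tokens, strongest evidence first."""
    # Tokens missing from the table are dropped: they carry no evidence.
    known = [TOKEN_SPAM_PROB[t] for t in set(tokens) if t in TOKEN_SPAM_PROB]
    # Keep the tokens whose probability is furthest from the neutral 0.5.
    known.sort(key=lambda p: abs(p - 0.5), reverse=True)
    log_odds = sum(math.log(p / (1.0 - p)) for p in known[:max_tokens])
    return 1.0 / (1.0 + math.exp(-log_odds))

spam = ["img", "pills"]
garbage = ["xqzv", "blorf", "wibble"] * 50   # words the filter has never seen
ham_padding = ["meeting"]                    # a word the filter trusts

print(score(spam))
print(score(spam + garbage))      # same score: unknown words are ignored
print(score(spam + ham_padding))  # lower score: known good words do count
```

In this sketch random gibberish changes nothing, which matches Steven's comment, while padding built from words the filter already associates with legitimate mail does pull the score down.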
