CastleCops, Internet Crime Fighters
Need help? Click here to register for free! Absolutely zero advertisements on this site!

Donation/Premium
spacer
block bottom
Security Central
spacer
· Home
· PIRT/Fried Phish
· MIRT
· SIRT
· Deutsch
· Wiki
· Newsletter
· O16/ActiveX
· CLSID List
· Contest2007
· Downloads
· Feedback (send)
· Forums
· HijackThis
· Hijacktrend
· LSPs
· My Downloads
· O18
· O20
· O21
· O22
· O23
· O9
· Premium
· Private Messages
· Proxomitron
· Reviews
· Search
· StartupList
· Stories Archive
· Submit News
· WsIRT
· Your Account
· Acceptable Use Policy
block bottom
spacer spacer

Filter for multiple ASCII ( &# 033; - &# 126;)
Goto page 1, 2  Next
 
Post new topic   Reply to topic       All -> FavForums -> Product Suggestions [del.icio.us!] [digg it!] [reddit!]
View previous topic :: View next topic  
Author Message
denn988

Guest
IP: 66.44.*.*






PostPosted: Fri Dec 12, 2003 7:14 pm    Post subject: Filter for multiple ASCII ( &# 033; - &# 126;)
Reply with quote

If anyone is interested, I have a new RegExp filter that can be used to spot those messages where the Spammers try to obfuscate their SPAM using ASCII charactor codes. (&# 033 etc)

This Expression will look for any ASCII charactor that would normally not need to be encoded...charactors 33-126 listed as follows:

Code:

!"#$%&')*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{}~


The RegExp is triggered by the number of these encoded charactors it detects in the targeted text.

The RegExp is:

Code:
((�*(3[3-9]|[4-9]\d|1[01]\d|12[0-6]);).*?){6}


The above code is set for a sensitivitey of 6. It makes no difference what is in between the ASCII encoded charactors as long as there are 6 of the above charactors encoded into the body. You can change the sensitivity to anything you want by changing the iterator at the end {6}.

NOTE:

The ASCII encode charactor can contain any number of leading zeroes, or no leading zeroes. The filter takes care of them.


Sugested use:

The body...
contains RegExpr....
Code:
((�*(3[3-9]|[4-9]\d|1[01]\d|12[0-6]);).*?){6}


Mark for deletion.

Auto-delete if desired...but only after testing to ensure that you want to auto delete.

Back to top
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Fri Dec 12, 2003 7:37 pm    Post subject:
Reply with quote

I've been running a filter like this for several weeks now but have to date only had a single hit (# of encoded characters set to 15).

Back to top
View users profile Send private message
denn988

Guest
IP: 66.44.*.*






PostPosted: Fri Dec 12, 2003 8:23 pm    Post subject:
Reply with quote

Ikeb,

Is it limited to the charactors that would normally not require encoding?

If someone is encoding charactors that don't require encoding (which is what the filter I posted looks for) it is a BIG FLAG that it is SPAM.

A lower threshold than 15 can be used in that case. If you are looking for any encoded charactor....you probably do need a higher threshold to prevent false positives...and will have a less effective filter.

Back to top
denn988

Guest
IP: 66.44.*.*






PostPosted: Fri Dec 12, 2003 9:02 pm    Post subject:
Reply with quote

Ikeb,

For the last few weeks I have left extended error logging on so that I can see what was in the messages that my filters were auto-deleting.

One of the things that I have noticed is an increase in the kind of obfuscations that the filter that I am targeting is going after.

Right now it might only trap about 5% of the incoming SPAM...but as that kind of trick seems to be on the increase, it might be a good idea to get the filter ready for it.

I am not posting all of my filter strategies because I don't want to give too much to the Spammers who may be monitoring this forum.

My IANA-Reserved filter was one that I posted here, partly as a help to all....but also as a 'plant' to see if this forum was being watched.

That IANA-Reserved filter has been steadily decreasing in hits since I posted it here. Before I posted it, it was trapping more than 35% of all incoming SPAM....now it is trapping less than 10%. That filter has remained at the top of my list so it is the first thing that will trap anything that it is aimed for.

I have been studying the messages that have been auto-deleted and although the number of messages trapped by IANA-Reserved has been decreasing, the number of messages that have forged Received: lines in the header has not been.

This tells me that it is not the NEW Anti-Spam Law...and it's provisions for criminalizing the forging of header information that is responsible for the decrease in hits by IANA-Reserved. It is more than likely the compromise of the strategy to the Spammers.

I am willing to accept the loss of IANA-Reserved as an effective filter if the Spammers have responded to it. I will not accept the loss of certain other filter strategies though.

In the case of the ASCII encoding filter...I look at it this way. I want the Spammers to know that there is a means to trap that kind of obfuscation.

That is why I posted it.

It may not trap a lot in the future....but it may prevent a lot of that from being sent in the future also.

Back to top
stan_qaz

Premium Member


Joined: Mar 31, 2003
Posts: 10635

Premium

PostPosted: Fri Dec 12, 2003 10:13 pm    Post subject:
Reply with quote

Denn hits on the key problem with static filters, if you make them public the spammers will use them to test their outgoing e-mail until it passes before doing a spam run. Not sharing will make your filter more effective for longer unless it is generic enough that it can't be bypassed and that is almost impossible to do.

First Alert faces the same problem, spammers are probably on the beta testing team and on the current pre-release version working like the dickens to find a way to get past the system.

Baysean filters are usually unshared or only an initial set of data is shared and the user then further trains the filter system. The result is that a spammer can't train his system to beat your filters since he has no access to them.


_________________
Questions? Try the wiki
http://wiki.castlecops.com/MailWasher_Pro
Back to top
View users profile Send private message
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sat Dec 13, 2003 1:41 am    Post subject:
Reply with quote

stan_qaz wrote:
Denn hits on the key problem with static filters, if you make them public the spammers will use them to test their outgoing e-mail until it passes before doing a spam run. Not sharing will make your filter more effective for longer unless it is generic enough that it can't be bypassed and that is almost impossible to do.

First Alert faces the same problem, spammers are probably on the beta testing team and on the current pre-release version working like the dickens to find a way to get past the system.

Keep in mind that MailWasher is but one anti-SPAM impediment to SPAMers. I'm not so sure that MailWasher is commanding a lot of attention simply because the great majority of email users haven't even heard of MailWasher, never mind even thought of applying it.

stan_qaz wrote:
Baysean filters are usually unshared or only an initial set of data is shared and the user then further trains the filter system. The result is that a spammer can't train his system to beat your filters since he has no access to them.

That's the promise of Bayesian filters alright. The SPAMer would have to match the "good" words to each individual's list so as to balance off the "bad" words that are also being looked for -- a tall order. That's not to say they aren't trying. The POPFile forums continue to report some interesting strategies that SPAMers are employing in attempts to defeat Bayesian filters.

Back to top
View users profile Send private message
denn988

Guest
IP: 66.44.*.*






PostPosted: Sat Dec 13, 2003 1:49 am    Post subject:
Reply with quote

Stan,

One of the things that I did not like about the whole idea of CFS was that the Spammers will probably be monitoring how the system works by subscribing to it themselves.

The one thing that the system may have going for it is that only a certain amount of the 'fingerprint' strategy would need to be incorporated into the software that the user installs on their machine.

If Firetrust does it right, they could have one hell of an anti-spam system going with 'First Alert'. If they place too much of their strategy into the user's software....they are going to pay a price for it.

As far as sharing filters goes...I may not post all of my filter strategies here....but I will help out anyone I can....as much as I can....to develop any strategy they may suggest here.

I have seen some pretty nice strategies suggested by other users on this forum...some of which could be compromised by posting here, and some that would be unaffected.

The trick for keeping many of these filter strategies safe usually lies in the variations that can be developed. Once you have a filter strategy developed, you can post some of the basics....but keep the many possible variations of those basics to yourself. In that way you can help out as many other users as possible...and still keep the Spammers clueless as to what you are actually doing.

The people at the bottom of the filter food chain will derive some benefit from these posts, though not as much as those who will take the time to really look at and understand the filters, and their underlying stategies.

For those that do take the time, they will get the maximum from the filters posted to this forum.


But...They will have to figure out many of those higher level strategies on their own.


By the way...Ikeb is a fast learner. If he plays his hand right, he will be able to counter just about anything the Spammers might come up with. I think he realizes the difference between what tools and strategies he can safely put on the table....and what he should keep to himself.

Back to top
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sat Dec 13, 2003 1:55 am    Post subject:
Reply with quote

denn988 wrote:
Is it limited to the charactors that would normally not require encoding?

If someone is encoding charactors that don't require encoding (which is what the filter I posted looks for) it is a BIG FLAG that it is SPAM.

A lower threshold than 15 can be used in that case. If you are looking for any encoded charactor....you probably do need a higher threshold to prevent false positives...and will have a less effective filter.

No, my filter has been looking for any encoded character. I thought I'd start there but since I didn't find much encoding of any sort being used I figured it couldn't be too important to distinguish character types. Also I reasoned that if a SPAMer uses encoding, there would likely be many more than 15 characters encoded. Like I say, I haven't had a hit yet.

I'll give yours a spin though. If yours yields hits while mine continues to lolly gag, I'll dig a bit further. Wink

Edit: Whoops! I added your filter only to notice that my filter's regex was set to check only the Subject! Surprised Embarassed Actually it was set to look for 10 encoded characters but no wonder it wasn't finding anything! Rolling Eyes



Last edited by Ikeb on Sat Dec 13, 2003 2:53 am, edited 1 time in total
Back to top
View users profile Send private message
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sat Dec 13, 2003 2:40 am    Post subject:
Reply with quote

denn988 wrote:
By the way...Ikeb is a fast learner. If he plays his hand right, he will be able to counter just about anything the Spammers might come up with. I think he realizes the difference between what tools and strategies he can safely put on the table....and what he should keep to himself.

We all have to do our little bit I suppose.....

I really haven't been thinking about keeping my filters close to my vest ... other than the stuff I'm still working on and which may or may not pan out. I suppose that's mainly because I'm not convinced that SPAMers are all that worked up about MailWasher.

But if the SPAMers are listening in ... just wait until MWP implements Bayesian filters in combination with regex filters! I'm running a POPFile proxy in front of MWP and the combination of a POPFile "Unclassified" threshold of 1000000 (that is to say, a message has to be at least 1000000 times more "probable" of being in the most likely category than in the next most likely category), and ANDing the POPFile "SPAM" flag with a hit on any one of my top ten regex filters is a "guaranteed" SPAM!

That means I don't have to look at 95% of the SPAM at all. Those are gone. The remainder is "interesting" stuff that is fodder for training POPFile of course and for possible future regex filters. If the SPAMers need to defeat MWP to stay in business, they better shape up! Wink

Back to top
View users profile Send private message
stan_qaz

Premium Member


Joined: Mar 31, 2003
Posts: 10635

Premium

PostPosted: Sat Dec 13, 2003 3:16 am    Post subject:
Reply with quote

I'm just guessing but I'd be surprised if the folks writing the spamming software don't have a list of features that includes all the spam tools and filtering methods that it is designed to avoid.

Doing the research to avoid mailwasher would be too much for joe chickenboner but a major spam operation or a spam software vendor should find it well worth the effort.


_________________
Questions? Try the wiki
http://wiki.castlecops.com/MailWasher_Pro
Back to top
View users profile Send private message
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sat Dec 13, 2003 6:40 am    Post subject:
Reply with quote

Sure, I'll admit that's possible. But it still isn't going to make me "hold back" information that could help other users. I figure if everyone does the same, it will make life more difficult for SPAMers. By "holding back" it doesn't help me in the long run because I will have allowed SPAMers to successfully reach more email users, thus contributing towards even more SPAM in the future.

Back to top
View users profile Send private message
stan_qaz

Premium Member


Joined: Mar 31, 2003
Posts: 10635

Premium

PostPosted: Sun Dec 14, 2003 4:55 am    Post subject:
Reply with quote

I didn't suggest that you hold back information on filters, only that you be aware of the risks of releasing the information.

For the filter savvy it shouldn't be much more work to take a filter that is working for them and that they want to protect and make minor changes to it that don't make much difference in its effectiveness and post that version.

For lots of filters like the URL blocking one I got help with (and that is still my biggest hitter) public release doesn't matter in the least as each user adds the data needed to make it work for them.


_________________
Questions? Try the wiki
http://wiki.castlecops.com/MailWasher_Pro
Back to top
View users profile Send private message
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sun Dec 14, 2003 5:25 am    Post subject:
Reply with quote

stan_qaz wrote:
I didn't suggest that you hold back information on filters, only that you be aware of the risks of releasing the information.

For the filter savvy it shouldn't be much more work to take a filter that is working for them and that they want to protect and make minor changes to it that don't make much difference in its effectiveness and post that version.

I'm not sure that I'm clever enough to do that without outsmarting myself. Razz

stan_qaz wrote:
For lots of filters like the URL blocking one I got help with (and that is still my biggest hitter) public release doesn't matter in the least as each user adds the data needed to make it work for them.

If I wasn't so lazy, I'd implement one myself. I just hope FireTrust adds URL blacklisting as another filtering technique all on it's own. Such a blacklist would and should be widely circulated! We wouldn't want to "hold back" those domain names!

Sure the SPAMers could change domain names but it gets expensive. OK so $50 isn't all that much money but there's also paperwork and time required to make it happen. So, unlike the email address situation, changing the SPAMversized domain name is certainly less economical than adding that domain name to a shared URL blacklist!

Back to top
View users profile Send private message
stan_qaz

Premium Member


Joined: Mar 31, 2003
Posts: 10635

Premium

PostPosted: Sun Dec 14, 2003 6:46 am    Post subject:
Reply with quote

Oh how I wish it was $50 for a domain name, if you don't mind a spammy host you can bulk but .biz domain names for a tenth of that or less. Don't have the link handy but a google on bulk registry and biz might turn up some scary numbers.

I'd like to find a way to capture the spamcop spamvertized domain listing on a regular basis off their web page and plunk that into a filter. Even better would be a 7 day running list that you could update as often as you liked, sort of like the RBL listings work now.


_________________
Questions? Try the wiki
http://wiki.castlecops.com/MailWasher_Pro
Back to top
View users profile Send private message
Ikeb

Special Response Team
Forums Admin

Joined: Apr 20, 2003
Posts: 16535

Forums Admin Moderators MVP Premium SRT Team CC Committee Team F@H

PostPosted: Sun Dec 14, 2003 8:59 am    Post subject:
Reply with quote

stan_qaz wrote:
Oh how I wish it was $50 for a domain name, if you don't mind a spammy host you can bulk but .biz domain names for a tenth of that or less. Don't have the link handy but a google on bulk registry and biz might turn up some scary numbers.

Hmmm. So they're pretty cheap eh? Crying or Very sad Still costs money. I like it when SPAMers are forced to shell out. Mr. Green

stan_qaz wrote:
I'd like to find a way to capture the spamcop spamvertized domain listing on a regular basis off their web page and plunk that into a filter. Even better would be a 7 day running list that you could update as often as you liked, sort of like the RBL listings work now.

Surely not the listing from http://www.spamcop.net/fom-serve/cache/340.html. I'd have to save lots of pennies for that! Shocked Evil or Very Mad

Or are these the sites reported at http://www.spamcop.net/w3m?action=inprogress&type=www ? OK so a page download every half hour, consolidation of new URLs with existing ones, and output of a parsed list for copy and paste into filter.txt will do the trick! I'll get right on that. Rolling Eyes

Back to top
View users profile Send private message
Display posts from previous:   
Post new topic   Reply to topic       All -> FavForums -> Product Suggestions All times are GMT
Goto page 1, 2  Next
Page 1 of 2

 
Quick Reply:
Username: 

Quote the last message
Attach signature (signatures can be changed in profile)
 
You can post new topics in this forum
You can reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You can attach files in this forum
You can download files in this forum


Powered by phpBB © 2001 phpBB Group
spacer spacer