|
Donation/Premium |
|
 |
|
|
|
|
|
|
|
 |
 |
| View previous topic :: View next topic |
| Author |
Message |
Bill_Bright
General
 Premium Member
 Joined: Jan 16, 2004 Posts: 9046 Location: Nebraska, USA
|
Posted: Sat Aug 11, 2007 11:44 pm Post subject: |
|
|
As a Beta tester for MailWasher, among other applications, I have been testing these filters for over a week now. While I appreciate all the hard work that went into them, I am terminating my test and uninstalling the entire Wizcrafts filter set. My conclusion is the Wizcrafts Filters add no value, falsely tag an unacceptable level of legitimate emails as spam, and actually increase workload and decrease user productivity.
Note the following:
- I had to delete the following filters out of the set because they repeatedly tagged legitimate email from various places in the US as spam:
- Blocked Countries
- APNIC (Asian-Pacific)
- Pill Filter
- HTML Tricks (tagged multiple newletters, including the latest from Zonelabs)
- Misspelled Drug (this actually tagged today's Mailwasher Pro summary for the last week!
)
Fortunately, I had already gone through the filter set and changed all the automatic bouncing, blacklisting and deleting or I would have lost important legitimate emails! See below for more details.
- I have just two of my own filters. One tags my SpamCop reporting emails, and the other tags spam notifications from my ISP. The Wizcrafts filters have yet to be the sole method used to identify real spam - that is, all spam identified by Wizcrafts filters were also identified by MailWasher's own Learning Filter, DNSBL servers, and/or my 2 previously mentioned simple filters. In other words, they would have been caught anyway, without placing any performance burden on MWP.
- Since using the Wizcrafts filter set, I continue to get untagged spam at about the same rate, 1 or 2 out of 100/day - the same with or without the Wizcrafts filters. They added no advantage/value.
- I reported to my colleagues earlier in the week my displeasure at seeing the bounce feature frequently, though inconsistently, used throughout the feature set. I am happy to see that the bounce feature was removed from the filter set on Aug 8 - although the reason given in the filter set is not entirely accurate. Yes, the address is usually forged, meaning the bounce will go to the wrong (likely innocent) address, but bouncing also adds more unwanted email on the already crowded Internet backbone, wasting even more bandwidth. Bouncing email is also a good way to get your address added to blacklists! For anyone creating their own filters, see What is Wrong With Bouncing Spam to see why bouncing spam is a bad idea.
- There are many other inconsistent use of options that make managing the filter set difficult at best - especially considering that NO spam filter or spam blocker program is perfect, and there can be no guarantee of no false positives. For example, within the 9 Image Spam filters, some set the spam for Hidden, some not. Some set the spam for Automatic, some do not. The inconsistencies make understanding what is happening difficult.
- The use of "Automatic Process" is unacceptable! You should NEVER use filters set to automatically delete a message until you are 100% certain no false positives/legitimate email will be irretrievably lost.
- Automatically hiding email is almost as dangerous as automatically deleting. Unless you are sure what the filter is hiding, don't hide.
- There is an apparent haphazard and inconsistent use of the "Blacklist" feature. It is important to understand that many experts agree blacklists have limited effectiveness in detecting spam. This is because even wannabe badguys dump addresses after a day or two as they are aware that once the address is known, it will be worthless. So adding a worthless address to your blacklist simply clutters up your blacklist and Mailwasher. For this reason, I do not use, nor do I recommend the use of blacklists. I do use white/friend lists, as changes to that are infrequent and easy to manage.
The increase workload/decrease user productivity when using this filter set occurs as it requires considerable user intervention to ensure no good email is falsely tagged, processed, blacklisted, and/or deleted.
Filters are great, but I think too much emphasis (often obsession) is placed on creating a huge set of filters that can only attempt to catch 100% of all spam, 100% of the time, with NO false positives. That's just unrealistic. Once again, I appreciate and recognize all the hard work that went into creating/consolidating this filter set, but there are just too many inconsistencies, false positives, and improper use of automatic processing and blacklisting to make this filter set worthwhile. There is too great a risk that important legitimate email will missed.
JMHO and YMMV. _________________
Bill, AFE7Ret
Freedom is NOT Free!
|
|
| Back to top |
|
 |
Wizcrafts
Sergeant
 Premium Member
 Joined: Jun 05, 2003 Posts: 95 Location: Michigan
|
Posted: Sun Aug 12, 2007 1:24 am Post subject: |
|
|
Bill;
Thanks for your astute input. I realize that some of my filters need to be readjusted and I have been working on that, one at a time, in between other pursuits. I will remove automatic deletions, blacklisting and hiding from the public filters I publish and get them online, hopefully this weekend. The reason there are so many auto delete filters is because that's how I use them and they are extremely effective for me. When I update a filter rule I normally copy and paste it into the published list as is. I will undertake to cleanse them of automatic deletions and blacklisting in the future. As you noted I have already removed all bounce directives.
Concerning false positives, I get them occasionally as well and when I do I try to fix the bad rule that triggered the flag. The HTML Tricks filter is one that I work on frequently, which is why I moved it to the bottom of the list, in filters2.txt. It is still useful for detecting spam that the rest of my filters, and the learning filter have missed. That filter is not set to hide flagged messages, just to mark them for manual deletion.
You are correct about the increase in processing load and I have mentioned this several times. It is something I work on to the best of my abilities. I will continue to make my (sanitized) filters available for those who want to use them, as long as I use MailWasher Pro, which will probably be for a long time.
Thanks again for letting me know about these problems. _________________ Submitted by Wiz
Guarding the Castle against spammers and scammers
|
|
| Back to top |
|
 |
Wizcrafts
Sergeant
 Premium Member
 Joined: Jun 05, 2003 Posts: 95 Location: Michigan
|
Posted: Sun Aug 12, 2007 2:25 am Post subject: |
|
|
After considering Bill_Bright's comments and feedback from other MailWasher Pro users, I have just removed all instances I could find of the action directives "Automatic," "Blacklist," and "Hidden," in the MailWasher Pro Filters on my website. This includes both the long and short filter sets. I will keep one extra set online for those who prefer to have those (drastic) actions by default. Send me a PM for the URL to that set of filters.
I hope that this will minimize problems caused by false positives, as you will now have a chance to review the files that have been flagged as spam and decide if they really are spam. If not, mark them as legitimate and add them to your Friends List.
If you find that some of my filters are indeed 100% correct in ID-ing spam, edit the actions as you wish, from within the MailWasher Pro interface, via the Filter Sidebar. Use Control + F7 to display the sidebar, then click on the Filters tab. _________________ Submitted by Wiz
Guarding the Castle against spammers and scammers
|
|
| Back to top |
|
 |
Bill_Bright
General
 Premium Member
 Joined: Jan 16, 2004 Posts: 9046 Location: Nebraska, USA
|
Posted: Sun Aug 12, 2007 4:28 am Post subject: |
|
|
Thank you for addressing these issues so quickly! | Quote: | | I hope that this will minimize problems caused by false positives | It is important for all to understand and accept that total elimination of all false positives is an impossible goal. And it is also impossible for any spam blocker to correctly tag every piece of spam, as spam, 100% of the time. Therefore, it is inevitable that we will occasionally get a false positive, and we will also occasionally get spam that makes it past all the filters. That's the beauty of MailWasher; unlike all other spam blocker programs, the unidentified spam that makes it past all the filters still remains on the server, and is not downloaded on to your PC and already in the front door, if you will.
Many of us consider email an invaluable communications tool and the risk of losing even a single email from an important client or contact is too great. It is because of those two unobtainable goals that automatic processing/deleting of email should never be done. Of course, if you don't consider your email important and don't care if you lose some of it, those may be acceptable risks.
But then there are always exceptions. If you want to automatically delete all email from your X's email address, for example, you can create a filter that addresses that specific parameter. But, when dealing with variables, such as these filters, that's where automatic processing/deleting becomes an issue.
I think with the changes you have made, you eliminated the dangers of the filters. Added value vs added workload managing them is still a factor that one would have to weigh. Although your filters certainly identified most of the spam I received, as I mentioned before, the other spam tools within MWP tagged the same ones too.
That said, once again, I appreciate all the hard work that went into compiling the set and I will try them again soon. If nothing else, seeing the Junk Mail pie chart breakdown in statistics is interesting.  _________________
Bill, AFE7Ret
Freedom is NOT Free!
|
|
| Back to top |
|
 |
davews
Sergeant

 Joined: May 03, 2003 Posts: 75
|
Posted: Sun Aug 12, 2007 6:05 am Post subject: |
|
|
| Wizcrafts wrote: | | davews wrote: |
By the way, when editing filters.txt it should be noted that it is in MW's internal format which is UTF16 (16 bit encoding). Most text editors including current versions of Windows Notepad will open this properly, but some programs only accept 8 bit coding. I found this a problem with Notepad+, an excellent Notepad replacement, which does not recognise UTF16 and I had to use an alternative.
Dave |
Dave;
Thanks for that behind the scenes information. Even I didn't realize that about the 16 bit encoding. I use NoteTab Pro for all of my text and HTML editing and it saves MailWasher filters properly as well. |
I would add that MW converts any text you paste as 8 bit into 16 bit the first time you run it with that filter. If you have existing filters already in the file you will see those as just garbage when you open it with an 8bit only text editor. Paste the new filters in (as 8 bit) and you will lose those existing filters when you save the file. Be warned, it happened to me and I had to retrieve them from a backup on my other machine...
I read with interest the comments from Bill Wright about your filters. I tend to agree with him, the Bayesian copes with 99% of my spam, I only use your two PDF ones and one of the ZIP ones to cover the ones it misses. But the learning seems to be doing the job for some of those now as well, and trying to keep up with changing spam patterns seems a lot of work for just a couple of extra manual ticks a day.
|
|
| Back to top |
|
 |
Ikeb
Special Response Team Forums Admin
 Joined: Apr 20, 2003 Posts: 16543
|
|
| Back to top |
|
 |
JEfromCanada
Trooper

 Joined: Mar 25, 2004 Posts: 19 Location: Canada
|
Posted: Sun Aug 12, 2007 2:38 pm Post subject: I'm liking what I see so far |
|
|
I've been using these filters now for about a day. By removing the HIDDEN and AUTOMATIC directives, I was able to examine the effects of your filters.
I have been very impressed at its ability to remove ALL of the newer spam types. In my limited time, I have received NO false positives, and NO spam messages have made it through the added layer provided by your filter.
I am trying to decide whether to delete any of my own filters due to the superior coverage of yours; however, with 15 email accounts and an average of 300-400 spam mails received daily, the extra processing time added by the new filters with their regular expressions are adding about 3 minutes to the already lengthy processing time each time I read email.
That may also be affected by my decision to scan the maximum number of lines per email, and keep my deleted email records for the maximum allowable time.
Edit: My statement about false positives was a bit incorrect. I happen to correspond with a few people in Asia and the Middle East, but hadn't actually added them to my friends list. I needed to do that after installing your filter due to the country filtering.
|
|
| Back to top |
|
 |
Bill_Bright
General
 Premium Member
 Joined: Jan 16, 2004 Posts: 9046 Location: Nebraska, USA
|
Posted: Sun Aug 12, 2007 3:17 pm Post subject: |
|
|
| Ikeb wrote: | Actually it's Bill Bright (he's sharp but not always (w)right? ) | Thanks - I think.
| Quote: | The whole idea is to multiply probabilities and thus reduce FPs sufficiently enough that automatic deletion is a reasonable proposition. I've estimated the overall FP rate to be less than one in 10,000 msgs ... better than my eyball FP rate I figure.  | I don't know if I can claim a <.01% FP rate, but I can claim that prior to installing this filter set, the only FP I got in the 5 months prior to this was caused by my ISP's spam blocker tagging my ISP's own newsletter!
I do have an extensive friends list and that certainly picks the vast majority of my "keepers". And, of course, unless otherwise specified in a particular filter, the Friends list takes precedence over all other filters - this fact alone eliminates nearly all potential FP incidents. (See note below)
You say your filters are better than your eyeball or manual checking, I guess I would say, "how do you know?" I personally think my manual scan rate is darn close to 100% accurate - I think it is pretty easy to spot legitimate emails as legitimate senders don't tend to use "obfuscation fingerprints" or the like, and instead, the senders (mostly my contacts or forums/sites/newsletters I visit/subscribe to), tend to use recognizable addresses and subject titles. I check my email several times a day - in fact MWP is up and running almost 24/7 (which is why the memory leak issue is an issue - but for another discussion) so scanning through a dozen or so, already sorted, emails takes just a couple seconds. The vast majority of the time, all I have to do is click on the top spam message, hold down the shift key and click on the last spam message, tic one of the SpamCop column boxes and hit process. All that's left is the good stuff.
The point I was making earlier is while I was testing these filters, I made it a point to hover over the Status field and see what tagged each of the spam messages. And in every case, those tagged by one of these filters were also tagged by one of MWP's native spam tools.
I wonder what your FP rate would be if you disabled most of your filters that key on "variables", and instead use MWP's spam tools and your friends list? My bet is it will still be very near 100% accurate.
I will also add this, I am not a programmer. I don't like writing code so when possible, I avoid writing code - writing regex filters is writing code. Some folks like and enjoy writing code, others don't. You wrote or modified your regex filters for you. You understand them. That makes a world of difference when placing trust in them.
****
Note: This illustrates yet another inconsistency in the Wizcrafts filters I missed in my earlier review. The Japanese filter "TakesPrecedence" setting is set, yet the Chinese, Turkey, Russia, Nigerian, Korean and other geographical filters are not!?? I still have military friends stationed in Japan - and Korea and Turkey too. The Dangerous Attachment filter is another set to take precedence over the friends list if it finds a .exe or .bat file, for example, but the Zip filter does not. Downloadable filters should not be set, by default, to override user's Friends lists. And of course, in the case of attachments, everyone uses a good, updated AV scanner to scan all attachments before opening, right? That's just part of practicing safe computing! _________________
Bill, AFE7Ret
Freedom is NOT Free!
|
|
| Back to top |
|
 |
Wizcrafts
Sergeant
 Premium Member
 Joined: Jun 05, 2003 Posts: 95 Location: Michigan
|
Posted: Sun Aug 12, 2007 9:14 pm Post subject: |
|
|
| Bill_Bright wrote: |
Note: This illustrates yet another inconsistency in the Wizcrafts filters I missed in my earlier review. The Japanese filter "TakesPrecedence" setting is set, yet the Chinese, Turkey, Russia, Nigerian, Korean and other geographical filters are not!?? I still have military friends stationed in Japan - and Korea and Turkey too. The Dangerous Attachment filter is another set to take precedence over the friends list if it finds a .exe or .bat file, for example, but the Zip filter does not. Downloadable filters should not be set, by default, to override user's Friends lists. And of course, in the case of attachments, everyone uses a good, updated AV scanner to scan all attachments before opening, right? That's just part of practicing safe computing! |
Bill;
The "Takes Precedence Over Friends" in the Japanese filter was a mistake. I have edited it out of that rule and uploaded the corrected rule to the public filters.
In searching for instances of "Takes Precedence over Friends List" I found a rule for "Restored by MailWasher," which must remain in place to allow messages restored from the MWP Recycle Bin to be displayed. Phishing scams that forge eBay as the sender need to take precedence if you are an eBayer and have added the entire ebay.com domain to your friends list. I also removed "Takes Precedence" from another filter, which I am watching. The spammer has forged my own domain in the past.
I am still reviewing the use of Takes Precedence in the virus filters, all of which are in the larger filters.txt only. This is because they target specific (and by now outdated) Worms or viruses that harvest email contacts from the Windows Address Book and send copies of themselves to known contacts, using the known sender's account and possibly their name in the From field. By having "Takes Precedence over Friends List" checked I was able to weed those messages out when they were sent in the (possibly forged) name of known contacts or friends, who may or may not have been infected with email harvesting Worms.
I am in the process of refining those virus filters right now. I may remove ".exe" from the "Dangerous Attachments" rules and create it's own filter, just to alert you to the presence of the .exe attachment coming from someone on your Friends list. I am also looking into removing "Delete" from some of the Virus filters, just leaving them flagged as containing possibly hostile attachments (already done on the Zipfile Attachment filter and uploaded this afternoon). I'll post again when I figure out the best way to deal with the remaining few filters in question.
Again, the Virus and Worm filters are in the larger set; filters.txt, not in the smaller more current set; filters2.txt. The only virus filter in the "current filters" set, at this moment, is to detect the Beautiful Screensaver Trojan-in-a-zipfile threat. I am going over the entire large filters.txt to weed out useless outdated stuff, but this is a slow process, one line at a time.
The "Blocked Countries" rule is based on spam I receive from overseas, where I have no friends or contacts in those countries and only spam comes to me from those places. The Blocked Countries rule should be scrutinized by users before being applied pell mell. As it is - it only flags and marks for manual deletion. All of the filters targeting specific countries should be also reviewed by each user to ensure that they aren't going to mark desired foreign messages as spam. If you regularly receive email from a Country that is targeted by a particular filter, disable that filter!
Note: The reason for placing the various Country filters and the HTML Tricks filter near the bottom of the list is so they can act as last chance filters for variants of spam that pass all known spam rules. This happens to me frequently when a new variant of spam doesn't match my preset conditions. 99% of those messages are flagged by one of the country, or blocked countries filters, or by the HTML Tricks filter.
BTW: Today I moved the Blocked Countries filter to the bottom of the list, just below HTML Tricks. I also rearranged and re-labeled others, to show that they contain suspicious link destinations (geocities.com and .hk links). All of the email I receive containing links to these URLs is pure spam; some of it really nasty stuff. Your mileage may vary.
This is still a work in progress (my MWP filters) that is adapting as quickly as possible to changes in spam techniques. For the longest time I kept these filters to myself. It is inevitable that since making them available to the public that they will come under tough scrutiny and criticism. My goal here is to try to help as many MWP users as possible suppress as much spam as possible, without deleting legitimate messages in the process. I don't want to throw out the baby with the wash water! This is a hell of a task which I have undertaken voluntarily.
One good thing about my filters, despite all of their shortcomings, is that if a DNS blocklist provider goes offline, or folds (it happens), and the Bayesian filter doesn't recognize a new variant as spam (this also happens), my regular expressions filters may well recognize it and flag it as spam, or delete it, as determined by the end user. That makes me smile to myself. _________________ Submitted by Wiz
Guarding the Castle against spammers and scammers
|
|
| Back to top |
|
 |
Bill_Bright
General
 Premium Member
 Joined: Jan 16, 2004 Posts: 9046 Location: Nebraska, USA
|
|
| Back to top |
|
 |
Wizcrafts
Sergeant
 Premium Member
 Joined: Jun 05, 2003 Posts: 95 Location: Michigan
|
Posted: Mon Aug 13, 2007 2:45 am Post subject: |
|
|
Thanks Bill, I appreciate your encouragement. I guess I spend too much time under the hood tinkering with the timing to appreciate the problems others might have driving this hotrod. I'll keep at it! _________________ Submitted by Wiz
Guarding the Castle against spammers and scammers
|
|
| Back to top |
|
 |
Ikeb
Special Response Team Forums Admin
 Joined: Apr 20, 2003 Posts: 16543
|
Posted: Mon Aug 13, 2007 5:43 am Post subject: |
|
|
| Bill_Bright wrote: | I don't know if I can claim a <.01% FP rate, but I can claim that prior to installing this filter set, the only FP I got in the 5 months prior to this was caused by my ISP's spam blocker tagging my ISP's own newsletter!
I do have an extensive friends list and that certainly picks the vast majority of my "keepers". And, of course, unless otherwise specified in a particular filter, the Friends list takes precedence over all other filters - this fact alone eliminates nearly all potential FP incidents. (See note below)
You say your filters are better than your eyeball or manual checking, I guess I would say, "how do you know?" I personally think my manual scan rate is darn close to 100% accurate - I think it is pretty easy to spot legitimate emails as legitimate senders don't tend to use "obfuscation fingerprints" or the like, and instead, the senders (mostly my contacts or forums/sites/newsletters I visit/subscribe to), tend to use recognizable addresses and subject titles. |
Indeed my eyeball error rate is purely a guess on my part but keep in mind that I may not always be as attentive as I should be. (I have been known to nod off at the keyboard once in a while eh )
| Bill_Bright wrote: | The point I was making earlier is while I was testing these filters, I made it a point to hover over the Status field and see what tagged each of the spam messages. And in every case, those tagged by one of these filters were also tagged by one of MWP's native spam tools.
I wonder what your FP rate would be if you disabled most of your filters that key on "variables", and instead use MWP's spam tools and your friends list? My bet is it will still be very near 100% accurate. |
Not sure what you mean by "variables" ... once I've soaked a filter for a month or so, add the POPFile rule and turn on the automatic switch, I pretty much put it out of mind. Re: Friends list, actually I haven't relied on it (all but a few filters override Friends List). My reasoning is that if I miss a friend's email (which shouldn't happen but there is a .01% chance I might), my friend will catch up to me by some other means: it's the person who I don't know well, someone who might send me something out of the blue that I don't want to miss.
| Bill_Bright wrote: | | I will also add this, I am not a programmer. I don't like writing code so when possible, I avoid writing code - writing regex filters is writing code. Some folks like and enjoy writing code, others don't. You wrote or modified your regex filters for you. You understand them. That makes a world of difference when placing trust in them. |
Yup. Agree 100% with these sentiments. What I'm doing is just for geeks .. unfortunately. I wish that someone would come up with a regex filter GUI (or whatever you want to call it) that could input layman-level logic and spit out the filter. Nothing like that exists I'm afraid.
All that said, dealing with FPs would be made so much simpler if some sort of weighting scheme were added to MWP. No need to fart around with an external app like POPFile plus the ability to factor in DNSBL findings to the outcome would help as well.
|
|
| Back to top |
|
 |
Wizcrafts
Sergeant
 Premium Member
 Joined: Jun 05, 2003 Posts: 95 Location: Michigan
|
Posted: Mon Aug 13, 2007 7:07 am Post subject: |
|
|
I just finished refining the (virus/worm) various attachments filters and trimmed them down to only 4 rules. I have tested them and they seem to work as designed, but I'm open to suggestions after you test them. Note that I have kept the Friends List override on all 4 if only to alert the user that somebody has sent him/her an attachment that is a .exe, or .zip, or .rar. or .gz. There is no action preselected for the two filters dealing with those file types. OTOH. there are two filters that detect truely dangerous attachments and combinations of extensions, and these are marked for (manual) deletion.
The reason I left the Takes Precedence option checked is because viruses and Worms still arrive with a From address or domain that many people may have on their friends lists, or even from a forged account on your own domains. I am open to arguments against having these overrides in place, for some or all of these 4 filters, but it seems prudent to me.
I'm finished for tonight and will take up the project tomorrow. zzzzzzzzzzz _________________ Submitted by Wiz
Guarding the Castle against spammers and scammers
|
|
| Back to top |
|
 |
mlrichardson
Captain
 Premium Member
 Joined: Jun 16, 2005 Posts: 424 Location: Capitola, California, USA on the shores of the blue - and cold - Pacific
|
Posted: Mon Aug 13, 2007 8:07 am Post subject: A new filter set for MWP users brought to you by Wizcrafts! |
|
|
Not necessarily anything new, but I thought I'd add my perspective to the discussion on filters, etc.
First, I rely heavily on the Bayesian filters in the Learning Tool. I figure 95% of my selections are taken care of before I even see the emails. My only filters are some specifics to keep multiple addresses that really belong to one organization together - so I can see who it is in the Status column, by name, and the filter to allow retrieval of emails MW has deleted, plus firends and (a very shortlived) blacklist. I then do a quick scan of the sorted emails with most of my attention paid to the ones not marked for deletion. I also use PopFile, but I remember a mention earlier on, I think in this thread, about dumping all the spam/viral/any other junk at the ISP's server. That adds greatly to my peace of mind. If a piece of junk of the viral kind makes it to my computer, I'm not happy. That's too close. Hopefully PopFile will then ID it, but I'm not about to open it to find out if it is really viral. I won't even do that if the email is on the ISP server, since opening the email can, in some cases, initiate the virus. At least, if it's id'ed on the server I won't ever even have it on my hard drive. I really only use PopFile as a truly last resort to separate emails I'd like to decide on versus friends and those I definitely want to read. By the time they get to PopFile, the numbers are much smaller (approx 10% of what shows up at the server), and I don't mind manual selection.
I have to add: I'm a retired programmer/analyst and have done my share of code. But I don't even like Regex. I didn't when I had to use it professionally. Because I'm retired, I have the time to manually refine my email filtering. And since I'm only using MWP on one account, I'm not taxing anything very much. I know others have lots more email and need to automate as much as possible, but I agree with everything said about FP's and the need for some refined selection rather than automatically deleting anything. If I used email to do a part of my business life, I would never have anything auto-deleted. Too much of a chance a piece of vital email could go missing without me knowing it. If there are dollars associated with that email, woe is me. I'm the only one I want making a decision like that. _________________ Mike Richardson
MS Windows XP, SP2;
Vipre (Beta) Anti-Virus & Anti-Spyware
Firefox 3.0, Thunderbird 2, MWPro 6 (beta), WP Office X3
It is for the superfluous we sweat.
-Seneca (the younger)
|
|
| Back to top |
|
 |
Bill_Bright
General
 Premium Member
 Joined: Jan 16, 2004 Posts: 9046 Location: Nebraska, USA
|
Posted: Mon Aug 13, 2007 1:47 pm Post subject: |
|
|
| Ike wrote: | | Not sure what you mean by "variables" | Perhaps this illustrates my ignorance with coding - but back in my AIX days, (IBM's version of UNIX for those that blocked that out ) a regex string was used to find every instance of a particular pattern, much the same way as DOS wildcard notations match on a pattern - only much more powerful.
By "variable" I mean, not a "constant". For a simple example, if I want to delete the text file "filename.txt", entering the "variable" expression: del *.txt will delete every .txt file in that directory including filename.txt - the danger being that it may delete a .txt file I did not want to delete. Consequently, the use of wildcards (and in my mind, regexps) must be done with great care and understanding of what exactly may happen.
While I know my way around the DOS command prompt, regex is much more powerful and dangerous and therefore, rather than using the "variable" del *.txt to delete the file "filename.txt", it is much safer using del filename.txt instead knowing that "constant" will delete that one file, and only that one file - no variables.
So, as with wildcards, it seems to me using regex to tag something for deletion comes with the risk that pattern matching may match a keeper. And with that possibility comes the requirement for some sort of recovery. There are undelete utilities for DOS/Windows, but with a regex filter set to auto-process, and no admin access to my ISP's mail server, where's the ability to recover?
Sure, a friend or family member can followup if I don't respond to an email - but sadly, it may mean I missed an event during the gap. And if it is someone else I miss, well, that may be a potential new client I miss who then, by word of mouth (my only method of advertising), tells more potential clients I don't answer my email. Too risky for me.
I appreciate the power of regex and the skills of those that have mastered it, and I can certainly see the value in using regex to identify "potential" spam, but coupling that with auto-processing/auto-delete is not something I am willing to accept - that 1 in 10,000 emails may be the 1 I can't afford to miss!
| Quote: | | I wish that someone would come up with a regex filter GUI (or whatever you want to call it) | A "Wizard" would be great!
| Mike R. wrote: | | ...I agree with everything said about FP's and the need for some refined selection rather than automatically deleting anything. If I used email to do a part of my business life, I would never have anything auto-deleted. Too much of a chance a piece of vital email could go missing without me knowing it. If there are dollars associated with that email, woe is me. I'm the only one I want making a decision like that. | Bingo!
| Wizcrafts wrote: | | Note that I have kept the Friends List override on all 4 if only to alert the user that somebody has sent him/her an attachment that is a .exe, or .zip, or .rar. or .gz. There is no action preselected for the two filters dealing with those file types. OTOH. there are two filters that detect truely dangerous attachments and combinations of extensions, and these are marked for (manual) deletion. | I can live with that. _________________
Bill, AFE7Ret
Freedom is NOT Free!
|
|
| Back to top |
|
 |
|
|
|