
Bayesian Anti-Spam Filter/Word Classification (Statistical) Class

Submitted on: 1/7/2015 10:09:00 AM
By: Luis Cantero (from psc cd)  
Level: Intermediate
User Rating: By 3 Users
Compatibility: ASP (Active Server Pages), VbScript (browser/client side)
Views: 2256
 
     Bayesian filtering (a Bayes filter) is one of the most effective and promising ways to filter spam, or any other content you want to separate into wanted and unwanted. I wrote this class based on some ideas I found on the internet, plus my own tweaks and tests. It should be very easy to port to other languages, and it can be used for spam filtering, or on bulletin boards and chat rooms to flag unwanted (or wanted) posts.

     The class needs to be trained with "good/ham" (wanted) and "bad/spam" (unwanted) posts. After enough training it can calculate the probability that a text is spam/bad. It also has an auto-learn feature and database support. I've included an Access database with a form that can also be used to train the program.

     There are many uses for it. For example, it can be used on PSC to flag unwanted posts, or in RAC to flag posts that are against the guidelines. It could also be used by law enforcement to search chat rooms for pedophiles (child molesters) and the like.

     I've tested many algorithms in order to improve the code for speed, accuracy, efficiency and reliability. If you have any suggestions or improvements, please let me know. Also, please vote if you like it :)
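     To make the train-then-score idea concrete, here is a minimal sketch in Python. It is not the class from this submission, which is written for ASP/VBScript and is not shown on this page. It assumes a plain naive Bayes model with add-one smoothing, and every name in it (BayesSketch, train, spam_probability, auto_learn) and both auto-learn thresholds are hypothetical choices made for illustration only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class BayesSketch:
    """Hypothetical word classifier; NOT the class shipped with this submission."""

    def __init__(self):
        self.words = {"spam": Counter(), "ham": Counter()}  # word counts per label
        self.docs = {"spam": 0, "ham": 0}                   # trained texts per label

    def train(self, text, label):
        """Record the words of one text under 'spam' or 'ham'."""
        self.words[label].update(tokenize(text))
        self.docs[label] += 1

    def spam_probability(self, text):
        """Return the estimated probability (0..1) that the text is spam."""
        tokens = tokenize(text)
        vocab = len(set(self.words["spam"]) | set(self.words["ham"]))
        total_docs = self.docs["spam"] + self.docs["ham"]
        log_score = {}
        for label in ("spam", "ham"):
            # Smoothed prior, so an untrained filter answers 0.5.
            score = math.log((self.docs[label] + 1) / (total_docs + 2))
            total = sum(self.words[label].values())
            for word in tokens:
                # Add-one smoothing keeps unseen words from zeroing the score.
                count = self.words[label][word]
                score += math.log((count + 1) / (total + vocab + 1))
            log_score[label] = score
        # Turn the two log scores into one probability; clamp to avoid overflow.
        diff = max(-700.0, min(700.0, log_score["ham"] - log_score["spam"]))
        return 1.0 / (1.0 + math.exp(diff))

    def auto_learn(self, text, high=0.9, low=0.1):
        """Assumed auto-learn rule: retrain only on very confident verdicts."""
        p = self.spam_probability(text)
        if p >= high:
            self.train(text, "spam")
        elif p <= low:
            self.train(text, "ham")
        return p


f = BayesSketch()
f.train("buy cheap pills now", "spam")
f.train("cheap pills online", "spam")
f.train("meeting at noon tomorrow", "ham")
f.train("lunch tomorrow with the team", "ham")

p_spam = f.spam_probability("cheap pills")       # leans towards spam
p_ham = f.spam_probability("meeting tomorrow")   # leans towards ham
print(p_spam > 0.5, p_ham < 0.5)
```

     Working with sums of logarithms instead of multiplying raw probabilities keeps long texts from underflowing to zero. The thresholds in auto_learn are deliberately strict so that the filter only teaches itself from texts it is already very sure about.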

 

