Share Email Print
cover

Proceedings Paper

New anti-spam filter based on data mining and analysis of email security
Author(s): Yu Wu; Zhijun Li; Ping Luo; Guoyin Wang
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

One main technical means of anti-Spam is to build filters in email transfer route. However, the design of many junk mail filters hasn't made use of the whole security information in an email, which exists mostly in mail header rather than in the text and accessory. In this paper, data mining based on rough sets is introduced to design a new anti-Spam filter. Firstly, by recording and analyzing the header of every collected email sample, we get all necessary original raw data. Next, by selecting and computing features from the original header data, we obtain our decision table including several condition attributes and one decision attribute. Then, a data mining technique based on rough sets, which mainly includes relative reduction and rule generation, is introduced to mine this decision table. And we obtain some useful anti-Spam knowledge from all the email headers. Finally, we have made tests by using our rules to judge different mails. Tests demonstrate that when mining on selected baleful email corpus with specific Spam rate, our anti-Spam filter has high efficiency and high identification rate. By mining email headers, we can find potential security problems of some email systems and cheating methods of Spam senders.

Paper Details

Date Published: 21 March 2003
PDF: 8 pages
Proc. SPIE 5098, Data Mining and Knowledge Discovery: Theory, Tools, and Technology V, (21 March 2003); doi: 10.1117/12.484894
Show Author Affiliations
Yu Wu, Chongqing Univ. of Posts and Telecommunications (China)
Zhijun Li, Chongqing Univ. of Posts and Telecommunications (China)
Ping Luo, Chongqing Univ. of Posts and Telecommunications (China)
Guoyin Wang, Chongqing Univ. of Posts and Telecommunications (China)


Published in SPIE Proceedings Vol. 5098:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology V
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top