Content-Based Approach for Trfcking Concept Drift in Email Spam Filtering

Zi Hayat ,  Morteza; Basiri ,  Javad; Seyedhossein,  Leila; Shakery ,  Azadeh

Volume 2, Issue 3 (9-2010) itrc 2010, 2(3): 59-65 | Back to browse issues page

Mendeley

Zotero

RefWorks

Zi Hayat M, Basiri J, Seyedhossein L, Shakery A. Content-Based Approach for Trfcking Concept Drift in Email Spam Filtering . itrc 2010; 2 (3) :59-65
URL: http://ijict.itrc.ac.ir/article-1-258-en.html

Content-Based Approach for Trfcking Concept Drift in Email Spam Filtering

Morteza Zi Hayat

¹, Javad Basiri²

, Leila Seyedhossein²

, Azadeh Shakery²

1- School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
2- School of Electrical and Computer Engineering University of Tehran, Tehran, Iran

Abstract: (2627 Views)

The continued growth of Email usage, which is naturally followed by an increase i unsolicited emails so called spams, motivates research in spam filtering area. In the context of spam filtering systems, addressing th evolving nature of spams, which leads to obsolete the related models, has been always a challenge. In this paper an adaptive spam filtering system based on language model is proposed which can detect concept drift based on computing the deviation in email contents distribution. The proposed method can be used a ong with any existing classifier; particularly in this paper we use Naive Bayes method as classifier. The proposed method has been evaluated with Enron data set. The results indicate the efficiency of the method in detectin concept drift and its superiority over Naive Bayes classifier in terms of accuracy.

Keywords: component, spam.filtering, concept drift, KL divergence, language model

Full-Text [PDF 1087 kb] (701 Downloads)

Type of Study: Research | Subject: Information Technology

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Principal Contact