Computer Science and Information Systems
The international journal published by ComSIS Consortium 

FP-Outlier: Frequent Pattern Based Outlier Detection

 

UDC 681.5

Zengyou He, Xiaofei Xu,
Department of Computer Science and Engineering, Harbin Institute of Technology, P.R. China
Joshua Zhexue Huang 2 ,
E-Business Technology Institute, The University of Hong Kong, Pokfulam, Hong Kong, P.R.China
Shengchun Deng
Department of Computer Science and Engineering, Harbin Institute of Technology, P. R. China


Abstract. An outlier in a dataset is an observation or a point that is considerably dissimilar to or inconsistent with the remainder of the data. Detection of such outliers is important for many applications and has recently attracted much attention in the data mining research community. In this paper, we present a new method to detect outliers by discovering frequent patterns (or frequent itemsets) from the data set. The outliers are defined as the data transactions that contain less frequent patterns in their itemsets. We define a measure called FPOF (Frequent Pattern Outlier Factor) to detect the outlier transactions and propose the FindFPOF algorithm to discover outliers. The experimental results have shown that our approach outperformed the existing methods on identifying interesting outliers.





Volume 02 , Issue 01 (June 2005) table of contents
Year of Publication: 2005
ISSN:1820-0214
Publisher ComSIS Consortium
Full text available: Pdf
 
Home 
ComSIS Consortium
Aims and Scope 
Editorial Board
Editorial Council
Managing Board
Information for Contributors
Copyright Transfer Form
Current Issue
Archive
Forthcoming Articles
Subscription
Contact Info