• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zhu, Cui (Zhu, Cui.) | Kitagawa, Hiroyuki (Kitagawa, Hiroyuki.) | Papadimitriou, Spiros (Papadimitriou, Spiros.) | Faloutsos, Christos (Faloutsos, Christos.)

收录:

SCIE

摘要:

Outlier detection is a useful technique in such areas as fraud detection, financial analysis and health monitoring. Many recent approaches detect outliers according to reasonable, pre-defined concepts of an outlier (e.g., distance-based, density-based, etc.). However, the definition of an outlier differs between users or even datasets. This paper presents a solution to this problem by including input from the users. Our OBE (Outlier By Example) system is the first that allows users to provide examples of outliers in low-dimensional datasets. By incorporating a small number of such examples, OBE can successfully develop an algorithm by which to identify further outliers based on their outlierness. Several algorithmic challenges and engineering decisions must be addressed in building such a system. We describe the key design decisions and algorithms in this paper. In order to interact with users having different degrees of domain knowledge, we develop two detection schemes: OBE-Fraction and OBE-RF. Our experiments on both real and synthetic datasets demonstrate that OBE can discover values that a user would consider outliers.

关键词:

Data mining Machine learning Outlier detection Outlier example

作者机构:

  • [ 1 ] [Zhu, Cui]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 2 ] [Kitagawa, Hiroyuki]Univ Tsukuba, Grad Sch Syst & Informat Engn, Ctr Computat Sci, Tsukuba, Ibaraki 3058577, Japan
  • [ 3 ] [Papadimitriou, Spiros]IBM T J Watson, Hawthorne, NY USA
  • [ 4 ] [Faloutsos, Christos]Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

通讯作者信息:

  • [Zhu, Cui]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS

ISSN: 0925-9902

年份: 2011

期: 2

卷: 36

页码: 217-247

3 . 4 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:156

JCR分区:3

中科院分区:4

被引次数:

WoS核心集被引频次: 9

SCOPUS被引频次: 9

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

在线人数/总访问数:2684/2935179
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司