• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Mao, Guojun (Mao, Guojun.) | Gao, Mingxia (Gao, Mingxia.) | Yao, Wenji (Yao, Wenji.)

收录:

CPCI-S

摘要:

This paper proposes an algorithm for clustering XML data stream using sliding window. It is a dynamic clustering algorithm based on XML structure. Firstly, we use level structure to represent XML document, which is based on temporal clustering feature. This structure is suitable for extracting information from XML document structure and calculating similarity between XML documents. Secondly, we use the sliding window technique, which adopts exponential histogram of XML cluster feature as a micro-cluster of it. By using the model, we can dynamically accept the new data and get rid of the old data thereby getting a better distribution feature of the current window. Finally, the experimental results based on real and synthetic XML datasets show that our algorithm not only achieves the real-time requirements of the online clustering, but also gains better clustering quality and faster processing speed.

关键词:

XML data stream sliding window

作者机构:

  • [ 1 ] [Mao, Guojun]Cent Univ Finance & Econ, Coll Informat, Beijing, Peoples R China
  • [ 2 ] [Gao, Mingxia]Beijing Univ Technol, Sch Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Yao, Wenji]Beijing Univ Technol, Sch Comp Sci, Beijing, Peoples R China

通讯作者信息:

  • [Mao, Guojun]Cent Univ Finance & Econ, Coll Informat, Beijing, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

DBKDA 2011: THE THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS

年份: 2011

页码: 96-101

语种: 英文

被引次数:

WoS核心集被引频次: 1

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

在线人数/总访问数:585/3901902
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司