• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Fu, Weiqi (Fu, Weiqi.) | Liao, Husheng (Liao, Husheng.) (学者:廖湖声) | Jin, Xueyun (Jin, Xueyun.)

收录:

CPCI-S

摘要:

Data mining is used to find useful information from massive data. Frequent pattern mining is one important task of data mining. Recently, the researches on frequent pattern mining for semi-structured data have made some progresses, and it also have a lot of focuses for data stream. However, only a few studies focus on both semi-structured data and data stream. This paper proposes an algorithm named SPrefixTreeISpan. We segment the semi-structured data stream first, and then uses the pattern-growth method to mine each segment. In the end, we maintain all the results on a structure called patternTree. At the same time, the mining algorithm is optimized by the inevitable parent-child relationship and the inevitable child-parent relationship extracted from XML schema. Experiment shows that SPrefixTreeISpan has better performance.

关键词:

frequent pattern mining schema feature semi-structured data stream

作者机构:

  • [ 1 ] [Fu, Weiqi]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
  • [ 2 ] [Liao, Husheng]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
  • [ 3 ] [Jin, Xueyun]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

通讯作者信息:

  • [Fu, Weiqi]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY (FMSMT 2017)

ISSN: 2352-5401

年份: 2017

卷: 130

页码: 1329-1336

语种: 英文

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:817/2978609
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司