• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Yang, Suhua (Yang, Suhua.) | Yan, Jianzhuo (Yan, Jianzhuo.) | Gao, Chen (Gao, Chen.) | Tan, Guohua (Tan, Guohua.)

收录:

EI Scopus

摘要:

In this paper, a new blogger's interest mining module is proposed, which is based on Chinese text classification. In fact, the problem of the interest mining is transformed into the problem of Chinese text categorization. Before the Chinese text categorization, the text is pre-processed for the text representation. The Chinese text is represented in vector space model and classified by support vector machine classification, while filter algorithm which filters the unrelated interest text is proposed. After the filtering, the text can get it's interest category. Finally the new module has been made use of to carry out an interest mining experiment, and the other experiment which has not filter algorithm is also carried in order to compare with the new module. The two experimental results show that the support vector machine is a effective algorithm, and the comparing data of the two experiments shows that new module make the interest mining more effective. © 2011 Springer-Verlag Berlin Heidelberg.

关键词:

Blogs Classification (of information) Filtration Support vector machines Text processing Vectors Vector spaces

作者机构:

  • [ 1 ] [Yang, Suhua]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 2 ] [Yan, Jianzhuo]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 3 ] [Gao, Chen]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
  • [ 4 ] [Tan, Guohua]College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

ISSN: 1867-5662

年份: 2011

卷: 100

页码: 611-618

语种: 英文

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次: 2

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 3

归属院系:

在线人数/总访问数:1444/2904313
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司