Authors:

Yin, Shen | Jiang, Zongli (scholar profile: 蒋宗礼)

Indexed in:

CPCI-S, EI, Scopus, CPCI-SSH

Abstract:

Feature selection is an important step in text classification: it chooses the subset of features relevant to a particular application. Building on the mutual information method, we designed a variance-mean based feature selection method (VM). By computing the variance of each word's class discrimination value vector and ranking words by it, we can choose the most distinguishable features. The method is advantageous when only a small number of features is selected, especially for classes with few training documents. It keeps the best features and thus improves the final performance of the classification system. Experimental results indicate the effectiveness of the proposed feature selection method in text classification.
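The abstract does not state the formulas, so the following is only a rough sketch of the ranking step it describes, not the authors' implementation. It assumes that a word's class discrimination value for class c is the pointwise mutual information log(P(w|c) / P(w)), estimated from document frequencies with add-one smoothing, and that the VM score is the population variance of that vector around its mean. The function name `vm_select` and both of these choices are hypothetical.

```python
import math
from collections import Counter, defaultdict


def vm_select(docs, labels, k):
    """Return the k words whose per-class discrimination values vary most.

    docs   -- list of token lists, one per training document
    labels -- list of class labels, parallel to docs
    Assumption: discrimination value = log(P(w|c) / P(w)), add-one smoothed.
    """
    n_docs = len(docs)
    classes = sorted(set(labels))
    class_count = Counter(labels)      # documents per class
    word_count = Counter()             # documents containing each word
    joint = defaultdict(Counter)       # joint[w][c]: docs of class c containing w

    for tokens, cls in zip(docs, labels):
        for w in set(tokens):          # document frequency, not term frequency
            word_count[w] += 1
            joint[w][cls] += 1

    scores = {}
    for w, n_w in word_count.items():
        p_w = (n_w + 1) / (n_docs + 2)
        vec = []
        for c in classes:
            p_w_given_c = (joint[w][c] + 1) / (class_count[c] + 2)
            vec.append(math.log(p_w_given_c / p_w))
        mean = sum(vec) / len(vec)
        # Variance of the discrimination vector around its mean (assumed VM score)
        scores[w] = sum((v - mean) ** 2 for v in vec) / len(vec)

    # Highest variance first; ties broken alphabetically for determinism
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:k]
```

Under these assumptions, a word that occurs equally often in every class gets a flat discrimination vector and a variance of zero, so it falls to the bottom of the ranking, while a word concentrated in one class ranks near the top.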

Keywords:

feature selection; text classification; variance-mean

Author affiliations:

  • [ 1 ] [Yin, Shen]Beijing Univ Technol, Beijing, Peoples R China
  • [ 2 ] [Jiang, Zongli]Beijing Univ Technol, Beijing, Peoples R China

Source:

PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL III

Year: 2009

Pages: 519-522

Language: English

Citation counts:

Web of Science Core Collection citations: 3

Scopus citations: 4

ESI Highly Cited Papers listings: 0
