A Variance-mean Based Feature Selection in Text Classification - Details

Author：

Yin, Shen (Yin, Shen.) | Jiang, Zongli (Jiang, Zongli.) (Scholars：蒋宗礼)

Indexed by：

CPCI-S EI Scopus CPCI-SSH

Abstract：

Feature　selection　is　an　important　process　to　choose　a　subset　of　features　relevant　to　a　particular　application　in　text　classification.　Based　on　the　mutual　information　method,　we　designed　variance-mean　based　feature　selection　(VM).　After　computing　and　ranking　the　variance　of　class　discrimination　value　vector　for　each　word,　we　can　choose　the　most　distinguishable　features.　This　method　has　advantages　in　the　case　of　choosing　smaller　number　of　features,　especially　for　classes　with　small　number　of　training　documents.　It　keeps　the　best　features,　and　thus　improves　the　final　performance　of　the　classification　system.　The　experiment　results　indicate　the　effectiveness　of　the　proposed　feature　selection　method　in　a　text　classification.

Keyword：

feature selection text classification variance-mean

Author Community：

[ 1 ] [Yin, Shen]Beijing Univ Technol, Beijing, Peoples R China
[ 2 ] [Jiang, Zongli]Beijing Univ Technol, Beijing, Peoples R China

Reprint Author's Address：

Email：

yinshen135@gmail.com |
jiangzl@bjut.edu.cn

Show more details

Related Keywords：

Feature Selection Based on Neighborhood Systems and Rough Set Theory
2009，2nd International Workshop on Knowledge Discovery Data Mining
Feature selection for tumor classification based on improved SVM-RFE
2007，2nd International Symposium on Intelligence Computation and Application (ISICA 2007)
The Method of Narrow-band Audio Classification based on Universal Noise Background Model
2013，5th International Conference on Machine Vision (ICMV) - Algorithms, Pattern Recognition and Basic Technologies
A Novel Supervised Learning Algorithm for Musical Instrument Classification
2012，35th International Conference on Telecommunications and Signal Processing (TSP)

Source ：

PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL III

Year： 2009

Page： 519-522

Language： English

Cited Count：

WoS CC Cited Count： 3

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

学院待认领

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to