Feature extraction of text classification based on word clustering - Details

Author：

Jiang, Zong-Li (Jiang, Zong-Li.) (Scholars：蒋宗礼) | Xu, Xue-Ke (Xu, Xue-Ke.) | Li, Shuai (Li, Shuai.)

Indexed by：

EI Scopus PKU CSCD

Abstract：

Feature　extraction　is　essential　for　text　classification.　In　this　paper　we　discussed　the　basic　ideas　behind　word-clustering-based　feature　extraction.　Then　a　text　classification　method　for　feature　extraction　by　the　means　of　words　clustering　was　presented.　It　employed　an　improved　tree-structured　growing　self-organization　map　(TGSOM)　to　carry　out　word　clustering.　Also　a　new　formula　for　calculating　weights　was　developed　by　taking　account　of　the　distinction　between　clustered　word　features　and　plain　word　features.　Finally,　the　SPRINT　decision　tree　was　applied　to　complete　the　text　classification.　Experiments　showed　that　the　precision　of　text　classification　using　the　proposed　method　is　improved　by　4.32%.

Keyword：

Decision trees Classification (of information) Text processing Feature extraction Extraction

Author Community：

[ 1 ] [Jiang, Zong-Li]College of Computer Science, Beijing University of Technology, Beijing 100022, China
[ 2 ] [Xu, Xue-Ke]College of Computer Science, Beijing University of Technology, Beijing 100022, China
[ 3 ] [Li, Shuai]Department of Electric Engineering, Tsinghua University, Beijing 100084, China

Reprint Author's Address：

Email：

jiangzl@bjut.edu.cn

Show more details

Related Keywords：

Multi domain fusion feature extraction and classification of ECG based on PCA-ICA
2020，4th IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2020
An one-class classification approach to detecting porn image
2012，27th Image and Vision Computing New Zealand Conference, IVCNZ 2012
Driver fatigue detection based on AdaBoost global features
2009，Journal of Computational Information Systems
Handling over-fitting in test cost-sensitive decision tree learning by feature selection, smoothing and pruning
2010，Journal of Systems and Software

Source ：

Journal of Harbin Engineering University

ISSN： 1006-7043

Year： 2008

Issue： 11

Volume： 29

Page： 1205-1209

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部计算机学院

Get Fulltext

Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to