Telecom fraud identification based on ADASYN and random forest - Details

Author：

Lu, Chaol (Lu, Chaol.) | Lin, Shaofu (Lin, Shaofu.) | Liu, Xiliang (Liu, Xiliang.) | Shi, Hui (Shi, Hui.)

Indexed by：

EI Scopus

Abstract：

With　the　development　of　information　and　communication　technology,　the　situation　of　communication　frauds　is　becoming　more　and　more　serious,　how　to　identify　fraudulent　telephone　accurately　and　effectively　has　become　an　urgent　task　in　telecom　operation　at　present.　Affected　by　the　power　law　distribution,　existing　machine　learning　methods　are　used　to　identify　the　unbalanced　distribution　data　set　of　positive　and　negative　samples　with　low　recognition　accuracy.　This　paper　proposes　ADASYN+RF　model.　First　of　all,　for　the　problem　of　unbalanced　data　sets,　this　paper　chooses　the　ADASYN(Adaptive　Synthetic　Sampling)　algorithm　to　rebalance　the　original　data　set.　Secondly,　we　choose　the　random　forest　algorithm　is　employed　to　train　the　new　data　set　to　avoid　overfitting.　Finally,　two　groups　of　comparative　experiments　are　carried　out　respectively,　and　the　results　show　that:　(1)　For　the　processing　of　biased　data,　the　ADASYN　algorithm　used　in　this　paper　is　more　advantageous　than　the　traditional　SMOTE　algorithm;(2)　Compared　with　Nonintegrated　learning　model,　the　accuracy,　recall　rate　and　F1　value　of　the　ADASYN+RF　model　are　significantly　improved.　©　2020　IEEE.

Keyword：

Decision trees Random forests Data handling Distributed database systems Learning systems

Author Community：

[ 1 ] [Lu, Chaol]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 2 ] [Lin, Shaofu]Beijing University of Technology, Beijing Institute of Smart City, Beijing, China
[ 3 ] [Liu, Xiliang]Beijing University of Technology, Beijing Institute of Smart City, Beijing, China
[ 4 ] [Shi, Hui]Beijing University of Technology, Beijing Advanced Innovation Center for Future Internet Technology, Beijing, China

Reprint Author's Address：

[liu, xiliang]beijing university of technology, beijing institute of smart city, beijing, china

Email：

liuxl@bjut.edu.cn

Show more details

Related Keywords：

Research on motion recognition algorithm based on accelerometer
2017，2017 International Conference on Computer Systems, Electronics and Control, ICCSEC 2017
A High-Dimensional and Multi-granularity Feature Selection Method Based on CNN and RF
2020，15th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, ICNC-FSKD 2019, co-located with the 5th International Conference on Harmony Search, Soft Computing and Applications, ICHSA 2019
Industrial intelligent forecast of TFT-LCD based on R-SVM
2019，2019 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology, IAICT 2019
Dioxin Emission Concentration Forecasting Model for MSWI Process with Random Forest-Based Transfer Learning
2020，39th Chinese Control Conference, CCC 2020

Source ：

Year： 2020

Page： 447-452

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 32

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to