• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Akhtar, Faheem (Akhtar, Faheem.) | Li, Jianqiang (Li, Jianqiang.) (学者:李建强) | Azeem, Muhammad (Azeem, Muhammad.) | Chen, Shi (Chen, Shi.) | Pan, Hui (Pan, Hui.) | Wang, Qing (Wang, Qing.) | Yang, Ji-Jiang (Yang, Ji-Jiang.)

收录:

EI Scopus SCIE

摘要:

A newborn with a birth weight above the 90th percentile of same gestational age is termed as large for gestational age. Large for gestational age suffers from serious complications during and after the antepartum period because they do not get earlier identification of the disease. Earlier recognition of large for gestational age infants could slow progression and prevent further complication of the disease. In medical science, prevention and mitigation of disease require examination of biochemical indicators. Machine learning has been evolved and envisioned as a tool to predict large for gestational age infants with most deterministic characteristics. This study aims to identify most deterministic biochemical indicators for large for gestational age prediction with minimal computational overhead. To the best of my knowledge, this is the first time a study is carried out to identify the most deterministic risk factors associated with large for gestational age and to develop large for gestational age prediction model using machine learning techniques. To develop an efficient large for gestational age prediction model, we conducted three group of experiments that considered basic machine learning methods; feature selection; and imbalanced data, respectively. Support vector machine, logistic regression, Naive Bayes and Random Forest were trained using tenfold cross-validation on large for gestational age dataset; we selected precision and area under the curve as a performance evaluation metrics; information gain an entropy-based feature selection method was adopted to rank features; we introduced an ensemble data imbalance technique in the last group of experiments. For each group of experiments, support vector machine performed best compared to other machine learning classifiers by producing the highest prediction precision score of 85%. All of the classifiers performed best with thirty ranked features subset, which validates the applied method to recognize the most deterministic risk factors associated with large for gestational age prediction.

关键词:

Data imbalance Ensemble technique Prediction model Feature selection Risk factors Large for gestational age Machine learning

作者机构:

  • [ 1 ] [Akhtar, Faheem]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Li, Jianqiang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Azeem, Muhammad]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Akhtar, Faheem]Sukkur IBA Univ, Dept Comp Sci, Sukkur 65200, Pakistan
  • [ 5 ] [Chen, Shi]Chinese Acad Med Sci & Peking Union Med Coll, Peking Union Med Coll Hosp, Dept Endocrinol, Beijing 100730, Peoples R China
  • [ 6 ] [Pan, Hui]Chinese Acad Med Sci & Peking Union Med Coll, Peking Union Med Coll Hosp, Dept Endocrinol, Beijing 100730, Peoples R China
  • [ 7 ] [Wang, Qing]Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
  • [ 8 ] [Yang, Ji-Jiang]Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

通讯作者信息:

  • [Yang, Ji-Jiang]Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

查看成果更多字段

相关关键词:

来源 :

JOURNAL OF SUPERCOMPUTING

ISSN: 0920-8542

年份: 2020

期: 8

卷: 76

页码: 6219-6237

3 . 3 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:132

被引次数:

WoS核心集被引频次: 15

SCOPUS被引频次: 17

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:642/3901965
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司