• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Halim, Zahid (Halim, Zahid.) | Sargana, Hussain Mahmood (Sargana, Hussain Mahmood.) | Aadam (Aadam.) | Uzma (Uzma.) | Waqas, Muhammad (Waqas, Muhammad.)

收录:

EI Scopus SCIE

摘要:

Clustering is an unsupervised learning task that models data as coherent groups. Multiple approaches have been proposed in the past to cluster large volumes of data. Graphs provide a logical mapping of many real-world datasets rich enough to reflect various peculiarities of numerous domains. Apart from k-means, k-medoid, and other well-known clustering algorithms, utilization of random walk-based approaches to cluster data is a prominent area of data mining research. Markov clustering algorithm and limited random walk-based clustering are the prominent techniques that utilize the concept of random walk. The main goal of this work is to address the task of clustering graphs using an efficient random walk-based method. A novel walk approach in a graph is presented here that determines the weight of the edges and the degree of the nodes. This information is utilized by the pseudo-guidance model to guide the random walk procedure. This work introduces the friends-of-friends concept during the random walk process so that the edges? weights are determined utilizing an inclusive criterion. This concept enables a random walk to be initiated from the highest degree node. The random walk continues until the walking agent cannot find any unvisited neighbor(s). The agent walks to its neighbors if it finds a weight of one or more, otherwise the agent?s stopping criteria is met. The nodes visited in this walk form a cluster. Once a walk comes to halt, the visited nodes are removed from the original graph and the next walk starts in the remaining graph. This process continues until all nodes of the graph are traversed. The focus of this work remains random walk-based clustering of graphs. The proposed approach is evaluated using 18 real-world benchmark datasets utilizing six cluster validity indices, namely Davies-Bouldin index (DBI), Dunn index (DI), Silhouette coefficient (SC), Calinski-Harabasz index (CHI), modularity index, and normalized cut. This proposal is compared with seven closely related approaches from the same domain, namely, limited random walk, pairwise clustering, personalized page rank clustering, GAKH (genetic algorithm krill herd) graph clustering, mixing time of random walks, density-based clustering of large probabilistic graphs, and Walktrap. Experiments suggest better performance of this work based on the evaluation metrics.

关键词:

Community detection Random walk Graph clustering Efficient clustering

作者机构:

  • [ 1 ] [Halim, Zahid]Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
  • [ 2 ] [Aadam]Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
  • [ 3 ] [Uzma]Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
  • [ 4 ] [Waqas, Muhammad]Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan
  • [ 5 ] [Sargana, Hussain Mahmood]Khwaja Fareed Univ Engn & Informat Technol, Dept Comp Sci, Ryk, Pakistan
  • [ 6 ] [Waqas, Muhammad]Beijing Univ Technol, Engn Res Ctr Intelligent Percept & Autonomous Con, Fac Informat Technol, Beijing 100124, Peoples R China

通讯作者信息:

  • [Halim, Zahid]Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Machine Intelligence Res Grp MInG, Topi, Pakistan

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

JOURNAL OF COMPUTATIONAL SCIENCE

ISSN: 1877-7503

年份: 2021

卷: 51

3 . 3 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:87

JCR分区:1

被引次数:

WoS核心集被引频次: 13

SCOPUS被引频次: 14

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

归属院系:

在线人数/总访问数:1717/4231570
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司