• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Ji, Sai (Ji, Sai.) | Cheng, Yukun (Cheng, Yukun.) | Tan, Jingjing (Tan, Jingjing.) | Zhao, Zhongrui (Zhao, Zhongrui.)

收录:

Scopus SCIE

摘要:

Correlation clustering problem (CorCP) is a classical clustering problem, which clusters data based on the similarity of data set, and has many applications in interaction networks, cross-lingual link detection, and communication networks, etc. In this paper, we study a practical generalization of the CorCP, called the capacitated correlation clustering problem (the capacitated CorCP), by constructing a labeled complete graph. On this labeled complete graph, each vertex represents a piece of data. If two pieces of data are similar, then the edge between the corresponding vertices is marked by a positive label +. Otherwise, this edge is marked by a negative label -. The objective of the capacitated CorCP is to group some similar data sets into one cluster as far as possible, while satisfying the cluster capacity constraint. To achieve this objective, we shall partition the vertex set of the labeled complete graph into several clusters, each cluster's size subjecting to an upper bound, so as to minimize the number of disagreements. Here the number of disagreements is defined as the total number of the edges with positive labels between clusters and the edges with negative labels within clusters. Different with the previous algorithm in [18], which subjects to the constraint on the cluster size by a penalty measure, we design an algorithm for the capacitated CorCP to directly output a feasible solution by iteratively constructing clusters based on a preset threshold. Through carefully setting the threshold and sophisticatedly analyzing, our algorithm is proved to have an improved approximation ratio of 5.37. In addition, we also conduct a series of numerical experiments to demonstrate the effectiveness of our algorithm.

关键词:

approximation algorithms LP-rounding capacitated correlation clustering Clustering problem

作者机构:

  • [ 1 ] [Ji, Sai]Hebei Univ Technol, Inst Math, Tianjin 300401, Peoples R China
  • [ 2 ] [Cheng, Yukun]Suzhou Univ Sci & Technol, Sch Business, Suzhou 215009, Peoples R China
  • [ 3 ] [Tan, Jingjing]Weifang Univ, Sch Math & Informat Sci, Weifang 261061, Peoples R China
  • [ 4 ] [Zhao, Zhongrui]Beijing Univ Technol, Dept Operat Res & Informat Engn, Beijing 100124, Peoples R China

通讯作者信息:

查看成果更多字段

相关关键词:

来源 :

INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE

ISSN: 0129-0541

年份: 2023

期: 06

卷: 35

页码: 757-774

0 . 8 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:19

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:386/4985831
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司