• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Cui, Lishan (Cui, Lishan.) | Zhang, Xiuzhen (Zhang, Xiuzhen.) | Qin, A. K. (Qin, A. K..) | Sellis, Timos (Sellis, Timos.) | Wu, Lifang (Wu, Lifang.) (学者:毋立芳)

收录:

EI Scopus SCIE

摘要:

Individuals use Twitter for personal communication, whereas businesses, politicians and celebrities use Twitter for branding purposes. Distinguishing Personal from Branding Twitter accounts is important for Twitter analytics. Existing studies of Twitter account classification apply classical supervised learning, which requires intensive manual annotation for training. In this paper, we propose CDS (Collaborative Distant Supervision), a novel learning scheme for Twitter account classification that does not require intensive manual labelling. Twitter accounts are automatically labelled using heuristics for distant supervision learning. To achieve effective learning from heuristic labels, active learning is applied to identify and correct false positive labels, and semi-supervised learning is applied to further use false negatives missed by labelling heuristics for learning. Extensive experiments on Twitter data showed that CDS achieved high classification accuracy. (C) 2017 Elsevier Ltd. All rights reserved.

关键词:

Active learning Classification Distant supervision Semi-supervised learning Twitter

作者机构:

  • [ 1 ] [Cui, Lishan]RMIT Univ, Melbourne, Vic, Australia
  • [ 2 ] [Zhang, Xiuzhen]RMIT Univ, Melbourne, Vic, Australia
  • [ 3 ] [Qin, A. K.]Swinburne Univ Technol, Melbourne, Vic, Australia
  • [ 4 ] [Sellis, Timos]Swinburne Univ Technol, Melbourne, Vic, Australia
  • [ 5 ] [Wu, Lifang]Beijing Univ Technol, Beijing, Peoples R China

通讯作者信息:

  • [Zhang, Xiuzhen]RMIT Univ, Melbourne, Vic, Australia

查看成果更多字段

相关关键词:

来源 :

EXPERT SYSTEMS WITH APPLICATIONS

ISSN: 0957-4174

年份: 2017

卷: 83

页码: 94-103

8 . 5 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:92

中科院分区:1

被引次数:

WoS核心集被引频次: 9

SCOPUS被引频次: 9

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:2639/2962593
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司