• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Xu, Peng (Xu, Peng.) | Huang, Yongye (Huang, Yongye.) | Yuan, Tongtong (Yuan, Tongtong.) | Xiang, Tao (Xiang, Tao.) | Hospedales, Timothy M. (Hospedales, Timothy M..) | Song, Yi-Zhe (Song, Yi-Zhe.) | Wang, Liang (Wang, Liang.)

Indexed by:

EI Scopus SCIE

Abstract:

In this paper, we focus on learning semantic representations for large-scale highly abstract sketches that were produced by the practical sketch-based application rather than the excessively well dawn sketches obtained by crowd-sourcing. We propose a dual-branch CNN-RNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we further explore learning the sketch-oriented semantic representations in two practical settings, i.e., hashing retrieval and zero-shot recognition on million-scale highly abstract sketches produced by practical online interactions. Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to further accommodate both the abstract and messy traits of sketches. (ii) We propose a deep embedding model for sketch zero-shot recognition, via collecting a large-scale edge-map dataset and proposing to extract a set of semantic vectors from edge-maps as the semantic knowledge for sketch zero-shot domain alignment. Both deep models are evaluated by comprehensive experiments on million-scale abstract sketches produced by a global online game QuickDraw and outperform state-of-the-art competitors.

Keyword:

edge-map dataset retrieval Feature extraction Quantization (signal) semantic representation Speech recognition Semantics hashing Practical sketch-based application Task analysis Visualization zero-shot recognition Games

Author Community:

  • [ 1 ] [Xu, Peng]Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
  • [ 2 ] [Huang, Yongye]ByteDance, Shenzhen 518000, Peoples R China
  • [ 3 ] [Yuan, Tongtong]Beijing Univ Technol, Informat Technol Sch, Beijing 100124, Peoples R China
  • [ 4 ] [Xiang, Tao]Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, Surrey, England
  • [ 5 ] [Song, Yi-Zhe]Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, Surrey, England
  • [ 6 ] [Hospedales, Timothy M.]Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
  • [ 7 ] [Wang, Liang]Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

Reprint Author's Address:

  • [Xu, Peng]Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

Show more details

Related Keywords:

Related Article:

Source :

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

Year: 2021

Issue: 9

Volume: 31

Page: 3366-3379

8 . 4 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:87

JCR Journal Grade:1

Cited Count:

WoS CC Cited Count: 9

SCOPUS Cited Count: 12

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Affiliated Colleges:

Online/Total:347/5775088
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.