Author:

Zheng MeiXia | Jia XiBin (贾熹滨)

Indexed by:

CPCI-S

Abstract:

The paper aims to establish an effective feature representation of visual speech for Chinese viseme recognition. We propose and discuss a representation model of visual speech based on the local binary pattern (LBP) and the discrete cosine transform (DCT) of mouth images. The joint model combines local and global texture information and performs better than the global feature alone. LBP and DCT features are computed for each mouth frame captured while the subject is speaking; a Hidden Markov Model (HMM) is then trained on the training dataset and used to recognize new visual speech. The experiments show that this visual speech feature model performs well in classifying the different speaking states.
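
The per-frame feature described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the basic 3x3 LBP variant, the 256-bin normalized histogram, the orthonormal DCT-II, the number of low-frequency coefficients kept (`n_dct`), and the function names. The HMM stage is omitted; the resulting vectors would be the per-frame observations passed to whatever HMM training the authors used.

```python
import math

def lbp_histogram(img):
    """Normalized 256-bin histogram of basic 3x3 LBP codes (interior pixels only)."""
    h, w = len(img), len(img[0])
    # Neighbour offsets, clockwise from the top-left pixel (assumed ordering).
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            centre = img[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright as the centre.
                if img[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            hist[code] += 1
    total = sum(hist) or 1
    return [v / total for v in hist]

def dct1(x):
    """Orthonormal 1D DCT-II of a sequence."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out

def dct2(img):
    """Separable 2D DCT-II: transform the rows, then the columns."""
    rows = [dct1(row) for row in img]
    cols = [dct1(list(col)) for col in zip(*rows)]
    return [list(r) for r in zip(*cols)]

def frame_features(img, n_dct=3):
    """Concatenate the local (LBP) and global (low-frequency DCT) descriptors."""
    coeffs = dct2(img)
    low = [coeffs[u][v] for u in range(n_dct) for v in range(n_dct)]
    return lbp_histogram(img) + low

# Usage: a hypothetical 4x4 grayscale mouth patch.
patch = [[10, 12, 11, 10],
         [13, 40, 42, 12],
         [12, 41, 43, 11],
         [10, 11, 12, 10]]
features = frame_features(patch)
print(len(features))
```

The LBP histogram captures local texture around each pixel, while the low-frequency DCT block summarizes the overall appearance of the mouth region, which is the local/global combination the abstract refers to.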

Keyword:

Visual speech feature; HMM; LBP; DCT

Author Community:

  • [ 1 ] [Zheng MeiXia] Beijing Univ Technol, Beijing, Peoples R China
  • [ 2 ] [Jia XiBin] Beijing Univ Technol, Beijing, Peoples R China

Reprint Author's Address:

  • [Zheng MeiXia] Beijing Univ Technol, Beijing, Peoples R China

Source:

AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION

ISSN: 1867-5662

Year: 2012

Volume: 137

Page: 101-107

Language: English

Cited Count:

WoS CC Cited Count: 1

ESI Highly Cited Papers on the List: 0