Video based visual speech feature model construction

Author：

Jia, Xibin (贾熹滨) | Zheng, Meixia

Indexed by：

EI Scopus

Abstract：

This paper presents a solution for constructing a Chinese visual speech feature model based on the hidden Markov model (HMM). We propose and discuss three kinds of representation for visual speech: lip geometrical features, lip motion features, and lip texture features. A model that combines local LBP and global DCT texture information performs better than either feature alone. Likewise, a model that combines local LBP with geometrical information performs better than either single feature. Measured by the viseme recognition rate, the best model is the HMM, which describes the dynamics of speech, coupled with the combined feature describing global and local texture. © (2012) Trans Tech Publications, Switzerland.
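The abstract's combination of local LBP and global DCT texture information can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a grayscale lip region of interest stored as a 2-D array, a basic 8-neighbour LBP histogram, and a small block of low-frequency 2-D DCT coefficients. The function names (`lbp_histogram`, `dct_matrix`, `dct_lowfreq`, `combined_feature`) and the `keep` parameter are hypothetical, chosen only for illustration.

```python
import numpy as np

def lbp_histogram(gray):
    """Local texture: basic 8-neighbour LBP codes over interior pixels,
    returned as a normalised 256-bin histogram."""
    g = np.asarray(gray, dtype=np.float64)
    h, w = g.shape
    center = g[1:-1, 1:-1]
    codes = np.zeros(center.shape, dtype=np.int64)
    # Neighbour offsets, clockwise from top-left; each contributes one bit.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = g[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes += (neighbour >= center).astype(np.int64) << bit
    hist = np.bincount(codes.ravel(), minlength=256).astype(np.float64)
    return hist / hist.sum()

def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m

def dct_lowfreq(gray, keep=4):
    """Global texture: the top-left keep x keep block of 2-D DCT coefficients."""
    g = np.asarray(gray, dtype=np.float64)
    h, w = g.shape
    coeffs = dct_matrix(h) @ g @ dct_matrix(w).T
    return coeffs[:keep, :keep].ravel()

def combined_feature(gray, keep=4):
    """Concatenate the local (LBP) and global (DCT) descriptors for one frame."""
    return np.concatenate([lbp_histogram(gray), dct_lowfreq(gray, keep)])

# Usage: one feature vector per video frame (random data stands in for a lip ROI).
frame = np.random.default_rng(0).random((32, 48))
print(combined_feature(frame).shape)
```

Under these assumptions, the per-frame vectors of an utterance would form the observation sequence passed to an HMM. The record does not state how the paper weights or normalises the two feature types, so plain concatenation here is only a placeholder.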

Keyword：

Speech Textures

Author Community：

[1] [Jia, Xibin] Multimedia and Intelligent Software Technology, Beijing Municipal Key Laboratory, Beijing University of Technology, Beijing, 100124, China
[2] [Zheng, Meixia] Multimedia and Intelligent Software Technology, Beijing Municipal Key Laboratory, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

jiaxibin@bjut.edu.cn


Related Keywords：

Joint LBP and DCT model for visual speech
2012，2012 International Conference on Affective Computing and Intelligent Interaction, ICACII 2012
Influence on emotional impression of voice by changing prosodic features
2009，2009 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2009
Analysis of variation on intra-speakers speech recognition performances
2007，International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007
Automatic utterance segmentation tool for speech corpus
2007，International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007

Source ：

ISSN： 1660-9336

Year： 2012

Volume： 182-183

Page： 1367-1371

Language： English

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

College of Architectural Engineering, Faculty of Urban Construction
