Spoken English Intelligibility Remediation with Pocketsphinx Alignment and Feature Extraction Improves Substantially over the State of the Art - Details

Author：

Gao, Yuan (Gao, Yuan.) | Lal Srivastava, Brij Mohan (Lal Srivastava, Brij Mohan.) | Salsman, James (Salsman, James.)

Indexed by：

EI Scopus

Abstract：

W279　use　automatic　speech　recognition　to　assess　spoken　English　learner　pronunciation　based　on　the　authentic　intelligibility　of　the　learners＇　spoken　responses　determined　from　support　vector　machine　(SVM)　classifier　or　deep　learning　neural　network　model　predictions　of　transcription　correctness.　Using　numeric　features　produced　by　PocketSphinx　alignment　mode　and　many　recognition　passes　searching　for　the　substitution　and　deletion　of　each　expected　phoneme　and　insertion　of　unexpected　phonemes　in　sequence,　the　SVM　models　achieve　82%　agreement　with　the　accuracy　of　Amazon　Mechanical　Turk　crowdworker　transcriptions,　up　from　75%　reported　by　multiple　independent　researchers.　Using　such　features　with　SVM　classifier　probability　prediction　models　can　help　computer-aided　pronunciation　teaching　(CAPT)　systems　provide　intelligibility　remediation.　©　2018　IEEE.

Keyword：

Speech intelligibility Computer aided instruction Predictive analytics Alignment Speech recognition Deep learning Deep neural networks Support vector machines Information management

Author Community：

[ 1 ] [Gao, Yuan]Beijing University of Technology, Beijing, China
[ 2 ] [Lal Srivastava, Brij Mohan]International Institute of Information Technology, Hyderabad, India
[ 3 ] [Salsman, James]17zuoye.com, China
[ 4 ] [Salsman, James]TalkNicer.com, LLC, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

DNN-based speech enhancement using MBE model
2018，16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
Speech enhancement via generative adversarial LSTM networks
2018，16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
An ideal wiener filter correction-based cIRM speech enhancement method using deep neural networks with skip connections
2018，14th IEEE International Conference on Signal Processing, ICSP 2018
Speech enhancement based on cepstral mapping and deep neural networks
2018，4th IEEE International Conference on Computer and Communications, ICCC 2018

Source ：

Year： 2018

Page： 924-927

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 7

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 5

Affiliated Colleges：

学院待认领

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to