Distributed speech recognition of Mandarin digits string

Yih-Ru Wang*, Bo Xuan Lu, Yuan Fu Liao, Sin-Horng Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

In this paper, the performance of the pitch detection algorithm in the ETSI ES-202-212 XAFE standard is evaluated on a Mandarin digit string recognition task. Experimental results showed that the performance of the pitch detection algorithm degraded seriously when the SNR of the speech signal was lower than 10 dB. As a result, the recognizer that uses pitch information performs worse than the original recognizer without pitch information in low-SNR environments. A modification of the pitch detection algorithm is therefore proposed to improve pitch detection in low-SNR environments. Recognition performance can be improved at most SNR levels by integrating the recognizers with and without pitch information. Overall recognition rates of 82.1% and 86.8% were achieved for the clean and multi-condition training cases, respectively.

Original language: English
Title of host publication: Chinese Spoken Language Processing - 5th International Symposium, ISCSLP 2006, Proceedings
Pages: 370-379
Number of pages: 10
DOIs: https://doi.org/10.1007/11939993_40
State: Published - 1 Dec 2006
Event: 5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006 - Singapore, Singapore
Duration: 13 Dec 2006 to 16 Dec 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4274 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006
Country: Singapore
City: Singapore
Period: 13/12/06 to 16/12/06

Keywords

  • Distributed speech recognition
  • Extended Advanced Front-end
  • Tonal language speech recognition


Cite this

    Wang, Y-R., Lu, B. X., Liao, Y. F., & Chen, S-H. (2006). Distributed speech recognition of Mandarin digits string. In Chinese Spoken Language Processing - 5th International Symposium, ISCSLP 2006, Proceedings (pp. 370-379). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4274 LNAI). https://doi.org/10.1007/11939993_40