A new approach of using temporal information in mandarin speech recognition

Jyh Her Yang, Yuan Fu Liao, Yih-Ru Wang, Sin-Horng Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, a new approach of using temporal information to assist in Mandarin speech recognition is discussed. It incorporates two types of temporal information into the recognition search. One is a statistical syllable duration model which considers the influences of 411 basesyllables, 5 tones, 4 position-in-word factors, and 3 positionin- sentence factors on syllable duration. Another is the timing information of modeling three types of inter-syllable boundary including intra-word, inter-word without punctuation mark (PM), and inter-word with PM. The uses of these two types of temporal information are expected to be useful for improving the segmentation accuracies in both acoustic decoding and linguistic decoding. Experimental results showed that the base-syllable/character/word recognition rates were slightly improved for both MATBN and Treebank datbase.

Original languageEnglish
Title of host publication3rd International Conference on Speech Prosody 2006
EditorsR. Hoffmann, H. Mixdorff
PublisherInternational Speech Communications Association
ISBN (Electronic)9780000000002
StatePublished - 2006
Event3rd International Conference on Speech Prosody, SP 2006 - Dresden, Germany
Duration: 2 May 20065 May 2006

Publication series

NameProceedings of the International Conference on Speech Prosody
ISSN (Print)2333-2042

Conference

Conference3rd International Conference on Speech Prosody, SP 2006
CountryGermany
CityDresden
Period2/05/065/05/06

Fingerprint Dive into the research topics of 'A new approach of using temporal information in mandarin speech recognition'. Together they form a unique fingerprint.

Cite this