A novel syllable duration modeling approach for Mandarin speech

Wen Hsing Lai, Sin-Horng Chen

Research output: Contribution to journalArticle

4 Scopus citations

Abstract

In this paper, a novel syllable duration modeling approach for Mandarin speech is proposed. It explicitly takes several main affecting factors as multiplicative companding parameters and estimates all model parameters by an EM algorithm. Experimental results showed that the variance of the observed syllable duration was greatly reduced from 183.4 frame (1 frame = 5 ms) to 18.5 frame by eliminating effects from these affecting factors. Besides, the estimated companding values of these affecting factors agreed well to our prior linguistic knowledge. A preliminary study of applying the proposed model to predict syllable duration for TTS is also performed. Experimental results showed that it outperformed the conventional regressive prediction method. Lastly, an extension of the approach to incorporate initial and final duration modeling is presented. This leads to a better understanding of the relation between the companding factors of initial and final duration models and those of syllable duration model.

Original languageEnglish
Pages (from-to)93-96
Number of pages4
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
DOIs
StatePublished - 1 Jan 2001

Fingerprint Dive into the research topics of 'A novel syllable duration modeling approach for Mandarin speech'. Together they form a unique fingerprint.

  • Cite this