In this paper, the multiplicative syllable duration model previously proposed for Mandarin speech is extended in four respects. First, the three basic Tone 3 patterns (full tone, half tone, and sandhi tone) are modeled with three separate companding factors (CFs) so that their effects are kept distinct. Second, the CFs of the model are analyzed in detail. Third, the syllable duration modeling method is applied to an automatically segmented, 500-speaker telephone-speech database. Fourth, an additive syllable duration model is constructed in parallel for a comparative study.
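The contrast between the multiplicative and additive formulations can be illustrated with a minimal sketch. It assumes that a multiplicative model scales a normalized syllable duration by the product of the CFs of its affecting factors, and that an additive model shifts it by a sum of offsets. The factor names, the function names, and all numeric values below are hypothetical and are not taken from the paper.

```python
from math import prod

# Hypothetical companding factors (CFs); values are illustrative only.
# The three Tone 3 patterns each get their own CF, as the abstract describes.
TONE_CF = {
    "tone3_full": 1.10,
    "tone3_half": 0.95,
    "tone3_sandhi": 1.02,
}
SPEAKER_CF = {"spk_a": 0.90, "spk_b": 1.05}  # hypothetical speaker-level CFs


def multiplicative_duration(normalized_ms, factors):
    """Scale a normalized duration by the product of its CFs."""
    return normalized_ms * prod(factors)


def additive_duration(normalized_ms, offsets_ms):
    """Shift a normalized duration by the sum of its offsets (in ms)."""
    return normalized_ms + sum(offsets_ms)


def normalize(observed_ms, factors):
    """Invert the multiplicative model: divide out the CFs."""
    return observed_ms / prod(factors)


if __name__ == "__main__":
    cfs = [TONE_CF["tone3_half"], SPEAKER_CF["spk_b"]]
    observed = multiplicative_duration(200.0, cfs)
    print(f"observed: {observed:.2f} ms")
    print(f"normalized again: {normalize(observed, cfs):.2f} ms")
```

In this sketch, a half-tone Tone 3 syllable and a full-tone one with the same normalized duration differ only through their tone CF, which is the sense in which separate CFs keep the three patterns apart.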
|Journal||ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings|
|State||Published - 11 Jul 2002|
|Event||2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - Orlando, FL, United States|
Duration: 13 May 2002 → 17 May 2002