An investigation on the mandarin prosody of a parallel multi-speaking rate speech corpus

Chen Yu Chiang*, Cheng Chang Tang, Hsiu Min Yu, Yih-Ru Wang, Sin-Horng Chen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

In this paper, the prosody of a parallel multispeaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer with various speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. By using the unsupervised joint prosody labeling and modeling (PLM) method proposed previously, the relationship between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and the break labels, are investigated. The analyses reported in this study could be very informative in developing prosody generation mechanism for text-to-speech and prosody modeling for automatic speech recognition in various SRs.

Original languageEnglish
Title of host publication2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009
Pages148-153
Number of pages6
DOIs
StatePublished - 10 Dec 2009
Event2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009 - Urumqi, China
Duration: 10 Aug 200912 Aug 2009

Publication series

Name2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009

Conference

Conference2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009
CountryChina
CityUrumqi
Period10/08/0912/08/09

Fingerprint Dive into the research topics of 'An investigation on the mandarin prosody of a parallel multi-speaking rate speech corpus'. Together they form a unique fingerprint.

  • Cite this

    Chiang, C. Y., Tang, C. C., Yu, H. M., Wang, Y-R., & Chen, S-H. (2009). An investigation on the mandarin prosody of a parallel multi-speaking rate speech corpus. In 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009 (pp. 148-153). [5278360] (2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009). https://doi.org/10.1109/ICSDA.2009.5278360