A new model-based prosody coder for mandarin speech

Chen Yu Chiang, Yu Ping Hung, Sin-Horng Chen, Yih-Ru Wang

研究成果: Conference contribution

摘要

In this paper, a novel parametric prosody coding approach for Mandarin speech is proposed. It employs a hierarchical prosodic model (HPM) as a prosody generating model in the encoder to analyze the speech prosody of the input utterance to obtain a parametric representation of four prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture pause duration for encoding. In the decoder, the four prosodic-acoustic features are reconstructed by a synthesis operation using the decoded HPM parameters. The reconstructed prosodic features are lastly used in an HMM-based speech synthesizer to help to generate the reconstructed speech. Experimental results show that the reconstructed speech has good quality at low data rates of 114.9 bits/s for a speaker-dependent task. An informal listening test confirmed decoded speeches sounded very fluently.

原文English
主出版物標題Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013
發行者IEEE Computer Society
頁面60-63
頁數4
ISBN(列印)9780769551203
DOIs
出版狀態Published - 1 一月 2013
事件9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013 - Beijing, China
持續時間: 16 十月 201318 十月 2013

出版系列

名字Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013

Conference

Conference9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013
國家China
城市Beijing
期間16/10/1318/10/13

指紋 深入研究「A new model-based prosody coder for mandarin speech」主題。共同形成了獨特的指紋。

  • 引用此

    Chiang, C. Y., Hung, Y. P., Chen, S-H., & Wang, Y-R. (2013). A new model-based prosody coder for mandarin speech. 於 Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013 (頁 60-63). [6846580] (Proceedings - 2013 9th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013). IEEE Computer Society. https://doi.org/10.1109/IIH-MSP.2013.24