Rich prosodic information exploration on spontaneous Mandarin speech

Cheng Hsien Lin, Chung Long You, Chen Yu Chiang, Yih-Ru Wang, Sin-Horng Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In this paper, rich prosodic information of spontaneous Mandarin speech is explored. The joint prosody labeling and modeling algorithm proposed previously for read speech is extended to spontaneous-speech prosody modeling by additionally considering the modeling of disfluency speech parts. It trains a hierarchical prosodic model and performs prosody labeling from a large speech corpus automatically. Rich prosodic information is then explored via analyzing model parameters and labeling results. By comparing the resulting prosodic model with that of read speech, we find that most affecting patterns, such as F0 contour patterns of 4 tones, have similar shapes or same trends but with much less dynamic ranges. Besides, the prosodic characteristics of various disfluency events, including repetition, restart, repair, contraction, and hesitation, are intensively investigated based on the labeling results. The information explored increases our knowledge about the phonology of spontaneous speech, and should be useful for assisting in ASR.

Original languageEnglish
Title of host publicationProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
EditorsHsin-Min Wang, Qingzhi Hou, Yuan Wei, Tan Lee, Jianguo Wei, Lei Xie, Hui Feng, Jianwu Dang, Jianwu Dang
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509042937
DOIs
StatePublished - 2 May 2017
Event10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 - Tianjin, China
Duration: 17 Oct 201620 Oct 2016

Publication series

NameProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016

Conference

Conference10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
CountryChina
CityTianjin
Period17/10/1620/10/16

Keywords

  • Disfluency event
  • Prosodic information
  • Prosody labeling
  • Prosody modeling
  • Spontaneous Mandarin speech

Fingerprint Dive into the research topics of 'Rich prosodic information exploration on spontaneous Mandarin speech'. Together they form a unique fingerprint.

Cite this