Consistency analysis of the duration parameter within a syllable for Mandarin speech

Cheng Yu Yeh*, Kuan Lin Chen, Shaw-Hwa Hwang

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

Abstract

This work studies Mandarin speech, focusing on consistency analysis of the duration parameter within a syllable. This consistency, identified by inspecting the human pronunciation process, can be interpreted as a high correlation between the warping curves of the spectrum and the prosody within a syllable. The consistency analysis proceeds in three steps. First, a hidden Markov model (HMM) algorithm decodes the HMM-state sequence within a syllable and, at the same time, divides it into three segments. Second, for a designated syllable, vector quantization (VQ) with the Linde-Buzo-Gray algorithm is used to train a VQ codebook for each segment. Third, the duration vector of each segment is encoded as an index by the VQ codebooks, and the probability of each possible path is then evaluated as the basis for analyzing the consistency. Experiments demonstrate that consistency is clearly obtained when the syllable is located in exactly the same word. These results point to a research direction: the time-warping process within a syllable must be considered in a text-to-speech (TTS) system to improve the quality of synthesized speech.
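
The second and third steps can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the per-segment duration vectors have already been obtained from the HMM state alignment of the first step, and it interprets a "path" as the sequence of three codeword indices for one syllable token, which is a reading the abstract does not confirm. The function names (`nearest`, `centroid`, `lbg`, `path_probabilities`), the codebook size, and the splitting parameter `eps` are all hypothetical choices made for illustration.

```python
from collections import Counter


def nearest(vec, codebook):
    """Return the index of the codeword closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(vec, codebook[i])),
    )


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def lbg(vectors, size, eps=0.01, iters=20):
    """Train a codebook by Linde-Buzo-Gray splitting.

    Assumes `size` is a power of two and all durations are positive,
    so the multiplicative split always yields two distinct codewords.
    """
    codebook = [centroid(vectors)]
    while len(codebook) < size:
        # Split every codeword into a slightly perturbed pair.
        codebook = [[c * (1 + s) for c in cw] for cw in codebook for s in (eps, -eps)]
        # Refine with a fixed number of Lloyd iterations.
        for _ in range(iters):
            cells = [[] for _ in codebook]
            for v in vectors:
                cells[nearest(v, codebook)].append(v)
            # Keep the old codeword if its cell is empty.
            codebook = [centroid(cell) if cell else cw for cell, cw in zip(cells, codebook)]
    return codebook


def path_probabilities(tokens, size=2):
    """Estimate the relative frequency of each index path.

    tokens: list of syllable tokens; each token is a list of three
    duration vectors, one per segment (hypothetical input layout).
    """
    codebooks = [lbg([t[s] for t in tokens], size) for s in range(3)]
    paths = [tuple(nearest(t[s], codebooks[s]) for s in range(3)) for t in tokens]
    counts = Counter(paths)
    return {path: count / len(paths) for path, count in counts.items()}
```

Under this reading, a probability mass concentrated on a small number of paths would indicate that the segment durations of a syllable vary together, which is one way to quantify the consistency described above.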

Original language: English
Pages (from-to): 124-130
Number of pages: 7
Journal: Information Technology and Control
Volume: 42
Issue number: 2
State: Published - 19 Jun 2013

Keywords

  • Consistency analysis
  • Hidden Markov model (HMM)
  • Speech synthesis
  • Text-to-speech (TTS)
  • Vector quantization (VQ)
